rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-07-06 13:40:37 +00:00

Author	SHA1	Message	Date
Arseny Sher	9fe4548e13	Reimplement explicit timeline creation on safekeepers. With the ability to pass commit_lsn. This allows to perform project WAL recovery through different (from the original) set of safekeepers (or under different ttid) by 1) moving WAL files to s3 under proper ttid; 2) explicitly creating timeline on safekeepers, setting commit_lsn to the latest point; 3) putting the lastest .parital file to the timeline directory on safekeepers, if desired. Extend test_s3_wal_replay to exersise this behaviour. Also extends timeline_status endpoint to return postgres information.	2022-10-13 21:43:10 +04:00
Heikki Linnakangas	14c623b254	Make it possible to build with old cargo version. I'm using the Rust compiler and cargo versions from Debian packages, but the latest available cargo Debian package is quite old, version 1.57. The 'named-profiles' features was not stabilized at that version yet, so ever since commit `a463749f5`, I've had to manually add this line to the Cargo.toml file to compile. I've been wishing that someone would update the cargo Debian package, but it doesn't seem to be happening any time soon. This doesn't seem to bother anyone else but me, but it shouldn't hurt anyone else either. If there was a good reason, I could install a newer cargo version with 'rustup', but if all we need is this one line in Cargo.toml, I'd prefer to continue using the Debian packages.	2022-10-13 15:17:00 +03:00
Alexander Bayandin	ebf54b0de0	Nightly Benchmarks: Add 50 GB projects (#2612 )	2022-10-13 10:00:29 +01:00
Andrés	09dda35dac	Return broken tenants due to non existing timelines dir (#2552 ) (#2575 ) Co-authored-by: andres <andres.rodriguez@outlook.es>	2022-10-12 22:28:39 +03:00
Dmitry Ivanov	6ace79345d	[proxy] Add more context to console requests logging (#2583 )	2022-10-12 21:00:44 +03:00
danieltprice	771e61425e	Update release-pr.md (#2600 ) Update the Release Notes PR example that is referenced from the checklist. The Release Notes file structure changed recently.	2022-10-12 08:38:28 -03:00
Alexander Bayandin	93775f6ca7	GitHub Actions: replace deprecated set-output with GITHUB_OUTPUT (#2608 )	2022-10-12 10:22:24 +01:00
Arseny Sher	6d0dacc4ce	Recreate timeline on pageserver in s3_wal_replay test. That's closer to real usage than switching to brand new pageserver.	2022-10-12 11:46:21 +04:00
Heikki Linnakangas	e5e40a31f4	Clean up terms "delete timeline" and "detach tenant". You cannot attach/detach an individual timeline, attach/detach always applies to the whole tenant. However, you can delete a single timeline from a tenant. Fix some comments and error messages that confused these two operations.	2022-10-11 17:47:41 +03:00
Heikki Linnakangas	676c63c329	Improve comments.	2022-10-11 17:47:41 +03:00
Heikki Linnakangas	47366522a8	Make the return type of 'list_timelines' simpler. It's enough to return just the Timeline references. You can get the timeline's ID easily from Timeline.	2022-10-11 17:47:41 +03:00
Heikki Linnakangas	db26bc49cc	Remove obsolete FIXME comment. Commit `c634cb1d36` removed the trait and changed the function to return a &TimelineWriter, as the FIXME said we should do, but forgot to remove the FIXME.	2022-10-11 17:47:41 +03:00
Lassi Pölönen	e520293090	Add build info metric to pageserver, safekeeper and proxy (#2596 ) * Test that we emit build info metric for pageserver, safekeeper and proxy with some non-zero length revision label * Emit libmetrics_build_info on startup of pageserver, safekeeper and proxy with label "revision" which tells the git revision.	2022-10-11 09:54:32 +03:00
Sergey Melnikov	241e549757	Switch neon-stress etcd to dedicatd instance (#2602 )	2022-10-10 22:07:19 +00:00
Sergey Melnikov	34bea270f0	Fix POSTGRES_DISTRIB_DIR for benchmarks on ec2 runner (#2594 )	2022-10-10 09:12:50 +00:00
Kirill Bulatov	13f0e7a5b4	Deploy pageserver_binutils to the envs	2022-10-09 08:21:11 +03:00
Kirill Bulatov	3e35f10adc	Add a script to reformat the project	2022-10-09 08:21:11 +03:00
Kirill Bulatov	3be3bb7730	Be more verbose with initdb for pageserver timeline creation	2022-10-09 08:21:11 +03:00
Kirill Bulatov	01d2c52c82	Tidy up feature reporting	2022-10-09 08:21:11 +03:00
Kirill Bulatov	9f79e7edea	Merge pageserver helper binaries and provide it for deployment (#2590 )	2022-10-08 12:42:17 +00:00
Heikki Linnakangas	a22165d41e	Add tests for comparing root and child branch performance. Author: Thang Pham <thang@neon.tech>	2022-10-08 10:07:33 +03:00
Arseny Sher	725be60bb7	Storage messaging rfc 2.	2022-10-07 21:22:17 +04:00
Dmitry Ivanov	e516c376d6	[proxy] Improve logging (#2554 ) * [proxy] Use `tracing::` instead of `println!` for logging Fix a minor misnomer * Log more stuff	2022-10-07 14:34:57 +03:00
Kirill Bulatov	8e51c27e1a	Restore artifact versions (#2578 ) Context: https://github.com/neondatabase/neon/pull/2128/files#r989489965 Co-authored-by: Rory de Zoete <rory@neon.tech>	2022-10-07 10:58:31 +00:00
Heikki Linnakangas	9e1eb69d55	Increase default compaction_period setting to 20 s. The previous default of 1 s caused excessive CPU usage when there were a lot of projects. Polling every timeline once a second was too aggressive so let's reduce it. Fixes https://github.com/neondatabase/neon/issues/2542, but we probably also want do to something so that we don't poll timelines that have received no new WAL or layers since last check.	2022-10-07 13:55:19 +03:00
Arthur Petukhovsky	687ba81366	Display sync safekeepers output in compute_ctl (#2571 ) Pipe postgres output to compute_ctl stdout and create a test to check that compute_ctl works and prints postgres logs.	2022-10-06 13:53:52 +00:00
Andrés	47bae68a2e	Make get_lsn_by_timestamp available in mgmt API (#2536 ) (#2560 ) Co-authored-by: andres <andres.rodriguez@outlook.es>	2022-10-06 12:42:50 +03:00
Joonas Koivunen	e8b195acb7	fix: apply notify workaround on m1 mac docker (#2564 ) workaround as discussed in the notify repository.	2022-10-06 11:13:40 +03:00
Anastasia Lubennikova	254cb7dc4f	Update CI script to push compute-node-v15 to dockerhub	2022-10-06 10:50:08 +03:00
Anastasia Lubennikova	ed85d97f17	bump vendor/postgres-v15. Rebase it to Stamp 15rc2	2022-10-06 10:50:08 +03:00
Anastasia Lubennikova	4a216c5f7f	Use PostGIS 3.3.1 that is compatible with pg 15	2022-10-06 10:50:08 +03:00
Anastasia Lubennikova	c5a428a61a	Update Dockerfile.compute-node-v15 to match v14 version. Fix build script to promote the image for v15 to neon dockerhub	2022-10-06 10:50:08 +03:00
Konstantin Knizhnik	ff8c481777	Normalize last_record LSN in wal receiver (#2529 ) * Add test for branching on page boundary * Normalize start recovery point Co-authored-by: Heikki Linnakangas <heikki@neon.tech> Co-authored-by: Thang Pham <thang@neon.tech>	2022-10-06 09:01:56 +03:00
Arthur Petukhovsky	f25dd75be9	Fix deadlock in safekeeper metrics (#2566 ) We had a problem where almost all of the threads were waiting on a futex syscall. More specifically: - `/metrics` handler was inside `TimelineCollector::collect()`, waiting on a mutex for a single Timeline - This exact timeline was inside `control_file::FileStorage::persist()`, waiting on a mutex for Lazy initialization of `PERSIST_CONTROL_FILE_SECONDS` - `PERSIST_CONTROL_FILE_SECONDS: Lazy<Histogram>` was blocked on `prometheus::register` - `prometheus::register` calls `DEFAULT_REGISTRY.write().register()` to take a write lock on Registry and add a new metric - `DEFAULT_REGISTRY` lock was already taken inside `DEFAULT_REGISTRY.gather()`, which was called by `/metrics` handler to collect all metrics This commit creates another Registry with a separate lock, to avoid deadlock in a case where `TimelineCollector` triggers registration of new metrics inside default registry.	2022-10-06 01:07:02 +03:00
Sergey Melnikov	b99bed510d	Move proxies to neon-proxy namespace (#2555 )	2022-10-05 16:14:09 +03:00
sharnoff	580584c8fc	Remove control_plane deps on pageserver/safekeeper (#2513 ) Creates new `pageserver_api` and `safekeeper_api` crates to serve as the shared dependencies. Should reduce both recompile times and cold compile times. Decreases the size of the optimized `neon_local` binary: 380M -> 179M. No significant changes for anything else (mostly as expected).	2022-10-04 11:14:45 -07:00
Kirill Bulatov	d823e84ed5	Allow attaching tenants with zero timelines	2022-10-04 18:13:51 +03:00
Kirill Bulatov	231dfbaed6	Do not remove empty timelines/ directory for tenants	2022-10-04 18:13:51 +03:00
Dmitry Rodionov	5cf53786f9	Improve pytest ergonomics 1. Disable perf tests by default 2. Add instruction to run tests in parallel	2022-10-04 14:53:01 +03:00
Heikki Linnakangas	9b9bbad462	Use 'notify' crate to wait for PostgreSQL startup. Compute node startup time is very important. After launching PostgreSQL, use 'notify' to be notified immediately when it has updated the PID file, instead of polling. The polling loop had 100 ms interval so this shaves up to 100 ms from the startup time.	2022-10-04 13:00:15 +03:00
Heikki Linnakangas	537b2c1ae6	Remove unnecessary check for open PostgreSQL TCP port. The loop checked if the TCP port is open for connections, by trying to connect to it. That seems unnecessary. By the time the postmaster.pid file says that it's ready, the port should be open. Remove that check.	2022-10-04 12:09:13 +03:00
Joonas Koivunen	31123d1fa8	Silence clippies, minor doc fix (#2543 ) * doc: remove stray backtick * chore: clippy::let_unit_value * chore: silence useless_transmute, duplicate_mod * chore: remove allowing deref_nullptr not needed since bindgen 0.60.0. * chore: remove repeated allowed lints they are already allowed from the crate root.	2022-10-03 17:44:17 +03:00
Kirill Bulatov	4f2ac51bdd	Bump rustc to 1.61	2022-10-03 16:36:03 +03:00
Kirill Bulatov	7b2f9dc908	Reuse existing tenants during attach (#2540 )	2022-10-03 13:33:55 +03:00
Arthur Petukhovsky	dabb6d2675	Fix log level for sk startup logs (#2526 )	2022-09-27 10:36:17 +00:00
Arthur Petukhovsky	fc7087b16f	Add metric for loaded safekeeper timelines (#2509 )	2022-09-27 11:57:59 +03:00
Vadim Kharitonov	2233ca2a39	seqwait.rs unit tests don't check return value	2022-09-27 11:47:59 +03:00
Dmitry Rodionov	fb68d01449	Preserve task result in TaskHandle by keeping join handle around (#2521 ) * Preserve task result in TaskHandle by keeping join handle around The solution is not great, but it should hep to debug staging issue I tried to do it in a least destructive way. TaskHandle used only in one place so it is ok to use something less generic unless we want to extend its usage across the codebase. In its current current form for its single usage place it looks too abstract Some problems around this code: 1. Task can drop event sender and continue running 2. Task cannot be joined several times (probably not needed, but still, can be surprising) 3. Had to split task event into two types because ahyhow::Error does not implement clone. So TaskContinueEvent derives clone but usual task evend does not. Clone requirement appears because we clone the current value in next_task_event. Taking it by reference is complicated. 4. Split between Init and Started is artificial and comes from watch::channel requirement to have some initial value. To summarize from 3 and 4. It may be a better idea to use RWLock or a bounded channel instead	2022-09-26 20:57:02 +00:00
Arthur Petukhovsky	d15116f2cc	Update pg_version for old timelines	2022-09-26 19:57:03 +03:00
Stas Kelvich	df45c0d0e5	Disable plv8 again	2022-09-26 12:22:46 +03:00

... 36 37 38 39 40 ...

4043 Commits