rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-15 17:32:56 +00:00

Author	SHA1	Message	Date
Alex Chi	f3fdaf8ef1	parallel compaction Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-27 16:42:56 -04:00
Alex Chi	eb93e686ab	fix deletion Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-27 14:47:04 -04:00
Alex Chi	2cb79ae3ff	fix deletion Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-27 14:43:20 -04:00
Alex Chi	dfe8527806	remove assertion Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-27 13:52:12 -04:00
Alex Chi	335710cec6	bring back original compaction Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-27 13:38:02 -04:00
Alex Chi	a78008ad82	max_merge_width Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-27 13:30:23 -04:00
Alex Chi	30e7ffcd28	adjust compaction strategy Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-26 15:52:37 -04:00
Alex Chi	43d564ce0a	incremental image layer Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-26 15:25:35 -04:00
Alex Chi	f86ff5e54b	dump more Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-26 14:57:00 -04:00
Alex Chi	9ed6ad1d24	fix weak ptr Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-26 14:33:00 -04:00
Alex Chi	91f28cb516	include delta l0 in compaction, more metrics Signed-off-by: Alex Chi <chi@neon.tech>	2023-06-26 13:56:29 -04:00
Alex Chi	0b459eb414	fix ratio compute Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 15:11:15 -04:00
Alex Chi	0865ed623c	fix comment Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 15:01:25 -04:00
Alex Chi	9e0f103c7b	insert at 0 Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 15:00:58 -04:00
Alex Chi	9f216a78a1	print Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 15:00:10 -04:00
Alex Chi	6967b4837b	fix Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 14:53:53 -04:00
Alex Chi	9b50350857	threshold = 3 Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 14:37:29 -04:00
Alex Chi	8ebfa32a0c	compaction l0 adds to sorted runs Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 14:24:57 -04:00
Alex Chi	9905d75715	dump file size Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 14:18:54 -04:00
Alex Chi	b0b616f3ac	dump Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 14:12:33 -04:00
Alex Chi	820685fe92	remove all contents Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 13:52:34 -04:00
Alex Chi	a593d96b79	neon_local: support force init Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 13:52:28 -04:00
Alex Chi	867b656ef2	bypass ut Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-22 11:28:34 -04:00
Alex Chi	76b339b150	create partial image layers Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-21 14:38:11 -04:00
Alex Chi	9b3fa1a2e1	fix compile error Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-21 14:16:44 -04:00
Alex Chi	17781776c8	add two compaction triggers Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-21 10:27:24 -04:00
Alex Chi	5274f487e4	add tiered compaction skeleton Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-20 14:43:07 -04:00
Alex Chi	9b7747436c	incremental image? Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-20 11:05:16 -04:00
Alex Chi	a2056666ae	pgserver: move mapping logic to layer cache Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-14 15:07:38 -04:00
Alex Chi	fc190a2a19	resolve merge conflicts Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-13 13:56:50 -04:00
Alex Chi	faee3152f3	refactor: use LayerDesc in LayerMap (part 2) Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-13 13:54:59 -04:00
Christian Schwarz	3693d1f431	turn Timeline::layers into tokio::sync::RwLock (#4441 ) This is preliminary work for/from #4220 (async `Layer::get_value_reconstruct_data`). # Full Stack Of Preliminary PRs Thanks to the countless preliminary PRs, this conversion is relatively straight-forward. 1. Clean-ups * https://github.com/neondatabase/neon/pull/4316 * https://github.com/neondatabase/neon/pull/4317 * https://github.com/neondatabase/neon/pull/4318 * https://github.com/neondatabase/neon/pull/4319 * https://github.com/neondatabase/neon/pull/4321 * Note: these were mostly to find an alternative to #4291, which I thought we'd need in my original plan where we would need to convert `Tenant::timelines` into an async locking primitive (#4333). In reviews, we walked away from that, but these cleanups were still quite useful. 2. https://github.com/neondatabase/neon/pull/4364 3. https://github.com/neondatabase/neon/pull/4472 4. https://github.com/neondatabase/neon/pull/4476 5. https://github.com/neondatabase/neon/pull/4477 6. https://github.com/neondatabase/neon/pull/4485 # Significant Changes In This PR ## `compact_level0_phase1` & `create_delta_layer` This commit partially reverts "pgserver: spawn_blocking in compaction (#4265)" `4e359db4c7`. Specifically, it reverts the `spawn_blocking`-ificiation of `compact_level0_phase1`. If we didn't revert it, we'd have to use `Timeline::layers.blocking_read()` inside `compact_level0_phase1`. That would use up a thread in the `spawn_blocking` thread pool, which is hard-capped. I considered wrapping the code that follows the second `layers.read().await` into `spawn_blocking`, but there are lifetime issues with `deltas_to_compact`. Also, this PR switches the `create_delta_layer` _function_ back to async, and uses `spawn_blocking` inside to run the code that does sync IO, while keeping the code that needs to lock `Timeline::layers` async. ## `LayerIter` and `LayerKeyIter` `Send` bounds I had to add a `Send` bound on the `dyn` type that `LayerIter` and `LayerKeyIter` wrap. Why? Because we now have the second `layers.read().await` inside `compact_level0_phase`, and these iterator instances are held across that await-point. More background: https://github.com/neondatabase/neon/pull/4462#issuecomment-1587376960 ## `DatadirModification::flush` Needed to replace the `HashMap::retain` with a hand-rolled variant because `TimelineWriter::put` is now async.	2023-06-13 18:38:41 +02:00
Christian Schwarz	fdf7a67ed2	init_empty_layer_map: use `try_write` (#4485 ) This is preliminary work for/from #4220 (async `Layer::get_value_reconstruct_data`). Or more specifically, #4441, where we turn Timeline::layers into a tokio::sync::RwLock. By using try_write() here, we can avoid turning init_empty_layer_map async, which is nice because much of its transitive call(er) graph isn't async.	2023-06-13 13:49:40 +02:00
Alexey Kondratov	1299df87d2	[compute_ctl] Fix logging if catalog updates are skipped (#4480 ) Otherwise, it wasn't clear from the log when Postgres started up completely if catalog updates were skipped. Follow-up for `4936ab6`	2023-06-13 13:34:56 +02:00
Christian Schwarz	754ceaefac	make TimelineWriter `Send` by using `tokio::sync Mutex` internally (#4477 ) This is preliminary work for/from #4220 (async `Layer::get_value_reconstruct_data`). There, we want to switch `Timeline::layers` to be a `tokio::sync::RwLock`. That will require the `TimelineWriter` to become async, because at times its functions need to lock `Timeline::layers` in order to freeze the open layer. While doing that, rustc complains that we're now holding `Timeline::write_lock` across await points (lock order is that `write_lock` must be acquired before `Timelines::layers`). So, we need to switch it over to an async primitive.	2023-06-13 10:15:25 +02:00
Arseny Sher	143fa0da42	Remove timeout on test_close_on_connections_exit We have 300s timeout on all tests, and doubling logic in popen.wait sometimes exceeds 5s, making the test flaky. ref https://github.com/neondatabase/neon/issues/4211	2023-06-13 06:26:03 +04:00
bojanserafimov	4936ab6842	compute_ctl: add flag to avoid config step (#4457 ) Add backwards-compatible flag that cplane can use to speed up startup time	2023-06-12 13:57:02 -04:00
Christian Schwarz	939593d0d3	refactor check_checkpoint_distance to prepare for async Timeline::layers (#4476 ) This is preliminary work for/from #4220 (async `Layer::get_value_reconstruct_data`). There, we want to switch `Timeline::layers` to be a `tokio::sync::RwLock`. That will require the `TimelineWriter` to become async. That will require `freeze_inmem_layer` to become async. So, inside check_checkpoint_distance, we will have `freeze_inmem_layer().await`. But current rustc isn't smart enough to understand that we `drop(layers)` earlier, and hence, will complain about the `!Send` `layers` being held across the `freeze_inmem_layer().await`-point. This patch puts the guard into a scope, so rustc will shut up in the next patch where we make the transition for `TimelineWriter`. obsoletes https://github.com/neondatabase/neon/pull/4474	2023-06-12 17:45:56 +01:00
Christian Schwarz	2011cc05cd	make Delta{Value,Key}Iter Send (#4472 ) ... by switching the internal RwLock to a OnceCell. This is preliminary work for/from #4220 (async `Layer::get_value_reconstruct_data`). See https://github.com/neondatabase/neon/pull/4462#issuecomment-1587398883 for more context. fixes https://github.com/neondatabase/neon/issues/4471	2023-06-12 17:45:56 +01:00
Arthur Petukhovsky	b0286e3c46	Always truncate WAL after restart (#4464 ) `c058e1cec2` skipped `truncate_wal()` it if `write_lsn` is equal to truncation position, but didn't took into account that `write_lsn` is reset on restart. Fixes regression looking like: ``` ERROR WAL acceptor{cid=22 ...}:panic{thread=WAL acceptor 19b6c1743666ec02991a7633c57178db/b07db8c88f4c76ea5ed0954c04cc1e74 location=safekeeper/src/wal_storage.rs:230:13}: unexpected write into non-partial segment file ``` This fix will prevent skipping WAL truncation when we are running for the first time after restart.	2023-06-12 13:42:28 +00:00
Heikki Linnakangas	e4f05ce0a2	Enable sanity check that disk_consistent_lsn is valid on created timeline. Commit `create_test_timeline: always put@initdb_lsn the minimum required keys` already switched us over to using valid initdb_lsns. All that's left to do is to actually flush the minimum keys so that we move from disk_consistent_lsn=Lsn(0) to disk_consistent_lsn=initdb_lsn. Co-authored-by: Christian Schwarz <christian@neon.tech> Part of https://github.com/neondatabase/neon/pull/4364	2023-06-12 11:56:49 +01:00
Heikki Linnakangas	8d106708d7	Clean up timeline initialization code. Clarify who's responsible for initializing the layer map. There were previously two different ways to do it: - create_empty_timeline and bootstrap_timeline let prepare_timeline() initialize an empty layer map. - branch_timeline passed a flag to initialize_with_lock() to tell initialize_with_lock to call load_layer_map(). Because it was a newly created timeline, load_layer_map() never found any layer files, so it just initialized an empty layer map. With this commit, prepare_new_timeline() always does it. The LSN to initialize it with is passed as argument. Other changes per function: prepare_timeline: - rename to 'prepare_new_timeline' to make it clear that it's only used when creating a new timeline, not when loading an existing timeline - always initialize an empty layer map. The caller can pass the LSN to initialize it with. (Previously, prepare_timeline would optionally load the layer map at 'initdb_lsn'. Some caller used that, while others let initialize_with_lock do it initialize_with_lock: - As mentioned above, remove the option to load the layer map - Acquire the 'timelines' lock in the function itself. None of the callers did any other work while holding the lock. - Rename it to finish_creation() to make its intent more clear. It's only used when creating a new timeline now. create_timeline_data: - Rename to create_timeline_struct() for clarity. It just initializes the Timeline struct, not any other "data" create_timeline_files: - use create_dir rather than create_dir_all, to be a little more strict. We know that the parent directory should already exist, and the timeline directory should not exist. - Move the call to create_timeline_struct() to the caller. It was just being "passed through" Part of https://github.com/neondatabase/neon/pull/4364	2023-06-12 11:56:49 +01:00
Christian Schwarz	f450369b20	timeline_init_and_sync: don't hold Tenant::timelines while load_layer_map This patch inlines `initialize_with_lock` and then reorganizes the code such that we can `load_layer_map` without holding the `Tenant::timelines` lock. As a nice aside, we can get rid of the dummy() uninit mark, which has always been a terrible hack. Part of https://github.com/neondatabase/neon/pull/4364	2023-06-12 11:56:49 +01:00
Christian Schwarz	aad918fb56	create_test_timeline: tests for put@initdb_lsn optimization code	2023-06-12 11:04:49 +01:00
Christian Schwarz	86dd8c96d3	add infrastructure to expect use of initdb_lsn flush optimization	2023-06-12 11:04:49 +01:00
Christian Schwarz	6a65c4a4fe	create_test_timeline: always put@initdb_lsn the minimum required keys (#4451 ) See the added comment on `create_empty_timeline`. The various test cases now need to set a valid `Lsn` instead of `Lsn(0)`. Rough context: https://github.com/neondatabase/neon/pull/4364#discussion_r1221995691	2023-06-12 09:28:34 +00:00
Vadim Kharitonov	e9072ee178	Compile rdkit (#4442 ) `rdkit` extension ``` postgres=# create extension rdkit; CREATE EXTENSION postgres=# select 'c1[o,s]ncn1'::qmol; qmol ------------- c1[o,s]ncn1 (1 row) ```	2023-06-12 11:13:33 +02:00
Joonas Koivunen	7e17979d7a	feat: http request logging on safekeepers. With RequestSpan, successfull GETs are not logged, but all others, errors and warns on cancellations are.	2023-06-11 22:53:08 +04:00
Arseny Sher	227271ccad	Switch safekeepers to async. This is a full switch, fs io operations are also tokio ones, working through thread pool. Similar to pageserver, we have multiple runtimes for easier `top` usage and isolation. Notable points: - Now that guts of safekeeper.rs are full of .await's, we need to be very careful not to drop task at random point, leaving timeline in unclear state. Currently the only writer is walreceiver and we don't have top level cancellation there, so we are good. But to be safe probably we should add a fuse panicking if task is being dropped while operation on a timeline is in progress. - Timeline lock is Tokio one now, as we do disk IO under it. - Collecting metrics got a crutch: since prometheus Collector is synchronous, it spawns a thread with current thread runtime collecting data. - Anything involving closures becomes significantly more complicated, as async fns are already kinda closures + 'async closures are unstable'. - Main thread now tracks other main tasks, which got much easier. - The only sync place left is initial data loading, as otherwise clippy complains on timeline map lock being held across await points -- which is not bad here as it happens only in single threaded runtime of main thread. But having it sync doesn't hurt either. I'm concerned about performance of thread pool io offloading, async traits and many await points; but we can try and see how it goes. fixes https://github.com/neondatabase/neon/issues/3036 fixes https://github.com/neondatabase/neon/issues/3966	2023-06-11 22:53:08 +04:00
dependabot[bot]	fbf0367e27	build(deps): bump cryptography from 39.0.1 to 41.0.0 (#4409 )	2023-06-11 19:14:30 +01:00

1 2 3 4 5 ...

3322 Commits