After this, a layer's start bound is always defined to be inclusive, and
its end bound exclusive.
For example, if you have a layer in the range 100-200, that layer can be
used for GetPage@LSN requests at LSN 100, 199, or anything in between.
But for LSN 200, you need to look at the next layer (if one exists).
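A minimal sketch of what the inclusive/exclusive rule means for lookups; the
Lsn wrapper and field names here are illustrative, not the pageserver's actual
types:

    // Illustrative types, not the pageserver's own Lsn/Layer.
    #[derive(Clone, Copy, PartialEq, PartialOrd)]
    struct Lsn(u64);

    struct Layer {
        start_lsn: Lsn, // inclusive
        end_lsn: Lsn,   // exclusive
    }

    impl Layer {
        fn covers(&self, lsn: Lsn) -> bool {
            self.start_lsn <= lsn && lsn < self.end_lsn
        }
    }

    fn main() {
        let layer = Layer { start_lsn: Lsn(100), end_lsn: Lsn(200) };
        assert!(layer.covers(Lsn(100)));  // start is included
        assert!(layer.covers(Lsn(199)));
        assert!(!layer.covers(Lsn(200))); // 200 belongs to the next layer, if any
    }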
This is one part of a fix for https://github.com/zenithdb/zenith/issues/517.
After this, the page server shouldn't create layers for the same segment
with the same LSN, which avoids the issue. However, the same thing could
still happen if you managed to create layers with the same start LSN again.
That could happen e.g. if you had two page servers running, or in some
weird crash/restart scenario, or due to bugs or features added later. The
next commit makes the layer map more robust, so that it tolerates that
situation without deleting the wrong files.
A new command has been added to append specially crafted records to the safekeeper WAL. The command takes a JSON description of the append, encodes a LogicalMessage based on the JSON fields, and processes a new AppendRequest to append and commit the WAL in the safekeeper.
The Python test starts up walkeepers and creates a config for the walproposer, then appends WAL and checks that --sync-safekeepers works without errors. This is the simplest test; more useful test cases for different setups (like in #545) will be added soon.
Postgres commit message:
PQgetCopyData can sometimes indicate that the copy is done if the
backend returns an error response. So while we still expect that the
walkeeper never sends CopyDone, we can't expect it to never produce
errors.
- Make dropped layers non-writeable in get_layer_for_write().
- Handle non-writeable dropped layers in the checkpointer. They don't need freezing, so just remove them from the list of open_segs and write them out to disk.
- Remove the code that handles dropped layers in the freeze() function. It is not used anymore.
Some dropped layers serve as tombstones for earlier layers and thus cannot be garbage collected.
Add new fields to GcResult for layers that are preserved as tombstones.
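Roughly, the bookkeeping looks like this; the struct and field names below are
hypothetical, not the actual GcResult fields:

    /// Hypothetical GC accounting; the real GcResult has different fields.
    #[derive(Default, Debug)]
    struct GcResult {
        layers_removed: u64,
        layers_kept_as_tombstones: u64,
    }

    /// A dropped layer that still shadows older layers acts as a tombstone:
    /// removing it would make the older data visible again, so GC keeps it.
    fn account_dropped_layer(shadows_older_layers: bool, result: &mut GcResult) {
        if shadows_older_layers {
            result.layers_kept_as_tombstones += 1;
        } else {
            result.layers_removed += 1;
        }
    }

    fn main() {
        let mut result = GcResult::default();
        account_dropped_layer(true, &mut result);
        println!("{:?}", result);
    }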
anyhow uses the alternate formatting style ("{:#}") to display all of
the causes of an error instead of just the outermost context.
Without this, there's less information available to figure out what's
going on. It's probably too much to display in the compute node logs,
though, so it's better to leave that formatting as-is.
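As a standalone illustration of the difference between the two styles (not
code from this repo):

    // Standalone example, not code from this repo.
    use anyhow::{anyhow, Context, Result};

    fn fetch_page() -> Result<()> {
        Err(anyhow!("connection reset by peer")).context("could not read page from pageserver")
    }

    fn main() {
        let err = fetch_page().unwrap_err();
        // "{}" prints only the outermost context:
        //   could not read page from pageserver
        println!("{}", err);
        // "{:#}" prints the whole chain of causes on one line:
        //   could not read page from pageserver: connection reset by peer
        println!("{:#}", err);
    }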
In this test safekeepers are restarted one by one, while bank transactions
are executed and validated in the background. Bank transactions consist of
balance transfers and log writes. In the end, the balance sum should remain
the same, and there should be progress from every client, as long as 2 of 3
safekeeper nodes are up.
It's not interesting for most tests, and clutters the output. If there
are individual tests where it is worthwhile, let's add pg_controldata calls
to those tests, but I don't think it's needed for now.
If the 'latest' flag in the client request is true, the client wants the
latest page version regardless of the LSN in the request. The LSN is just
a hint in that case, indicating that the page hasn't been modified since
that LSN. The LSN can be very old, so it's possible that the page
server has already garbage collected away the layer at that LSN. We tried
to fetch the old layer and errored out if that happened. To fix, always
fetch the data as of last-record-LSN, if 'latest' is set in the client
request. We now only use the LSN to wait if the requested LSN hasn't been
received and processed yet.
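A simplified sketch of the new lookup logic; the function and parameter names
are made up for illustration and don't match the actual pageserver code:

    /// Hypothetical sketch: pick the LSN to read at. If 'latest' is set, the
    /// request LSN is only used to wait until that WAL has been received and
    /// processed; the read itself happens at the last-record LSN, so we never
    /// look up a layer that GC may already have removed.
    fn effective_read_lsn(request_lsn: u64, latest: bool, last_record_lsn: u64) -> u64 {
        wait_for_lsn(request_lsn);
        if latest {
            last_record_lsn
        } else {
            request_lsn
        }
    }

    fn wait_for_lsn(_lsn: u64) {
        // placeholder: block until WAL up to this LSN has been received
        // and processed
    }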
Fixes https://github.com/zenithdb/zenith/issues/567
- Use different message formats for different kinds of response messages.
- Add an Error message, for passing errors from page server to Postgres.
  Previously, we would respond to an 'exists' request with 'false', and
  to an 'nblocks' request with 0, if an error happened. Fix those to return
an error message to the client. GetPage requests had a mechanism to
return an error, but it was just a flag with no error message.
- Add a flag to requests, to indicate that we actually want the latest
page version on the timeline, and the LSN is just a hint that we know
that there haven't been any modifications since that LSN. The flag isn't
used for anything yet, but I'm planning to use it to fix
https://github.com/zenithdb/zenith/issues/567
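A rough sketch of the message shapes described above; the type and field names
are hypothetical, not the actual wire format:

    /// Hypothetical GetPage request: 'latest' marks the LSN as a hint only.
    struct GetPageRequest {
        latest: bool,
        lsn: u64,
        rel_tag: u32,
        blkno: u32,
    }

    /// Hypothetical responses: each request kind gets its own message format,
    /// and any of them can now fail with a proper error string instead of a
    /// bare 'false', 0, or error flag.
    enum PagestreamResponse {
        Exists(bool),
        Nblocks(u32),
        GetPage(Vec<u8>),
        Error(String),
    }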
Most of the previous usages of get_repository_for_tenant were followed
by immediately getting a timeline in that repository, without keeping
the repository around any longer.
The new `get_timeline_for_tenant` function implements that same
behavior, but in one line.
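Roughly, the helper just folds the two steps into one; the types below are
simplified stand-ins, not the real signatures in the repo:

    // Simplified stand-ins; the real types and signatures in the repo differ.
    struct Repository;
    struct Timeline;
    type TenantId = u128;
    type TimelineId = u128;

    impl Repository {
        fn get_timeline(&self, _id: TimelineId) -> anyhow::Result<Timeline> {
            Ok(Timeline)
        }
    }

    fn get_repository_for_tenant(_id: TenantId) -> anyhow::Result<Repository> {
        Ok(Repository)
    }

    /// The new helper: look up the repository and immediately resolve the
    /// timeline, which is what most call sites were doing by hand.
    fn get_timeline_for_tenant(tenant: TenantId, timeline: TimelineId) -> anyhow::Result<Timeline> {
        get_repository_for_tenant(tenant)?.get_timeline(timeline)
    }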
- Replace the hardcoded OLDEST_INMEM_DISTANCE value with a pageserver config option, checkpoint_distance.
- Get rid of the 'force' flag in checkpoint_internal(). Use checkpoint_distance=0 instead.
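For illustration, a checkpoint-distance check might look like this; the names
and the exact trigger condition are assumptions, not the repo's code:

    /// Hypothetical trigger: flush an open in-memory layer once the WAL
    /// distance since its oldest pending record reaches checkpoint_distance.
    /// With checkpoint_distance = 0 every open layer qualifies, which is what
    /// replaces the old 'force' flag.
    fn needs_checkpoint(oldest_pending_lsn: u64, last_record_lsn: u64, checkpoint_distance: u64) -> bool {
        last_record_lsn.saturating_sub(oldest_pending_lsn) >= checkpoint_distance
    }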
Support is implemented via the pytest-xdist plugin.
To use the feature, add -n<concurrency> to the pytest invocation,
e.g. pytest -n8 to run 8 tests in parallel.
The code changes are mostly about port assignment. Previously, the
pageserver port was hardcoded without the ability to override it through the
zenith cli, and the ports for started compute nodes were calculated twice,
in the zenith cli and in the test code. Now the zenith cli supports passing
port arguments explicitly for the pageserver and compute nodes.
Tests are modified in such a way that each worker gets a non-overlapping
port range, which can be configured and currently contains 100 ports. These
ports are distributed to the test services (pageserver, wal acceptors,
compute nodes) so they can work independently.