Some dropped layers serve as tombstones for earlier layers and thus cannot be garbage collected.
Add new fields to GcResult for layers that are preserved as tombstones.
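
For illustration, a minimal sketch of what the extended struct might look like; the field names here are assumptions, not the actual GcResult fields:

```rust
// Hypothetical sketch: counters for a GC pass, including layers that
// could not be removed. Field names are illustrative assumptions,
// not the real GcResult API.
#[derive(Default, Debug)]
pub struct GcResult {
    pub layers_total: u64,
    pub layers_removed: u64,
    // New: layers that must stay because they act as tombstones
    // for earlier layers.
    pub layers_kept_as_tombstones: u64,
}
```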
anyhow uses the alternate formatting style ("{:#}") to display all of
the causes of an error instead of the outermost context.
Without this, there's less information available to figure out what's
going on. It's probably too much to display in the compute node logs
though, so it's better to leave that formatting as-is.
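
A small example of the difference, using anyhow's standard formatting behavior:

```rust
use anyhow::{anyhow, Context, Result};

fn fetch_page() -> Result<()> {
    Err(anyhow!("connection refused")).context("could not fetch page")
}

fn main() {
    let err = fetch_page().unwrap_err();
    // Default formatting shows only the outermost context:
    //   could not fetch page
    println!("{}", err);
    // Alternate formatting shows the whole cause chain:
    //   could not fetch page: connection refused
    println!("{:#}", err);
}
```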
In this test safekeepers are restarted one by one, while bank transactions
are executed and validated in the background. Bank transactions consist of
balance transfers and log writes. In the end, the balance sum should remain
the same, and every client should make progress whenever 2 of 3 safekeeper
nodes are up.
It's not interesting for most tests, and clutters the output. If there
are individual tests where it is worthwhile, let's add pg_controldata calls
to those tests, but I don't think it's needed for now.
If the 'latest' flag in the client request is true, the client wants the
latest page version regardless of the LSN in the request. The LSN is just
a hint in that case, indicating that the page hasn't been modified since
that LSN. The LSN can be very old, so it's possible that the page
server has already garbage collected away the layer at that LSN. We tried
to fetch the old layer and errored out if that happened. To fix, always
fetch the data as of last-record-LSN, if 'latest' is set in the client
request. We now only use the LSN to wait if the requested LSN hasn't been
received and processed yet.
Fixes https://github.com/zenithdb/zenith/issues/567
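
A hedged sketch of the resulting logic; the type and function names (wait_lsn, get_last_record_lsn, etc.) are stand-ins and may not match the actual pageserver code:

```rust
use anyhow::Result;

// Stub types standing in for the real pageserver structures.
type Lsn = u64;
struct Timeline {
    last_record_lsn: Lsn,
}

impl Timeline {
    fn get_last_record_lsn(&self) -> Lsn {
        self.last_record_lsn
    }
    fn wait_lsn(&self, _lsn: Lsn) -> Result<()> {
        Ok(()) // the real code blocks until the LSN has been processed
    }
}

fn effective_read_lsn(timeline: &Timeline, req_lsn: Lsn, latest: bool) -> Result<Lsn> {
    // The request LSN is only used for waiting: make sure it has been
    // received and processed before answering.
    timeline.wait_lsn(req_lsn)?;
    if latest {
        // The client wants the latest version; the (possibly very old)
        // request LSN is just a hint, and its layer may already be
        // garbage collected, so read as of the last-record LSN.
        Ok(timeline.get_last_record_lsn())
    } else {
        Ok(req_lsn)
    }
}
```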
- Use different message formats for different kinds of response messages
  (see the sketch after this list).
- Add an Error message, for passing errors from page server to Postgres.
  Previously, we would respond to an 'exists' request with 'false', and
  to an 'nblocks' request with 0, if an error happened. Fix those to return
  an error message to the client. GetPage requests had a mechanism to
  return an error, but it was just a flag with no error message.
- Add a flag to requests, to indicate that we actually want the latest
page version on the timeline, and the LSN is just a hint that we know
that there haven't been any modifications since that LSN. The flag isn't
used for anything yet, but I'm planning to use it to fix
https://github.com/zenithdb/zenith/issues/567
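
A rough sketch of the reshaped request and response types; the variant and field names are invented for illustration, not the actual wire format:

```rust
// Illustrative sketch of per-kind response messages plus an Error
// message; these names are assumptions.
type Lsn = u64;

pub struct PagestreamRequest {
    pub lsn: Lsn,
    // New: the client wants the latest page version; `lsn` is only a
    // hint that the page hasn't been modified since then.
    pub latest: bool,
}

pub enum PagestreamResponse {
    Exists { exists: bool },
    Nblocks { n_blocks: u32 },
    Page { page: Vec<u8> },
    // New: a real error message, instead of answering 'false' or 0
    // when something goes wrong.
    Error { message: String },
}
```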
Most of the previous usages of get_repository_for_tenant were immediately
followed by getting a timeline in that repository, without keeping the
repository around for longer.
The new `get_timeline_for_tenant` function implements that same
behavior, but in one line.
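
A minimal sketch of the helper under those assumptions; the stub types and exact signature are illustrative:

```rust
use anyhow::Result;
use std::sync::Arc;

// Stubs standing in for the real pageserver types.
pub struct Repository;
pub struct Timeline;
pub type ZTenantId = u128;
pub type ZTimelineId = u128;

impl Repository {
    pub fn get_timeline(&self, _id: ZTimelineId) -> Result<Arc<Timeline>> {
        Ok(Arc::new(Timeline))
    }
}

pub fn get_repository_for_tenant(_id: ZTenantId) -> Result<Arc<Repository>> {
    Ok(Arc::new(Repository))
}

// The new one-line helper: resolve tenant -> repository -> timeline
// without keeping the repository around.
pub fn get_timeline_for_tenant(
    tenant_id: ZTenantId,
    timeline_id: ZTimelineId,
) -> Result<Arc<Timeline>> {
    get_repository_for_tenant(tenant_id)?.get_timeline(timeline_id)
}
```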
- Replace the hardcoded OLDEST_INMEM_DISTANCE value with a pageserver config option, checkpoint_distance.
- Get rid of the 'force' flag in checkpoint_internal(); use checkpoint_distance=0 instead (sketched below).
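
A sketch of how the two changes combine; apart from checkpoint_distance, the names are assumptions:

```rust
// Illustrative sketch only.
type Lsn = u64;

pub struct PageServerConf {
    // Replaces the hardcoded OLDEST_INMEM_DISTANCE.
    pub checkpoint_distance: u64,
}

fn checkpoint_internal(conf: &PageServerConf, oldest_pending: Lsn, last_record: Lsn) {
    // With checkpoint_distance = 0 this always fires, which is what
    // the removed 'force' flag used to do.
    if last_record.saturating_sub(oldest_pending) >= conf.checkpoint_distance {
        // ... flush in-memory layers to disk ...
    }
}
```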
Support is implemented via the pytest-xdist plugin.
To use the feature, add -n<concurrency> to the pytest invocation,
e.g. pytest -n8 to run 8 tests in parallel.
The code changes are mostly about port assignment. Previously, the
pageserver port was hardcoded with no way to override it through the
zenith CLI, and ports for started compute nodes were calculated twice,
in the zenith CLI and in the test code. Now the zenith CLI supports
explicit port arguments for the pageserver and compute nodes.
Tests are modified in such a way that each worker gets a non-overlapping
port range, which is configurable and currently contains 100 ports. These
ports are distributed to the test services (pageserver, WAL acceptors,
compute nodes) so they can work independently.
Data written to frozen layers is lost. It will not appear in on-disk
structures or in successor InMemoryLayers. Here we detect this race, and
fail. I think this race is rare, but this should make it easier to track
down when it happens.
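
A hedged sketch of the detection; the struct layout and method names are assumptions:

```rust
use std::sync::Mutex;

// Illustrative sketch only; the real InMemoryLayer internals differ.
struct InMemoryLayerInner {
    frozen: bool,
    // ... page versions, etc. ...
}

pub struct InMemoryLayer {
    inner: Mutex<InMemoryLayerInner>,
}

impl InMemoryLayer {
    pub fn put_page_version(&self, _blknum: u32, _img: &[u8]) {
        let inner = self.inner.lock().unwrap();
        // A write racing with freezing would be silently lost: it would
        // appear neither on disk nor in the successor layer. Fail loudly
        // so the race is easy to track down when it happens.
        assert!(!inner.frozen, "write to frozen InMemoryLayer");
        // ... record the new page version ...
    }
}
```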
Implement the changes suggested in a comment: create
`get_layer_for_read_locked` so that `get_layer_for_write` doesn't have
to drop the LayerMap lock when searching for the predecessor.
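
A sketch of the split, with invented stand-in types:

```rust
use std::sync::{Mutex, MutexGuard};

// Stand-in types; the real LayerMap API differs.
type Lsn = u64;
pub struct Layer;
pub struct LayerMap;

impl LayerMap {
    fn lookup(&self, _lsn: Lsn) -> Option<Layer> {
        None
    }
}

pub struct Timeline {
    layers: Mutex<LayerMap>,
}

impl Timeline {
    pub fn get_layer_for_read(&self, lsn: Lsn) -> Option<Layer> {
        let layers = self.layers.lock().unwrap();
        Self::get_layer_for_read_locked(lsn, &layers)
    }

    // The caller already holds the LayerMap lock, so get_layer_for_write
    // can search for a predecessor without dropping it.
    fn get_layer_for_read_locked(lsn: Lsn, layers: &MutexGuard<LayerMap>) -> Option<Layer> {
        layers.lookup(lsn)
    }
}
```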
This contains the lowest common denominator of the pageserver and
safekeeper log initialisation routines. It uses the daemonize flag to
decide where to stream log messages: when daemonize is true, log messages
are forwarded to a file; otherwise they are streamed to stdout. Stdout is
the default place for log output in the Docker world, so this makes it
easier to browse our logs via built-in docker commands.
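
A minimal sketch of the file-or-stdout split, with illustrative names; the real routine wires the chosen destination into the logging framework:

```rust
use std::fs::File;
use std::io::{self, Write};

// Sketch: choose the log destination from the daemonize flag.
fn log_destination(daemonize: bool, log_file: &str) -> io::Result<Box<dyn Write + Send>> {
    if daemonize {
        // Daemonized: stdout is detached, so forward messages to a file.
        Ok(Box::new(File::create(log_file)?))
    } else {
        // Foreground (e.g. under Docker): stream to stdout, so logs are
        // visible via the built-in `docker logs` command.
        Ok(Box::new(io::stdout()))
    }
}
```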
This job will be responsible for triggering a remote CI pipeline in
the zenithdb/console repository. That way, we'll always know when
a PR to zenithdb/zenith breaks the cloud console app.
Otherwise, a restart of a safekeeper before the first segment is filled
makes it report 0 as the flushed LSN. To fix this, tweak
find_end_of_wal_segment to allow starting from a given LSN, not only from
the start of the segment. While here, make it less panicky.
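
A hedged sketch of the tweaked signature; the real safekeeper function differs in detail:

```rust
type Lsn = u64;

// Illustrative sketch: scan from an arbitrary start_lsn instead of
// only the segment start, and return errors instead of panicking.
fn find_end_of_wal_segment(segment: &[u8], start_lsn: Lsn) -> anyhow::Result<Lsn> {
    if segment.is_empty() {
        // Cases like this used to panic; report an error instead.
        anyhow::bail!("empty WAL segment");
    }
    // ... walk valid records starting at start_lsn and return the
    // position just past the last complete one (parsing elided) ...
    Ok(start_lsn)
}
```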