rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-07 05:22:56 +00:00

Author	SHA1	Message	Date
Arthur Petukhovsky	8a44796905	Increase parallel workers to trigger more errors	2021-09-29 12:23:49 +03:00
Arthur Petukhovsky	ed521e05e7	Hide debug logs in test_wal_acceptor_async	2021-09-29 11:47:53 +03:00
Arthur Petukhovsky	6a13500da4	Fix print in last test	2021-09-29 11:47:53 +03:00
Arthur Petukhovsky	9b8168ebde	Don't log test output while running	2021-09-29 11:47:53 +03:00
Arthur Petukhovsky	f9bb4dbf08	Use f-strings for logs	2021-09-29 11:47:52 +03:00
Arthur Petukhovsky	20ee204c27	Fix string formatting	2021-09-29 11:47:52 +03:00
Arthur Petukhovsky	3fdd85bcb8	Use logging in python tests	2021-09-29 11:47:52 +03:00
Kirill Bulatov	fb05e4cb0b	Show better error messages on pageserver failures	2021-09-29 01:55:41 +03:00
Egor Suvorov	b0a7234759	pageserver: fix stale default listen addrs * In command line help * In dummy_conf	2021-09-28 20:57:51 +03:00
Egor Suvorov	ddf4b15ebc	pageserver: use const_format crate to generate default listen addrs	2021-09-28 20:57:51 +03:00
Egor Suvorov	3065532f15	pageserver: fix mistype in listen-http arg help	2021-09-28 20:57:51 +03:00
Arthur Petukhovsky	d6fc74a412	Various fixes for test_sync_safekeepers (#668 ) * Send ProposerGreeting manually in tests * Move test_sync_safekeepers to test_wal_acceptor.py * Capture test_sync_safekeepers output * Add comment for handle_json_ctrl * Save captured output in CI	2021-09-28 19:25:05 +03:00
Arseny Sher	7a370394a7	Wait till previous victim recovers in run_restarts_under_load. Fixes test flakiness, as recovery easily might take the whole iteration.	2021-09-28 19:15:41 +03:00
Stas Kelvich	0f3cf8ac94	Cleanup Dockerfile. * make .dockerignore `ncdu -X` compatible to easily inspect build context * remove cargo-chef as it was introducing more problems than it was solving * remove rocksdb packages * add ca-certs in the resulting image. We need that to be able to make https connections from container with proxy to the console.	2021-09-28 18:26:20 +03:00
Heikki Linnakangas	014be8b230	Use Iterator, to avoid making one copy of page_versions BTreeMap Reduces the CPU time spent in checkpointing, in the write_to_disk() function.	2021-09-27 19:28:02 +03:00
Heikki Linnakangas	08978458be	Refactor write_to_disk, handling dropped segment as a special case. Similar to what commit `7fb7f67b` did to 'freeze', dealing with the dropped segment separately from the rest of the logic makes the code easier to follow. It is also needed by the next commit that replaces the code to build new BTreeMap with an iterator; we cannot pass one of two kinds of closures as argument, it has to always be the same one. Having separate DeltaLayer::create() calls for the case of dropped segment and the other cases works around that.	2021-09-27 19:23:32 +03:00
Heikki Linnakangas	2252d9faa8	Switch to RwLock in InMemoryLayer Allows more parallelism basically for free.	2021-09-27 19:15:40 +03:00
Arthur Petukhovsky	22e15844ae	Fix clippy errors (#673 )	2021-09-27 18:59:30 +03:00
Konstantin Knizhnik	ca9af37478	Do not write WAL at pageserver (#645 ) * Do not write WAL at pageserver * Remove import_timeline_wal function	2021-09-27 14:15:55 +03:00
Stas Kelvich	aae41e8661	Proxy pass for existing users. Ask console to check per-cluster auth info.	2021-09-27 11:56:43 +03:00
Stas Kelvich	8331ce865c	Interceipt and log error in mgmt interface. That PostgresBackend is better be replaced with the http server or redis subscription. For now let's improve logging and move on.	2021-09-27 11:56:43 +03:00
Stas Kelvich	3bac4d485d	Fix EncryptionResponse message in pq_proto.rs Positive EncryptionResponse should set 'S' byte, not 'Y'. With that fix it is possible to connect to proxy with SSL enabled and read deciphered notice text. But after the first query everything stucks.	2021-09-27 11:56:43 +03:00
Stas Kelvich	f84eaf4f05	Leave only pkcs8 keys support for proxy. rsa_private_keys() function returns an empty vector when tries to read pkcs8-encoded file instead of returning an error. So previous check was failing on pkcs8. Leave only pkcs8 for now.	2021-09-27 11:56:43 +03:00
Arseny Sher	70b08923ed	Disable new safekeepers tests as not stable enough.	2021-09-26 22:33:58 +03:00
Heikki Linnakangas	c846a824de	Bump vendor/postgres, to use buffered I/O in WAL redo process. Greatly reduces the CPU overhead in the WAL redo process.	2021-09-24 21:48:30 +03:00
Heikki Linnakangas	b71e3a40e2	Add more details to the log, when an error happens in GetPage request.	2021-09-24 21:44:22 +03:00
Heikki Linnakangas	41dfc117e7	Buffer the writes to the WAL redo process pipe. Reduces the CPU time spent in the write() syscalls. I noticed that we were spending a lot of CPU time in libc::write, coming from request_redo(), in the 'bulk_insert' test. According to some quick profiling with 'perf', this reduces the CPU time spent in request_redo() from about 30% to 15%. For some reason, it doesn't reduce the overall runtime of the 'bulk_insert' test much, maybe by one second if you squint (from about 37s to 36s), so there must be some other bottleneck, like I/O. But this is surely still a good idea, just based on the reduced CPU cycles.	2021-09-24 21:12:38 +03:00
sharnoff	a72707b8cb	Redo #655 with fix: Allow `LeSer`/`BeSer` impls missing either `Serialize` or `Deserialize` Commit message copied below: * Allow LeSer/BeSer impls missing Serialize/Deserialize Currently, using `LeSer` or `BeSer` requires that the type implements both `Serialize` and `DeserializeOwned`, even if we're only using the trait for one of those functionalities. Moving the bounds to the methods gives the convenience of the traits without requiring unnecessary derives. * Remove unused #[derive(Serialize/Deserialize)] This should hopefully reduce compile times - if only by a little bit. Some of these were already unused (we weren't using LeSer/BeSer for the types), but most are have become unused with the change to LeSer/BeSer.	2021-09-24 10:58:01 -07:00
Max Sharnoff	0f770967b4	Revert "Allow `LeSer`/`BeSer` impls missing either `Serialize` or `Deserialize` (#655 ) This reverts commit `bd9f4794d9`.	2021-09-24 10:18:36 -07:00
Max Sharnoff	bd9f4794d9	Allow `LeSer`/`BeSer` impls missing either `Serialize` or `Deserialize` (#655 ) * Allow LeSer/BeSer impls missing Serialize/Deserialize Currently, using `LeSer` or `BeSer` requires that the type implements both `Serialize` and `DeserializeOwned`, even if we're only using the trait for one of those functionalities. Moving the bounds to the methods gives the convenience of the traits without requiring unnecessary derives. * Remove unused #[derive(Serialize/Deserialize)] This should hopefully reduce compile times - if only by a little bit. Some of these were already unused (we weren't using LeSer/BeSer for the types), but most are have become unused with the change to LeSer/BeSer.	2021-09-24 10:06:03 -07:00
Heikki Linnakangas	ff5cbe2694	Support overlapping and nested Layers in the layer map. This introduces a new tree data structure for holding intervals, and queries of the form "which intervals contain the given point?". It then uses that to store the Layers in the layer map, instead of the BTreeMap. While we don't currently create overlapping layers in the page server, that situation might arise in the future if we start to create extra layers for performance purposes, or as part of some multi-stage garbage collection operation that creates new layers in some interval and then removes old ones. The situation might also arise if you have multiple page servers running on the same timeline, freezing layers at different points, and both uploading them to S3. So even though overlapping layers might not happen currently, let's avoid getting confused if it does happen for some reason. Fixes https://github.com/zenithdb/zenith/issues/517.	2021-09-24 14:10:52 +03:00
Heikki Linnakangas	2319e0ec8f	Define a layer's start and end bounds more precisely. After this, a layer's start bound is always defined to be inclusive, and end bound exclusive. For example, if you have a layer in the range 100-200, that layer can be used for GetPage@LSN requests at LSN 100, 199, or anything in between. But for LSN 200, you need to look at the next layer (if one exists). This is one part of a fix for https://github.com/zenithdb/zenith/issues/517. After this, the page server shouldn't create layers for the same segment with the same LSN, which avoids the issue. However, the same thing would still happen, if you managed to create layers with same start LSN again. That could happen e.g. if you had two page servers running, or in some weird crash/restart scenario, or due to bugs or features added later. The next commit makes the layer map more robust, so that it tolerates that situation without deleting wrong files.	2021-09-24 14:10:49 +03:00
Arthur Petukhovsky	d4e037f1e7	Support for `--sync-safekeepers` in tests (#647 ) New command has been added to append specially crafted records in safekeeper WAL. This command takes json for append, encodes LogicalMessage based on json fields, and processes new AppendRequest to append and commit WAL in safekeeper. Python test starts up walkeepers and creates config for walproposer, then appends WAL and checks --sync-safekeepers works without errors. This test is simplest one, more useful test cases (like in #545) for different setups will be added soon.	2021-09-24 13:19:59 +03:00
Max Sharnoff	139936197a	bump vendor/postgres: Catch walkeeper ErrorResponse (#650 ) Postgres commit message: PQgetCopyData can sometimes indicate that the copy is done if the backend returns an error response. So while we still expect that the walkeeper never sends CopyDone, we can't expect it to never produce errors.	2021-09-23 14:55:38 -07:00
Heikki Linnakangas	d4eed61f57	Refactor code for parsing and creating postgresql.conf. There's surely more that could be done, but this makes it a bit more readable at least.	2021-09-23 19:34:27 +03:00
Patrick Insinger	7db3a9e7d9	walredo - don't use RefCell on stdin/stdout	2021-09-23 08:42:58 -07:00
Patrick Insinger	c81ee3bd5b	Add some comments to the checkpoint process	2021-09-23 13:19:45 +03:00
anastasia	7fb7f67bb4	Fix relish extention after it was dropped or truncated. - Turn dropped layers into non-writeable in get_layer_for_write(). - Handle non-writeable dropped layers in checkpointer. They don't need freezing, so just remove them from list of open_segs and write out to disk. - Remove code that handles dropped layers in freeze() function. It is not used anymore.	2021-09-23 13:19:45 +03:00
anastasia	86164c8b33	Add unit tests for drop_lsn. test_drop_extend and test_truncate_extend illustrate what happens if we dropped a segment and then created it again within the same layer.	2021-09-23 13:19:45 +03:00
Arseny Sher	97c4cd4434	bump vendor/postgres	2021-09-23 12:22:53 +03:00
anastasia	a4fc6da57b	Fix gc_internal to treat dropped layers. Some dropped layers serve as tombstones for earlier layers and thus cannot be garbage collected. Add new fields to GcResult for layers that are preserved as tombstones	2021-09-23 12:21:47 +03:00
anastasia	c934e724a8	Enable test_list_rels_drop test	2021-09-23 12:21:47 +03:00
anastasia	e554f9514f	gc refactoring - rename 'compact' argument of GC to 'checkpoint_before_gc'. - gc_iteration_internal() refactoring	2021-09-23 12:21:47 +03:00
Max Sharnoff	d7cff8fbaf	Show more detailed query errors from postgres_backend (#651 ) anyhow uses the alternate formatting style ("{:#}") to display all of the causes of an error instead of the outermost context. Without this, there's less information available to figure out what's going on. It's probably too much to display in the compute node logs though, so it's better to leave that formatting as-is.	2021-09-22 14:51:14 -07:00
Max Sharnoff	90ef661673	Fix rustc & clippy warnings for nightly (2021-09-19) (#629 ) Fix clippy warnings for nightly (2021-09-19)	2021-09-22 11:24:43 -07:00
Dmitry Rodionov	579b5ee944	exclude labels formatting for every operation in LOGICAL_TIMELINE_SIZE gauge metric	2021-09-22 18:03:48 +03:00
Arthur Petukhovsky	8ebf2fe550	Add test for acceptor restarts under load (#591 ) In this test safekeepers are restarted one by one, while bank transactions are executed and validated in the background. Bank transactions consist of balance transfers and log writes. In the end balance sum should remain the same and there should be progress from every client, when 2 of 3 safekeeper nodes are up.	2021-09-22 11:59:20 +03:00
Dmitry Rodionov	16d3dc821a	disable parallelization for benchmarks	2021-09-21 23:08:22 +03:00
Heikki Linnakangas	a91eeb1c65	Buffer the writes when writing a layer to disk. Significantly reduces the CPU time spent on libc::write.	2021-09-21 16:54:29 +03:00
Heikki Linnakangas	49c8c03465	Add performance test for bulk INSERT	2021-09-21 13:25:46 +03:00

1 2 3 4 5 ...

892 Commits