rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-04 12:02:55 +00:00

Author	SHA1	Message	Date
Arseny Sher	8f0cafd508	Grab safekeeper.lock on the whole directory instead of per tli. closes #976	2021-12-13 22:11:04 +03:00
Heikki Linnakangas	e0d41ac6a3	Move constants related to metadata file to metadata.rs. They're not used anywhere else, so seems like a better place.	2021-12-13 16:57:16 +02:00
Heikki Linnakangas	72ef59c378	Fix small typos in comments, add a comment. The introducing paragraph README could use some more love, but let's at least fix the typos.	2021-12-13 13:44:08 +02:00
Kirill Bulatov	673c297949	Download timelines on demand	2021-12-10 17:23:35 +02:00
Kirill Bulatov	e61732ca7c	Compress checkpoint files before streaming into S3	2021-12-10 17:23:35 +02:00
Heikki Linnakangas	cb4a8396fb	Use rustls rather than native-tls in all dependencies. We depends on rustls in postgres_backend anyway, so might as well use it for all TLS stuff. Seems better to depend on only one library both from a security point of view, and because fewer dependencies means less code to compile. With this commit, we no longer depend on OpenSSL.	2021-12-10 15:14:27 +02:00
Heikki Linnakangas	c77e30116e	Split waldecoder.rs into two source files. Move the code for decoding a WAL stream into WAL records into 'postgres_ffi', and keep the code to parse the WAL records deeper in 'pageserver' crate, renamed to walrecord.rs. This tidies up the dependencies a bit. 'walkeeper' reuses the same waldecoder routines, and it used to depend on 'pageserver' because of that. Now it only depends on 'postgres_ffi'. (The comment in walkeeper/Cargo.toml that claimed that the dependency was needed for ZTimelineId was obsolete. ZTimelineId is defined in 'zenith_utils', the dependency was actually needed for the waldecoder.)	2021-12-10 15:14:13 +02:00
Heikki Linnakangas	9d369f158c	Update rust-s3 to version 0.28.0 0.28.0 includes two changes I submitted to upstream: - Add support for older ListObjects API, needed to use rust-s3 with Google Cloud Storage: https://github.com/durch/rust-s3/pull/229 - If file is smaller than one chunk, don't initiate multi-part upload. https://github.com/durch/rust-s3/pull/228 These are not critical for Zenith right now, but let's stay up-to-date.	2021-12-10 14:52:08 +02:00
Heikki Linnakangas	6ecd442fb9	Remove a bunch of unnecessary dependencies.	2021-12-10 14:24:33 +02:00
Heikki Linnakangas	f3f059c1f8	Fix a few cases where request beyond end of rel would error out. Currently, we return an all-zeros page if you request a block beyond end of a relation. That has been implemented in LayeredTimeline::materialize_page, so that if Layer::get_page_reconstruct_data returns Missing, it returns and all-zeros page. However InMemoryLayer and DeltaLayer would return Continue, not Missing, in that case, and materialize_page would try to find the predecessor layer. If there was a preceding image layer, then everything would still work, but if there wasn't, it would return a "could not find predecessor of layer" error. Fix that in InMemoryLayer and DeltaLayer, making them check the size of the relation and return Missing in that case. This is hard to reproduce at the moment, but it happened quickly with pgbench when I modified InMemoryLayer::write_to_disk so that it didn't always create a new ImageLayer.	2021-12-09 17:46:48 +02:00
Dmitry Ivanov	8388e14bbd	[scripts/git-upload] Fix logic of --forbid-overwrite	2021-12-09 14:06:17 +03:00
anastasia	5293e183c5	callmemaybe. review code cleanup	2021-12-09 13:31:49 +03:00
anastasia	93ff5f7ff0	Add default value for safekeeper --recall option. DEFAULT_RECALL_PERIOD is 1 second.	2021-12-09 13:31:49 +03:00
anastasia	41dce68bdd	callmemaybe refactoring - Don't spawn a separate thread for each connection. Instead use one thread per safekeeper, that iterates over all connections and sends callback requests for them. -Use tokio postgres to connect to the pageserver, to avoid spawning a new thread for each connection. callmemaybe review fixes: - Spawn all request_callback tasks separately. - Remember 'last_call_time' and only send request_callback if 'recall_period' has passed. - If task hasn't finished till next recall, abort it and try again. - Add pause/resume CallmeEvents to avoid spamming pageserver when connection already established.	2021-12-09 13:31:49 +03:00
Dmitry Rodionov	7dece8e4a0	skip temporary table files when comparing directories in regress tests	2021-12-09 12:53:26 +03:00
Arseny Sher	37c85d5fd9	Switch safekeeper from log to tracing logging. Add context to wal acceptor and wal sender threads showing timeline id and unique id differentiating them.	2021-12-09 06:57:46 +03:00
nikitashamgunov	6094236171	Update README.md	2021-12-08 11:55:54 -08:00
anastasia	bb5aba42eb	bump vendor/postgres to use correct backpressure commit	2021-12-08 18:57:18 +03:00
Arthur Petukhovsky	450fb9eafe	Don't persist control file without sync (#966 )	2021-12-07 15:02:44 +03:00
Dmitry Rodionov	557e3024cd	Forward pageserver connection string from compute to safekeeper This is needed for implementation of tenant rebalancing. With this change safekeeper becomes aware of which pageserver is supposed to be used for replication from this particular compute.	2021-12-06 21:28:49 +03:00
Arseny Sher	bd34d7ecfc	Bump safekeeper control file version and allow reading the previous one. Should have been a part of `cba4da3f4d` to provide upgrade for previously existing clusters. Separates version independent header (magic + version) out of SafeKeeperState to choose what to deserialize.	2021-12-06 19:47:55 +03:00
Dmitry Ivanov	0a8c672630	[CI] Fix benchmarks Too bad we don't have a --dry-run in PRs :(	2021-12-06 13:52:28 +03:00
Dmitry Ivanov	b87ab17d05	Bump rust version to 1.56.1 Apparently, code coverage doesn't work that well in 1.55.	2021-12-06 13:27:52 +03:00
Dmitry Ivanov	d874675955	Collect coverage in CI	2021-12-06 13:27:52 +03:00
Dmitry Ivanov	5d37560308	Add bespoke glue script leveraging LLVM coverage tools	2021-12-06 13:27:52 +03:00
Dmitry Ivanov	7cec13d1df	Improve shutdown story for code coverage This patch introduces fixes for several problems affecting LLVM-based code coverage: * Daemonizing parent processes should call _exit() to prevent coverage data file corruption (.profraw) due to concurrent writes. Implement proper shutdown handlers in safekeeper.	2021-12-06 13:27:52 +03:00
anastasia	b7685eb6ba	Enable backpressure	2021-12-06 12:49:42 +03:00
anastasia	c7f3b4e62c	Clarify the meaning of StandbyReply LSNs: write_lsn - The last LSN received and processed by pageserver's walreceiver. flush_lsn - same as write_lsn. At pageserver it doesn't guarantees data persistence, but it's fine. We rely on safekeepers. apply_lsn - The LSN at which pageserver guaranteed persistence of all received data (disk_consistent_lsn).	2021-12-06 12:49:42 +03:00
Heikki Linnakangas	5bad2deff8	Don't hold 'timelines' lock over checkpoint. It was very noticeable that you while the checkpointer was busy, you could not e.g. open a new connection.	2021-12-03 07:42:10 -05:00
Arseny Sher	d39608c367	Fix passing start_offset to find_end_of_wal_segment.	2021-12-03 12:43:57 +03:00
Arseny Sher	cba4da3f4d	Add term history to safekeepers. Persist full history of term switches on safekeepers instead of storing only the single term of the highest entry (called epoch). This allows easily and correctly find the divergence point of two logs and truncate the obsolete part before overwriting it with entries of the newer proposer(s). Full history of the proposer is transferred in separate message before proposer starts streaming; it is immediately persisted by safekeeper, though he might not yet have entries for some older terms there. That's because we can't atomically append to WAL and update the control file anyway, so locally available WAL must be taken into account when looking at the history. We should sometimes purge term history entries beyond truncate_lsn; this is not done here. Per https://github.com/zenithdb/rfcs/pull/12 Closes #296. Bumps vendor/postgres.	2021-12-03 12:43:57 +03:00
Dmitry Rodionov	2669d140f8	use full commit sha for version info for builds in docker this is not needed, since environment variable with commit sha already contains full version	2021-12-01 17:35:57 +03:00
Heikki Linnakangas	f49ad33f1b	Initialize 'loaded' correctly in DeltaLayer. While we're at it, reuse the Book and the VirtualFile that's backing it even over unload() calls. Previously, we would keep the Book open, but on load(), we would re-open it anyway, which didn't make much sense. Now we reuse it it. Alternatively, perhaps we should close it on unload() to save some memory, but this I'm not going to think too hard about it right now as the whole load/unload thing is a bit of a hack and needs to be rewritten. This is hard to reproduce ATM, because the incorrect state would get fixed by an unload(). A checkpoint creates the DeltaLayer, and it also calls unload() afterwards, so the window is not very large. I hit it occasionally with a scale 1000 pgbench test, after I had modified InMemoryLayer::write_to_disk() to not write an image layer every time, which made the DeltaLayers be accessed more often.	2021-11-30 22:23:59 +02:00
Kirill Bulatov	670205e17a	Evict excessively failing sync tasks, improve processing for the rest of the tasks	2021-11-30 13:58:49 +02:00
Konstantin Knizhnik	f72d4814b1	Extract page images from FPI WAL records (#949 ) * Extract page images from FPI WAL records * Fix issues reported in review	2021-11-30 12:57:26 +03:00
Heikki Linnakangas	5ecf0664cc	Fix off-by-one error in check for future delta layers. This doesnt show up at the moment, because we never create a delta layer with end-LSN equal to the last LSN. We always create an image layer at that LSN instead. For example, if the latest processed LSN is 100, we would create a delta layer with end LSN 100 (exclusive), and an image layer at 100. But that's just how InMemoryLayer::write_to_disk happens to work at the moment, there's no fundamental reason it needs to always create that image layer. I noticed this bug when I tried to change the logic in InMemoryLayer::write_to_disk to only create an image layer after a few delta layers.	2021-11-29 14:35:24 +02:00
Heikki Linnakangas	7cae265447	Fix dump_layerfile. The VirtualFile machinery panics if it's not initialized	2021-11-29 11:26:54 +02:00
Heikki Linnakangas	5aa969a588	Replace in-memory layers and OOM-triggered eviction with temp files. The "in-memory layer" is misnomer now, each in-memory layer is now actually backed by a file. The files are ephemeral, in that they don't survive page server crash or shutdown. To avoid reading the file for every operation, "ephemeral files" are cached in a page cache. This includes changes from 'inmemory-layer-chunks' branch to serialize / the page versions when they are added to the open layer. The difference is that they are not serialized to the expandable in-memory "chunk buffer", but written out to the file.	2021-11-26 17:25:17 +03:00
Arthur Petukhovsky	93cc40584d	Shutdown socket on CopyFail (#938 ) Fixes #935	2021-11-26 16:48:27 +03:00
Dmitry Rodionov	130184fee9	Prohibit branch creation and basebackup at out of scope lsns Out of scope LSNs include pre initdb LSNs, and LSNs prior to latest_gc_cutoff. To get there there was also two cleanups: * Fix error handling in Execute message handler. This fixes behaviour when basebackup retured an error. Previously pageserver thread just died. * Remove "ancestor" file which previously contained ancestor id and branch lsn. Currently the same data can be obtained from metadata file. And just the way we handled ancestor file in the code introduced the case when branching fails timeline directory is created but there is no data in it except ancestor file. And this confused gc because it scans directories. So it is better to just remove ancestor file and clean up this timeline directory creation so it happens after all validity checks have passed	2021-11-25 15:27:16 +03:00
Heikki Linnakangas	d47f610606	Fix pageserver CLI parameter names and document them	2021-11-25 13:31:52 +02:00
Dmitry Rodionov	0650e51f0b	add test one more case for layer visibility	2021-11-22 11:39:20 +03:00
Dmitry Rodionov	737a557f09	add check to python tests that afteer gc number of rows is unchanged in all branches	2021-11-22 11:39:20 +03:00
Dmitry Rodionov	6f7ebe6e01	preserve data in parent branch that might be referenced in child branch	2021-11-22 11:39:20 +03:00
Dmitry Rodionov	70ab0d5b1f	add missing script	2021-11-19 00:10:40 +03:00
Dmitry Rodionov	6ac76248cf	Save performance test results from perfirmance test suit runs. Also render reports for both staging and local runs.	2021-11-19 00:00:19 +03:00
Kirill Bulatov	b32da3b42e	Use less pageserver-specific method in RemoteStorage trait	2021-11-18 22:53:40 +02:00
Dmitry Ivanov	0ccfc62e88	[proxy] Pass PostgreSQL version to client Fixes #779	2021-11-17 16:28:44 +03:00
Dmitry Ivanov	b55cf773a8	[proxy] Streamline control- and dataflow	2021-11-17 16:28:44 +03:00
Dmitry Ivanov	43ded1c54b	[proxy] Minor cleanup	2021-11-17 16:28:44 +03:00

1 2 3 4 5 ...

1154 Commits