rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-07 05:22:56 +00:00

Author	SHA1	Message	Date
Dmitry Rodionov	557e3024cd	Forward pageserver connection string from compute to safekeeper This is needed for implementation of tenant rebalancing. With this change safekeeper becomes aware of which pageserver is supposed to be used for replication from this particular compute.	2021-12-06 21:28:49 +03:00
Arseny Sher	bd34d7ecfc	Bump safekeeper control file version and allow reading the previous one. Should have been a part of `cba4da3f4d` to provide upgrade for previously existing clusters. Separates version independent header (magic + version) out of SafeKeeperState to choose what to deserialize.	2021-12-06 19:47:55 +03:00
Dmitry Ivanov	7cec13d1df	Improve shutdown story for code coverage This patch introduces fixes for several problems affecting LLVM-based code coverage: * Daemonizing parent processes should call _exit() to prevent coverage data file corruption (.profraw) due to concurrent writes. Implement proper shutdown handlers in safekeeper.	2021-12-06 13:27:52 +03:00
anastasia	c7f3b4e62c	Clarify the meaning of StandbyReply LSNs: write_lsn - The last LSN received and processed by pageserver's walreceiver. flush_lsn - same as write_lsn. At pageserver it doesn't guarantee data persistence, but it's fine. We rely on safekeepers. apply_lsn - The LSN at which pageserver guaranteed persistence of all received data (disk_consistent_lsn).	2021-12-06 12:49:42 +03:00
Arseny Sher	cba4da3f4d	Add term history to safekeepers. Persist full history of term switches on safekeepers instead of storing only the single term of the highest entry (called epoch). This allows us to easily and correctly find the divergence point of two logs and truncate the obsolete part before overwriting it with entries of the newer proposer(s). Full history of the proposer is transferred in a separate message before the proposer starts streaming; it is immediately persisted by the safekeeper, though it might not yet have entries for some older terms there. That's because we can't atomically append to WAL and update the control file anyway, so locally available WAL must be taken into account when looking at the history. We should sometimes purge term history entries beyond truncate_lsn; this is not done here. Per https://github.com/zenithdb/rfcs/pull/12 Closes #296. Bumps vendor/postgres.	2021-12-03 12:43:57 +03:00
Arthur Petukhovsky	93cc40584d	Shutdown socket on CopyFail (#938) Fixes #935	2021-11-26 16:48:27 +03:00
Arseny Sher	e7ca8ef5a8	Use PG timelineid 1 everywhere. As changing it doesn't have useful meaning in Zenith. ref #824	2021-11-11 13:53:39 +03:00
Arseny Sher	5603259c53	In wal_proposer_recovery, don't wait for outgoing WAL to be committed. Otherwise we're deadlocking ourselves. Oversight of `33007cc`.	2021-11-10 01:38:25 +03:00
Arseny Sher	ce15c62f35	Fix 'send WAL up to' debug logging.	2021-11-10 01:38:25 +03:00
Dmitry Rodionov	07dddfed28	Use more robust way to persist safekeeper control file. The safekeeper control file is now updated as follows: 1. Write data to a temporary file 2. Fsync the temporary file (if sync option is specified) 3. Rename the temporary file to the actual control file 4. Fsync the containing directory (if sync option is specified) 5. Fsync the file after rename (if sync option is specified). Note that action 5 is not mentioned anywhere as required, but it is done in postgres this way (see durable_rename). Also, because of the rename machinery, switch to a dedicated lock file to prevent running several safekeepers concurrently on the same data. Cleanup: fsync control file after rename to match postgres behaviour.	2021-11-09 17:51:46 +03:00
Egor Suvorov	33007cc0bb	Safekeeper's `START_REPLICATION` handler: remove `stop_point`, do not handle `start_point == 0` (#777)	2021-11-04 14:50:33 +03:00
Dmitry Rodionov	987833e0b9	Propagate git SHA to zenith binaries Git commit sha is displayed when --version flag is used and is written to logs during service startup. Uses git_version crate when git is available, and GIT_VERSION environment variable otherwise which is the case for docker builds.	2021-11-04 14:22:29 +03:00
anastasia	85f8bf97f5	Name walkeeper threads to make debugging more convenient	2021-11-01 19:09:57 +03:00
Patrick Insinger	b532470792	Set SO_REUSEADDR for all TCP listeners	2021-10-29 12:45:26 -07:00
Egor Suvorov	7e552b645f	Add disk write/sync metrics to Safekeeper (#745)	2021-10-28 18:38:36 +03:00
Heikki Linnakangas	8c42dcc041	Fix safekeeper -D option. The -D option to specify working directory was broken: $ mkdir foobar $ ./target/debug/safekeeper -D foobar Error: failed to open "foobar/safekeeper.log" Caused by: No such file or directory (os error 2) This was because we both chdir'd into the specified directory, and also prepended the directory to all the paths. So in the above example, it actually tried to create the log file in "foobar/foobar/safekeeper.log". Change it to work the same way as in the pageserver: chdir to the specified directory, and leave 'workdir' always set to ".". We wouldn't necessarily need the 'workdir' variable in the config at all, and could assume that the current working directory is always the safekeeper data directory, but I'd like to keep this consistent with the pageserver. The page server doesn't assume that for the sake of unit tests. We don't currently have unit tests in the safekeeper that write to disk but we might want to in the future.	2021-10-22 08:39:58 +03:00
Egor Suvorov	c058d04250	Rename WalAcceptor to Safekeeper in most places (#741)	2021-10-21 18:26:43 +03:00
Konstantin Knizhnik	c310932121	Implement backpressure for compute node to avoid WAL overflow Co-authored-by: Arseny Sher <sher-ars@yandex.ru> Co-authored-by: Alexey Kondratov <kondratov.aleksey@gmail.com>	2021-10-21 18:15:50 +03:00
Arthur Petukhovsky	13f4e173c9	Wait for safekeepers to catch up in test_restarts_under_load (#776)	2021-10-20 14:42:53 +03:00
Arseny Sher	de744a44dd	Add /timeline http request to safekeeper returning its status. Which is mainly generational state (terms) and useful LSNs. Also add /status basic healthcheck request which is now used in tests to determine the safekeeper is up; this fixes #726. ref #115	2021-10-14 19:02:38 +03:00
Egor Suvorov	6b6b3f68be	Safekeeper metrics refactor (#747)	2021-10-13 16:28:24 +03:00
Arseny Sher	96f1175a80	Cleanup hardcoded oids.	2021-10-13 10:52:47 +03:00
Egor Suvorov	23f4c0a742	Rename `wal_acceptor` binary to `safekeeper` (#740), stage 1/2 * Rename wal_acceptor binary to safekeeper * Rename wal_acceptor.pid and wal_acceptor.log to safekeeper.pid and safekeeper.log * Change some mentions of WAL acceptor to safekeeper * Dockerfile: alias wal_acceptor to safekeeper temporarily until internal scripts are updated	2021-10-12 22:03:06 +03:00
Egor Suvorov	f3445949d1	Wal acceptor: report socket bind errors better when daemonizing (#738) Fixes #664	2021-10-12 16:51:28 +03:00
Arseny Sher	8c61c3e54e	Minor safekeeper readme fix.	2021-10-11 16:31:44 +03:00
anastasia	d7c9dd06f4	Implement graceful shutdown at 'pageserver stop': - perform checkpoint for each tenant repository. - wait for the completion of all threads. Add new option 'immediate' to 'pageserver stop' command to terminate the pageserver immediately.	2021-10-11 13:35:01 +03:00
Heikki Linnakangas	7216f22609	Use tracing crate to have more context in log messages. Whenever we start processing a request, we now enter a tracing "span" that includes context information like the tenant and timeline ID, and the operation we're performing. That context information gets attached to every log message we create within the span. That way, we don't need to include basic context information like that in every log message, and it also becomes easier to filter the logs programmatically. This removes the explicit timeline and tenant IDs from most log messages, as you get that information from the enclosing span now. Also improve log messages in general, dialing down the level of some messages that are not very useful, and adding information to others. We now obey the RUST_LOG env variable, if it's set. The 'tracing' crate allows for different log formatters, like JSON or bunyan output. The one we use now is human-readable multi-line format, which is nice when reading the log directly, but hard for post-processing. For production, we'll probably want JSON output and some tools for working with it, but that's left as a TODO. The log format is easy to change.	2021-10-11 08:59:06 +03:00
Egor Suvorov	403d9779d9	safekeeper: add initial metrics and HTTP handler (#699, #541) * `wal_acceptor`: add HTTP handler, /metrics endpoint only, no authentication * Two gauges are currently reported: `flush_lsn` and `commit_lsn` * Add `DEFAULT_PG_LISTEN_PORT` and `DEFAULT_PG_LISTEN_PORT` consts for uniformity	2021-10-08 18:55:41 +03:00
Egor Suvorov	530d3eaf09	Add more details to pageserver and safekeeper docs (#680)	2021-10-05 19:10:50 +03:00
Arseny Sher	adbae62281	Rename SharedState.commit_lsn to notified_commit_lsn. ref #682	2021-09-30 17:29:15 +03:00
Egor Suvorov	3127a4a13b	Safekeeper::Storage::write_wal: clarify behavior (#679) It previously took &SafeKeeperState similar to persist(), but only for its `server` member. Now it takes &ServerInfo only, so it's clear the state is not persisted. Also added a comment about sync.	2021-09-29 19:58:30 +03:00
Arthur Petukhovsky	d6fc74a412	Various fixes for test_sync_safekeepers (#668) * Send ProposerGreeting manually in tests * Move test_sync_safekeepers to test_wal_acceptor.py * Capture test_sync_safekeepers output * Add comment for handle_json_ctrl * Save captured output in CI	2021-09-28 19:25:05 +03:00
sharnoff	a72707b8cb	Redo #655 with fix: Allow `LeSer`/`BeSer` impls missing either `Serialize` or `Deserialize` Commit message copied below: * Allow LeSer/BeSer impls missing Serialize/Deserialize Currently, using `LeSer` or `BeSer` requires that the type implements both `Serialize` and `DeserializeOwned`, even if we're only using the trait for one of those functionalities. Moving the bounds to the methods gives the convenience of the traits without requiring unnecessary derives. * Remove unused #[derive(Serialize/Deserialize)] This should hopefully reduce compile times - if only by a little bit. Some of these were already unused (we weren't using LeSer/BeSer for the types), but most have become unused with the change to LeSer/BeSer.	2021-09-24 10:58:01 -07:00
Max Sharnoff	0f770967b4	Revert "Allow `LeSer`/`BeSer` impls missing either `Serialize` or `Deserialize` (#655)" This reverts commit `bd9f4794d9`.	2021-09-24 10:18:36 -07:00
Max Sharnoff	bd9f4794d9	Allow `LeSer`/`BeSer` impls missing either `Serialize` or `Deserialize` (#655) * Allow LeSer/BeSer impls missing Serialize/Deserialize Currently, using `LeSer` or `BeSer` requires that the type implements both `Serialize` and `DeserializeOwned`, even if we're only using the trait for one of those functionalities. Moving the bounds to the methods gives the convenience of the traits without requiring unnecessary derives. * Remove unused #[derive(Serialize/Deserialize)] This should hopefully reduce compile times - if only by a little bit. Some of these were already unused (we weren't using LeSer/BeSer for the types), but most have become unused with the change to LeSer/BeSer.	2021-09-24 10:06:03 -07:00
Arthur Petukhovsky	d4e037f1e7	Support for `--sync-safekeepers` in tests (#647) A new command has been added to append specially crafted records to the safekeeper WAL. This command takes JSON to append, encodes a LogicalMessage based on the JSON fields, and processes a new AppendRequest to append and commit WAL in the safekeeper. The Python test starts up walkeepers and creates a config for walproposer, then appends WAL and checks that --sync-safekeepers works without errors. This is the simplest test; more useful test cases (like in #545) for different setups will be added soon.	2021-09-24 13:19:59 +03:00
Max Sharnoff	bbe4f39790	walkeeper: Add parsing check for hot standby tag (#597)	2021-09-16 09:04:35 -07:00
Kirill Bulatov	7dda9f2894	Fix clippy lints and enable clippy checking in CI	2021-09-16 15:09:16 +03:00
Kirill Bulatov	3ab60ce76f	Unify tokio deps and bump cargo resolver version	2021-09-15 16:00:08 +03:00
Max Sharnoff	a2498f3e67	Improve walkeeper replication error messages & context (#585)	2021-09-14 11:59:14 -07:00
Dmitry Rodionov	84008a2560	factor out common logging initialisation routine This contains a lowest common denominator of pageserver and safekeeper log initialisation routines. It uses daemonize flag to decide where to stream log messages. In case daemonize is true log messages are forwarded to file. Otherwise streaming to stdout is used. Usage of stdout for log output is the default in docker side of things, so make it easier to browse our logs via builtin docker commands.	2021-09-14 18:09:14 +03:00
Arseny Sher	a68c23448a	Skip the bootstrap hole in safekeeper's find_end_of_wal. Otherwise restart of safekeeper before the first segment is filled makes it report 0 as flushed LSN. To this end, tweak find_end_of_wal_segment to allow starting from given LSN, not only from the start of the segment. While here, make it less panicky.	2021-09-13 22:46:04 +03:00
Arseny Sher	0aec60938a	Make flush_lsn reported by safekeepers point to record boundary. Otherwise we produce corrupted record holes in WAL during compute node restart in case there was an unfinished record from the old compute, as these reports advance commit_lsn -- reliably persisted part of WAL. ref #549. Mostly by @knizhnik. I adjusted to make sure proposer always starts streaming since record beginning so we don't need special quirks for decoding in safekeeper.	2021-09-11 06:10:10 +03:00
Arseny Sher	d1f0b1eda4	Adapt safekeepers to --sync-safekeepers walproposer mode. 1) Do epoch switch without record from new epoch, immediately after recovery -- --sync-safekeepers mode doesn't generate new records. 2) Fix commit_lsn advancement by taking into account wal we have locally -- setting it further is incorrect. 3) Report it back to walproposer so he knows when sync is done. 4) Remove system id check as it is unknown in sync mode. And make logging slightly better. ref #439	2021-09-06 13:06:20 +03:00
Dmitry Rodionov	bc709561b6	fix clippy warnings	2021-09-02 18:54:44 +03:00
Dmitry Rodionov	3c5452da88	add tenant id tracking to safekeeper Previously timelines were namespaced only by ZTimelineId, so this patch adds ZTenant id to the key of a hashtable closes #381	2021-09-02 12:57:39 +03:00
Patrick Insinger	5ac3cb1c72	TLS for postgres_backend and proxy Add TLS support to `postgres_backend`. Implement this support in `proxy`. Other applications must opt-in and provide a `rustls::ServerConfig`.	2021-09-01 10:29:19 -07:00
Arseny Sher	7474cfac08	Rename VCL to epochStartLsn and restart_lsn to truncate_lsn. epochStartLsn is the LSN from which the new proposer writes its WAL in its epoch; let's be more explicit here. truncate_lsn is the LSN still needed by the most lagging safekeeper. restart_lsn is terminology from pg_replication_slots, but here we don't really have a 'restart'; hopefully the word 'truncate' makes it clearer.	2021-08-27 15:22:10 +03:00
Arseny Sher	6cbc08f1fb	bump pg version	2021-08-27 15:22:10 +03:00
Arseny Sher	8d3450f4c6	Basic safekeeper refactoring and bug fixing. 1) Extract consensus logic to safekeeper.rs. 2) Change the voting flow so that acceptor tells his epoch along with giving the vote, not before it; otherwise it might get immediately stale. #294 3) Process messages from compute atomically and sync state properly. #270 4) Use separate structs for disk and network. ref #315	2021-08-27 15:22:10 +03:00
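
Commit 07dddfed28 above describes a five-step sequence for persisting the safekeeper control file durably. The following is a minimal sketch of that sequence using only the Rust standard library; it is not the safekeeper's actual code. The function name `persist_control_file`, the `.partial` suffix for the temporary file, and the `sync` parameter are hypothetical names chosen for illustration, and the directory fsync is assumed to be meaningful only on Unix-like systems.

```rust
use std::fs::{self, File, OpenOptions};
use std::io::{self, Write};
use std::path::Path;

/// Illustrative only: atomically replace `dir/name` with `data`.
/// All names here are assumptions, not taken from the repository.
pub fn persist_control_file(dir: &Path, name: &str, data: &[u8], sync: bool) -> io::Result<()> {
    let final_path = dir.join(name);
    // The temporary file lives in the same directory so that the rename
    // stays within one filesystem.
    let tmp_path = dir.join(format!("{}.partial", name));

    // 1. Write data to the temporary file.
    let mut tmp = File::create(&tmp_path)?;
    tmp.write_all(data)?;

    // 2. Fsync the temporary file (if the sync option is specified).
    if sync {
        tmp.sync_all()?;
    }
    drop(tmp);

    // 3. Rename the temporary file to the actual control file.
    fs::rename(&tmp_path, &final_path)?;

    if sync {
        // 4. Fsync the containing directory so the rename itself is durable.
        //    Opening a directory as a file is assumed to work on Unix only.
        if cfg!(unix) {
            File::open(dir)?.sync_all()?;
        }
        // 5. Fsync the file after the rename, mirroring what the commit
        //    message says postgres does in durable_rename.
        OpenOptions::new().write(true).open(&final_path)?.sync_all()?;
    }
    Ok(())
}
```

With this ordering a reader of the control file should see either the old contents or the new contents, never a partially written file, because the rename in step 3 replaces the file in one operation.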

148 Commits