rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-07-03 20:20:38 +00:00

Author	SHA1	Message	Date
Konstantin Knizhnik	815528e0ce	Use last record LSN as flush position reported by safekeepers to walproposer to prevent moving VCL backward on compute node restart	2021-08-26 18:08:29 +03:00
Konstantin Knizhnik	a2e135b404	Maintain safe LSN position at safekeepers	2021-08-25 10:24:45 +03:00
Stas Kelvich	72de70a8cc	Change test_restart_compute to expose safekeeper problems	2021-08-25 00:42:08 +03:00
Konstantin Knizhnik	3ca4b638ac	Merge with main	2021-08-20 10:55:34 +03:00
Konstantin Knizhnik	ead94feb05	[refer #439 ] Correctly handle LSN parameter in BASEBACKUP command	2021-08-19 19:53:22 +03:00
Max Sharnoff	39bb6fb19c	Marginally improve walkeeper error visibility (#440 ) Adds a warning if a postgres query fails, and some additional context to errors generated inside `ReceiveWalConn::run`	2021-08-19 08:46:18 -07:00
Heikki Linnakangas	2450f82de5	Introduce a new "layered" repository implementation. This replaces the RocksDB based implementation with an approach using "snapshot files" on disk, and in-memory btreemaps to hold the recent changes. This make the repository implementation a configuration option. You can choose 'layered' or 'rocksdb' with "zenith init --repository-format=<format>" The unit tests have been refactored to exercise both implementations. 'layered' is now the default. Push/pull is not implemented. The 'test_history_inmemory' test has been commented out accordingly. It's not clear how we will implement that functionality; probably by copying the snapshot files directly.	2021-08-16 10:06:48 +03:00
Max Sharnoff	5eb1738e8b	Rework walkeeper protocol to use libpq (#366 ) Most of the work here was done on the postgres side. There's more information in the commit message there. (see: `04cfa326a5`) On the WAL acceptor side, we're now expecting 'START_WAL_PUSH' to initialize the WAL keeper protocol. Everything else is mostly the same, with the only real difference being that protocol messages are now discrete CopyData messages sent over the postgres protocol. For the sake of documentation, the full set of these messages is: <- recv: START_WAL_PUSH query <- recv: server info from postgres (type `ServerInfo`) -> send: walkeeper info (type `SafeKeeperInfo`) <- recv: vote info (type `RequestVote`) if node id mismatch: -> send: self node id (type `NodeId`); exit -> send: confirm vote (with node id) (type `NodeId`) loop: <- recv: info and maybe WAL block (type `SafeKeeperRequest` + bytes) (break loop if done) -> send: confirm receipt (type `SafeKeeperResponse`)	2021-08-13 11:25:16 -07:00
Heikki Linnakangas	70cb399d59	Add convenience function to create a RowDescriptor message for an int8 col. Makes the code to construct a result set a bit more terse and readable.	2021-08-11 20:17:33 +03:00
Dmitry Rodionov	ce5333656f	Introduce authentication v0.1. Current state with authentication. Page server validates JWT token passed as a password during connection phase and later when performing an action such as create branch tenant parameter of an operation is validated to match one submitted in token. To allow access from console there is dedicated scope: PageServerApi, this scope allows access to all tenants. See code for access validation in: PageServerHandler::check_permission. Because we are in progress of refactoring of communication layer involving wal proposer protocol, and safekeeper<->pageserver. Safekeeper now doesn’t check token passed from compute, and uses “hardcoded” token passed via environment variable to communicate with pageserver. Compute postgres now takes token from environment variable and passes it as a password field in pageserver connection. It is not passed through settings because then user will be able to retrieve it using pg_settings or SHOW .. I’ve added basic test in test_auth.py. Probably after we add authentication to remaining network paths we should enable it by default and switch all existing tests to use it.	2021-08-11 20:05:54 +03:00
anastasia	5dd9a66f9e	Move postgres backend messages to trace level	2021-08-10 14:26:28 +03:00
Stas Kelvich	fa04096733	cargo fmt pass	2021-08-04 23:51:02 +03:00
Dmitry Ivanov	754892402c	Enable full feature set for hyper in zenith_utils Server functionality requires not only the "server" feature flag, but also either "http1" or "http2" (or both). To make things simpler (and prevent analogous problems), enable all features.	2021-08-04 21:41:17 +03:00
Dmitry Ivanov	ed634ec320	Extract message processing function from PostgresBackend's event loop This patch has been extracted from #348, where it became unnecessary after we had decided that we didn't want to measure anything inside PostgresBackend. IMO the change is good enough to make its way into the codebase, even though it brings nothing "new" to the code.	2021-08-04 10:49:02 +03:00
Dmitry Ivanov	cb1b4a12a6	Add some prometheus metrics to pageserver The metrics are served by an http endpoint, which is meant to be spawned in a new thread. In the future the endpoint will provide more APIs, but for the time being, we won't bother with proper routing.	2021-08-03 21:42:24 +03:00
Max Sharnoff	3f4815efa2	Correct `LeSer` doc: "Big Endian" -> "Little Endian" (#362 )	2021-07-23 12:38:37 -07:00
Dmitry Ivanov	8b656bad5f	Add a missing [cfg(test)] We don't always need to compile tests.	2021-07-22 16:46:27 +03:00
Dmitry Ivanov	6a3b9b1d46	Fix accidental busyloop in walkeeper's background thread It used to be the case that walkeeper's background thread failed to recognize the end of stream (EOF) signaled by the `Ok(None)` result of `FeMessage::read`.	2021-07-22 12:12:55 +03:00
Stas Kelvich	79d9314ba6	terminate socket explicitly	2021-07-19 14:52:41 +03:00
Stas Kelvich	2b33894e7b	few more review fixes	2021-07-19 14:52:41 +03:00
Stas Kelvich	a118557331	review fixes	2021-07-19 14:52:41 +03:00
Stas Kelvich	1b6d99db7c	unfreeze client session upon callback	2021-07-19 14:52:41 +03:00
Stas Kelvich	605b90c6c7	do an actual proxy pass	2021-07-19 14:52:41 +03:00
Stas Kelvich	dab34c3dd6	distinguish between new and old users	2021-07-19 14:52:41 +03:00
Stas Kelvich	bf45bef284	md5 auth for postgres_backend.rs	2021-07-19 14:52:41 +03:00
Heikki Linnakangas	befefe8d84	Run 'cargo fmt'. Fixes a few formatting discrepancies had crept in recently.	2021-07-14 22:03:14 +03:00
Konstantin Knizhnik	ad92b66eed	Fix TimestampTz type to i64 to be compatbile with Postgres	2021-07-14 15:55:12 +03:00
Dmitry Rodionov	75e717fe86	allow both domains and ip addresses in connection options for pageserver and wal keeper. Also updated PageServerNode definition in control plane to account for that. resolves #303	2021-07-09 16:46:21 +03:00
Eric Seppanen	d2d5a01522	minor clippy fixes	2021-06-15 10:52:11 -07:00
Arseny Sher	b2f51026aa	Consolidate PG proto parsing-deparsing and backend code. Now postgres_backend communicates with the client, passing queries to the provided handler; we have two currently, for wal_acceptor and pageserver. Now BytesMut is again used for writing data to avoid manual message length calculation. ref #118	2021-06-08 17:31:40 +03:00
Konstantin Knizhnik	874d82fd4c	Fix tests in lsn.rs after changing wal_seg_size type	2021-05-20 14:45:09 +03:00
Konstantin Knizhnik	06f96f9600	Do not transfer WAL to computation nodes: use pg_resetwal for node startup	2021-05-20 14:13:47 +03:00
Eric Seppanen	1ec157653e	bin_ser: expand serialize error type, add serialized_size	2021-05-19 14:36:41 -07:00
Eric Seppanen	858ca3a4ce	bin_ser: simplify ser_into_slice The conversion of &mut [u8] into Write is a little tricky. Also, remove an unused generic parameter.	2021-05-19 14:36:41 -07:00
Eric Seppanen	d744ddee7c	bin_ser: preserve IO errors on deserialization We're starting to deserialize directly from the TcpStream now, which means that a socket error gets logged as "deserialize error". That's not very helpful; preserve the io::Error so it can be logged.	2021-05-19 14:36:41 -07:00
Eric Seppanen	513696a485	break wal_service into multiple pieces The pieces are: base Connection SendWal ReplicationHandler There are lots of other changes here: - Put the replication reader in a background thread; this gets rid of some hacks with nonblocking mode. - Stop manually buffering input data; use BufReader instead. - Use BytesMut a lot less; use Read/Write traits where possible.	2021-05-19 14:36:41 -07:00
Eric Seppanen	71e93faed7	fix endian typos in BeSer Cut/paste error: BeSer was using the little-endian config in two places. Add better unit tests so this can't happen again.	2021-05-13 19:04:17 -07:00
Eric Seppanen	e5df42feef	add workspace_hack dependency to zenith_utils I didn't think this mattered, but it does: if you add a dependency to zenith_utils, but forget to request a feature you need, the crate will build from the workspace root, but not by itself. It's probably better to pull in the whole dependency tree. This leaves one problem unsolved: the missing feature above will now be a latent bug. If that feature gets removed later by other crates, and then the workspace_hack Cargo.toml is updated, this missing feature will become a build failure.	2021-05-10 18:21:45 -07:00
Eric Seppanen	60d66267a9	add serde support to Lsn type A serialized Lsn and a serialized u64 should be identical.	2021-05-10 16:21:05 -07:00
Eric Seppanen	36c12247b9	add bin_ser module This module adds two traits that implement bincode-based serialization. BeSer implements methods for big-endian encoding/decoding. LeSer implements methods for little-endian encoding/decoding. Right now, the BeSer and LeSer methods have the same names, meaning you can't `use` them both at the same time. This is intended to be a safety mechanism: mixing big-endian and little-endian encoding in the same file is error-prone. There are ways around this, but the easiest fix is to put the big-endian code and little-endian code in different files or submodules.	2021-05-10 16:21:05 -07:00
anastasia	1591f058c6	implement Debug for Lsn type	2021-05-05 16:38:32 +03:00
Heikki Linnakangas	96beffb3c5	Add tests for the `Lsn::fetch_max` function.	2021-04-27 13:43:39 +03:00
Heikki Linnakangas	cff671c1bd	Remove duplicated LSN fields from the page cache. Having multiple copies of the same values is a source of confusion. Commit `da9bf5dc63` fixed one race condition caused by that, for example. See also discussion at https://github.com/zenithdb/zenith/issues/57#issuecomment-824393470 This changes SeqWait.advance() to return the old number, and not panic if you try to move the value backwards. The caller should check for that and act accordingly.	2021-04-27 10:32:39 +03:00
Konstantin Knizhnik	3b09a74f58	Implement offloading of old WAL files to S3 in walkeeper	2021-04-26 16:23:00 +03:00
Heikki Linnakangas	bc652e965e	Save old 'async' version of SeqWait, in case we need it later. It is currently unused, and is not built as part of 'cargo build', but seems like a shame to throw it away completely.	2021-04-26 13:30:10 +03:00
Heikki Linnakangas	3b9e7fc5e6	Use explicit threads. Remove 'async' usage a much as feasible. Async code is harder to debug, and mixing async and non-async code is a recipe for confusion and bugs. There are a couple of exceptions: - The code in walredo.rs, which needs to read and write to the child process simultaneously, still uses async. It's more convenient there. The 'async' usage is carefully limited to just the functions that communicate with the child process. - Code in walreceiver.rs that uses tokio-postgres to do streaming replication. We have to use async there, because tokio-postgres is async. Most rust-postgres functionality has non-async wrappers, but not the new replication client code. The async usage is very limited here, too: we use just block_on to call the tokio-postgres functions. The code in 'page_service.rs' now launches a dedicated thread for each connection. This replaces tokio::sync:⌚:channel with std::sync:mpsc in 'seqwait.rs', to make that non-async. It's not a drop-in replacement, though: std::sync::mpsc doesn't support multiple consumers, so we cannot share a channel between multiple waiters. So this removes the code to check if an existing channel can be reused, and creates a new one for each waiter. That created another problem: BTreeMap cannot hold duplicates, so I replaced that with BinaryHeap. Similarly, the tokio::{mpsc, oneshot} channels used between WAL redo manager and PageCache are replaced with std::sync::mpsc. (There is no separate 'oneshot' channel in the standard library.) Fixes github issue #58, and coincidentally also issue #66.	2021-04-26 13:07:51 +03:00
Eric Seppanen	96b6f350a7	add test cases for Lsn math and AtomicLsn	2021-04-25 19:37:02 -07:00
Eric Seppanen	648755a25e	add Lsn::block_offset, remaining_in_block, calc_padding Replace open-coded math with member fns.	2021-04-25 19:37:02 -07:00
Eric Seppanen	07d0241076	add AtomicLsn AtomicLsn is a wrapper around AtomicU64 that has load() and store() members that are cheap (on x86, anyway) and can be safely used in any context. This commit uses AtomicLsn in the page cache, and fixes up some downstream code that manually implemented LSN formatting. There's also a bugfix to the logging in wait_lsn, which prints the wrong lsn value.	2021-04-25 19:37:02 -07:00
Eric Seppanen	d760446053	remove Lsn::sub in favor of sub_checked There is only one place doing subtraction, and it had a manually implemented check.	2021-04-25 19:37:02 -07:00

1 2

58 Commits