rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2025-12-26 07:39:58 +00:00

Author	SHA1	Message	Date
Konstantin Knizhnik	86056abd0e	Fix merge conflict: set initial WAL position to second segment because of pg_resetwal	2021-05-20 15:26:39 +03:00
Konstantin Knizhnik	3645133700	Fix conflicts with main branch	2021-05-20 14:39:27 +03:00
Eric Seppanen	4c35b22626	Remove FIXME about buffer pools If I'm not going to do anything about it soon, it's not worth keeping this comment.	2021-05-19 14:36:41 -07:00
Eric Seppanen	9fe3b73e13	walkeeper replication: remove the lock from the send stream. I originally thought there would be multiple threads sending here, but that's not currently the case, so remove the lock.	2021-05-19 14:36:41 -07:00
Eric Seppanen	e0146304e6	timeline: make SharedState and some constructors private This was pointed out in code review: no need for these to be public.	2021-05-19 14:36:41 -07:00
Eric Seppanen	fbb04c592a	wal_service: change error message at thread exit Because many errors are propagated to this point, use a better message than "socket error".	2021-05-19 14:36:41 -07:00
Eric Seppanen	8f43d7637c	wal_service: move code around some more Move ReceiveWalConn into its own file. Shuffle constants around so they are close to the protocol they're associated with, or move them into postgres_ffi if they seem to be global constants.	2021-05-19 14:36:41 -07:00
Eric Seppanen	cf30303d8f	extract protocol peek code; rename Connection -> ReceiveWalConn It may be more robust to use the TcpStream::peek function, so do all protocol peeking before creating the protocol object. This reveals the next cleanup step: rename Connection, since it's no longer the parent of SendWalConn. Now we peek at the first bytes and choose which kind of connection object to create.	2021-05-19 14:36:41 -07:00
Eric Seppanen	3296b7d770	wal_service: permit I/O errors while reading control file I'm not sure why ignoring this error is a good idea, but the test_embedded_wal_proposer test fails if we propagate the error upward.	2021-05-19 14:36:41 -07:00
Eric Seppanen	2148ae78ab	wal_service: remove manual output buffering Serialize objects directly to the stream. This allows us to remove a bunch of buffer management code, along with the NewSerializer trait that was a temporary bridge between the old code and the new.	2021-05-19 14:36:41 -07:00
Eric Seppanen	78dcf2207e	replace manual deserialization with serde + BeSer This struct is a little awkward, because in other places it is serialized/deserialized as little-endian, but here it's big-endian.	2021-05-19 14:36:41 -07:00
Eric Seppanen	74b78608d9	split timeline code into a separate file	2021-05-19 14:36:41 -07:00
Eric Seppanen	a11558b84f	break wal_service into multiple files + misc cleanups	2021-05-19 14:36:41 -07:00
Eric Seppanen	513696a485	break wal_service into multiple pieces The pieces are: base Connection SendWal ReplicationHandler There are lots of other changes here: - Put the replication reader in a background thread; this gets rid of some hacks with nonblocking mode. - Stop manually buffering input data; use BufReader instead. - Use BytesMut a lot less; use Read/Write traits where possible.	2021-05-19 14:36:41 -07:00
Eric Seppanen	cedc2eb5c2	wal_service: add BufReader If we try to read a few bytes at a time, we will perform a lot more syscalls than necessary. Wrap the socket in a BufReader, which will buffer bytes as needed.	2021-05-19 14:36:41 -07:00
Stas Kelvich	746f667311	Refactor CLI and CLI<->pageserver interfaces to support remote pageserver This patch started as an effort to support CLI working against remote pageserver, but turned into a pretty big refactoring. * CLI now does not look into repository files directly. New commands 'branch_create' and 'identify_system' were introduced into page_service to support that. * Branch management that was scattered between local_env and zenith/main.rs is moved into pageserver/branches.rs. That code could better fit in Repository/Timeline impl, but I'll leave that for a different patch. * All tests-related code from local_env went into integration_tests/src/lib.rs as an extension to PostgresNode trait. * Paths-generating functions were concentrated around corresponding config types (LocalEnv and PageserverConf).	2021-05-17 19:17:51 +03:00
Stas Kelvich	d737c40eec	copy safekeeper README from older C version to current Rust version	2021-05-17 11:43:18 +03:00
Eric Seppanen	6ff3f1b9fd	don't open log files multiple times Multiple fds writing to the same file doesn't work. One fd will overwrite the output of the other fd. We were opening log files three times (stdout, stderr, and slog). The symptoms can be seen when the program panics; the final file will have truncated or lost messages. After this change, all messages are preserved. If panicking and logging are concurrent (and they definitely can be), some of the messages may be interleaved in slightly inconvenient ways. File::try_clone() is essentially `dup` underneath, meaning the two will share the same file offset.	2021-05-13 00:32:39 -07:00
Eric Seppanen	73647e5715	wal_service: fix NodeId order/endian issues Add fixes suggested in code review. In a previous commit, I changed the NodeId field order and types to try to preserve the exact serialization that was happening. Unfortunately, that serialization was incorrect and the original struct was mostly correct. Change uuid to be a [u8; 16] as it was intended to be a byte array; that will clearly indicate to serde serializers that no endian swaps will ever be needed.	2021-05-10 16:21:05 -07:00
Eric Seppanen	95db33f3f9	wal_service: comment cleanup	2021-05-10 16:21:05 -07:00
Eric Seppanen	bace19ffbe	wal_service: switch to Lsn type Replace XLogRecPtr with Lsn in wal_service.rs . This removes the last use of XLogSegmentOffset and XLByteToSeg, so delete them. (replaced by Lsn::segment_offset and Lsn::segment_number.)	2021-05-10 16:21:05 -07:00
Eric Seppanen	294320e6a8	wal_service: drop repr(C) The C memory representation is only needed if we want to guarantee the same memory layout as some other program. Since we're using serde to serialize these data structures, we can let the compiler do what it wants.	2021-05-10 16:21:05 -07:00
Eric Seppanen	28b4d9abb3	wal_service: use anyhow for error handling We may eventually want precise error types for some of this, but anyhow::Error is a lot easier than trying to force io::Error.	2021-05-10 16:21:05 -07:00
Eric Seppanen	8d8bc304c1	work around NodeId endian issues Instead of playing games during serialize/deserialize, just treat NodeId::term as an 8-byte array instead of a u64.	2021-05-10 16:21:05 -07:00
Eric Seppanen	4788248e11	wal_service: remove manual serialization code Commit to serde for serialization of data structures.	2021-05-10 16:21:05 -07:00
Eric Seppanen	0cbb3798da	try using serde to do all the serialization in wal_service This version validates on every call that our result is exactly the same as the previous result. NodeId is a strange corner case: one field is serialized little-endian and one field is serialized big-endian. Hopefully we can fix that in the future.	2021-05-10 16:21:05 -07:00
Eric Seppanen	1767208563	remove tokio-postgres from dependencies	2021-05-10 15:24:55 -07:00
Eric Seppanen	6c825dcbaa	switch walkeeper over to new postgres blocking interface This is a big async -> sync conversion. Most of it is a pretty straightforward conversion of removing `async` and `.await` and swapping in the right std modules. I didn't find a thread-blocking version of `Notify` so I wrote one, and then realized that there was already a Mutex being used there, so I deleted my Notify and just used Condvar instead. There is one part that seems odd to me: in `handle_start_replication` there is a place where the previous code was doing a non-blocking read; there is no TcpStream::try_read() so I fell back on manually flipping the socket to non-blocking mode and then back again. This seems pretty gross, but I'm not sure exactly what to replace this with: a background thread? Extract the fd and run select() on it to first test if it's readable?	2021-05-10 15:24:55 -07:00
Eric Seppanen	4b46693c81	adapt to new upstream tokio-postgres replication interface Switch over to a newer version of rust-postgres PR752. A few minor changes are required: - PgLsn::UNDEFINED -> PgLsn::from(0) - PgTimestamp -> SystemTime	2021-05-10 15:24:55 -07:00
Eric Seppanen	df5a55c445	add workspace_hack crate Our builds can be a little inconsistent, because Cargo doesn't deal well with workspaces where there are multiple crates which have different dependencies that select different features. As a workaround, copy what other big rust projects do: add a workspace_hack crate. This crate just pins down a set of dependencies and features that satisfies all of the workspace crates. The benefits are: - running `cargo build` from one of the workspace subdirectories now works without rebuilding anything. - running `cargo install` works (without rebuilding anything). - making small dependency changes is much less likely to trigger large dependency rebuilds.	2021-05-07 13:08:31 -07:00
Eric Seppanen	2e0d45d092	Switch to upstream rust-s3 The local fork of rust-s3 has some code to support Google Cloud, but that PR no longer applies upstream, and will need significant changes before it can be re-submitted. In the meantime, we might as well just use the most similar upstream release. The benefit of switching is that it fixes a feature-resolution bug that was causing us to build 24 more crates than needed (mostly async-std and its dependencies).	2021-05-04 12:02:00 -07:00
Eric Seppanen	a3818dee58	pin dependencies to versions If there isn't any version specified for a dependency crate, Cargo may choose a newer version. This could happen when Cargo.lock is updated ("cargo update") but can also happen unexpectedly when adding or changing other dependencies. This can allow API-breaking changes to be picked up, breaking the build. To prevent this, specify versions for all dependencies. Cargo is still allowed to pick newer versions that are (hopefully) non-breaking, by analyzing the semver version number. There are two special cases here: 1. serde_derive::{Serialize, Deserialize} isn't really used any more. It was only a separate crate in the past because of compiler limitations. Nowadays, people turn on the "derive" feature of the serde crate and use serde::{Serialize, Deserialize}. 2. parse_duration is unmaintained and has an open security issue. (gh iss. 87) That issue probably isn't critical for us because of where we use that crate, but it's probably still better to pin the version so we can't get hit with an API-breaking change at an awkward time.	2021-05-03 14:02:10 -07:00
Eric Seppanen	b77597bd99	remove old Cargo.lock files When using a cargo workspace (defined by the root Cargo.toml), there is one shared Cargo.lock file at the root.	2021-04-29 10:31:01 -07:00
Eric Seppanen	975b2d12dc	cargo fmt	2021-04-28 10:01:58 -07:00
anastasia	34d55b09a3	[issue #73 ] fix wal_acceptor merge problem caused by `3fea78d6`	2021-04-28 17:24:31 +03:00
Konstantin Knizhnik	bbec5a13bd	Extract appname from startup package	2021-04-28 15:26:08 +03:00
Konstantin Knizhnik	3b09a74f58	Implement offloading of old WAL files to S3 in walkeeper	2021-04-26 16:23:00 +03:00
Heikki Linnakangas	8af5cbedb1	Move xlog_utils.rs to postgres_ffi module. I had copy-pasted these functions to a few other places. Clean that up, move them to a common module, and add some comments.	2021-04-22 13:22:34 +03:00
Heikki Linnakangas	a4fd1e1a80	Cleanup more issues noted by 'clippy' Mostly stuff that was introduced by commit `3600b33f1c`.	2021-04-22 09:20:05 +03:00
Eric Seppanen	1f3f4cfaf5	clippy cleanup #2 - remove needless return - remove needless format! - remove a few more needless clone() - from_str_radix(_, 10) -> .parse() - remove needless reference - remove needless `mut` Also manually replaced a match statement with map_err() because after clippy was done with it, there was almost nothing left in the match expression.	2021-04-21 17:56:58 -07:00
Heikki Linnakangas	e911427872	Remove some unnecessary dependencies	2021-04-21 16:42:12 +03:00
Eric Seppanen	92e4f4b3b6	cargo fmt	2021-04-20 17:59:56 -07:00
Heikki Linnakangas	d047a3abf7	Fixes, per Eric's and Konstantin's comments	2021-04-20 19:11:29 +03:00
Heikki Linnakangas	f69db17409	Make WAL safekeeper work with zenith timelines	2021-04-20 19:11:29 +03:00
Heikki Linnakangas	3600b33f1c	Implement "timelines" in page server This replaces the page server's "datadir" concept. The Page Server now always works with a "Zenith Repository". When you initialize a new repository with "zenith init", it runs initdb and loads an initial basebackup of the freshly-created cluster into the repository, on "main" branch. Repository can hold multiple "timelines", which can be given human-friendly names, making them "branches". One page server simultaneously serves all timelines stored in the repository, and you can have multiple Postgres compute nodes connected to the page server, as long they all operate on a different timeline. There is a new command "zenith branch", which can be used to fork off new branches from existing branches. The repository uses the directory layout desribed as Repository format v1 in https://github.com/zenithdb/rfcs/pull/5. It it highly inefficient: - we never create new snapshots. So in practice, it's really just a base backup of the initial empty cluster, and everything else is reconstructed by redoing all WAL - when you create a new timeline, the base snapshot and all WAL is copied from the new timeline to the new one. There is no smarts about referencing the old snapshots/wal from the ancestor timeline. To support all this, this commit includes a bunch of other changes: - Implement "basebackup" funtionality in page server. When you initialize a new compute node with "zenith pg create", it connects to the page server, and requests a base backup of the Postgres data directory on that timeline. (the base backup excludes user tables, so it's not as bad as it sounds). - Have page server's WAL receiver write the WAL into timeline dir. This allows running a Page Server and Compute Nodes without a WAL safekeeper, until we get around to integrate that properly into the system. (Even after we integrate WAL safekeeper, this is perhaps how this will operate when you want to run the system on your laptop.) - restore_datadir.rs was renamed to restore_local_repo.rs, and heavily modified to use the new format. It now also restores all WAL. - Page server no longer scans and restores everything into memory at startup. Instead, when the first request is made for a timeline, the timeline is slurped into memory at that point. - The responsibility for telling page server to "callmemaybe" was moved into Postgres libpqpagestore code. Also, WAL producer connstring cannot be specified in the pageserver's command line anymore. - Having multiple "system identifiers" in the same page server is no longer supported. I repurposed much of that code to support multiple timelines, instead. - Implemented very basic, incomplete, support for PostgreSQL's Extended Query Protocol in page_service.rs. Turns out that rust-postgres' copy_out() function always uses the extended query protocol to send out the command, and I'm using that to stream the base backup from the page server. TODO: I haven't fixed the WAL safekeeper for this scheme, so all the integration tests involving safekeepers are failing. My plan is to modify the safekeeper to know about Zenith timelines, too, and modify it to work with the same Zenith repository format. It only needs to care about the '.zenith/timelines/<timeline>/wal' directories.	2021-04-20 19:11:27 +03:00
Heikki Linnakangas	583f64768f	Fix wal safekeeper's reply to IDENTIFY_SYSTEM command. The PostgreSQL FE/BE RowDescription message was built incorrectly, the colums were sent in wrong order, and the command tag was missing NULL-terminator. With these fixes, 'psql' understands the reply and shows it correctly.	2021-04-20 17:35:27 +03:00
Heikki Linnakangas	9809613c6f	Don't try to read from two WAL files in one read() call. That obviously won't work, you have to stop at the WAL file boundary, and open the next file.	2021-04-19 10:34:51 +03:00
Eric Seppanen	3c7f810849	clippy cleanup #1 Resolve some basic warnings from clippy: - useless conversion to the same type - redundant field names in struct initialization - redundant single-component path imports	2021-04-18 19:15:06 -07:00
Eric Seppanen	52d6275812	drop nonfunctional attributes allow(dead_code) These had no effect, so remove them.	2021-04-16 15:59:32 -07:00
Eric Seppanen	35e0099ac6	pin remote rust-s3 dependency to a git hash Using the hash should allow us to change the remote repo and propagate that change to user builds without that change becoming visible at a random time. It's unfortunate that we can't declare this dependency once in the top-level Cargo.toml; that feature request is rust-lang rfc 2906.	2021-04-16 15:26:11 -07:00

1 2

62 Commits