rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-08 05:52:55 +00:00

Author	SHA1	Message	Date
Patrick Insinger	d8e509d29e	page_service - use anyhow for error handling	2021-05-11 14:11:10 -04:00
Patrick Insinger	d5bfe84d9e	cargo fmt	2021-05-11 12:35:09 -04:00
Arseny Sher	8fff26ad49	Make Repository API return abstract dyn Timeline. + minor cargo fmt cleanup	2021-05-11 15:27:23 +03:00
Heikki Linnakangas	5f4e32f505	Require valid WAL streaming point. If timeline doesn't have a valid "last valid LSN", refuse WAL streaming. The previous behavior was to start streaming from the very beginning of time. That was needed to support bootstrapping the page server with no data at all (see commit `bd606ab37a`), but we no longer do that.	2021-05-11 11:12:14 +03:00
Heikki Linnakangas	fb71c85a79	Implement std::fmt::Display for RelTag, for debug messages.	2021-05-11 10:55:51 +03:00
Eric Seppanen	0cbb3798da	try using serde to do all the serialization in wal_service This version validates on every call that our result is exactly the same as the previous result. NodeId is a strange corner case: one field is serialized little-endian and one field is serialized big-endian. Hopefully we can fix that in the future.	2021-05-10 16:21:05 -07:00
Eric Seppanen	1767208563	remove tokio-postgres from dependencies	2021-05-10 15:24:55 -07:00
Eric Seppanen	d25656797c	switch pageserver to blocking postgres interface	2021-05-10 15:24:55 -07:00
Eric Seppanen	4b46693c81	adapt to new upstream tokio-postgres replication interface Switch over to a newer version of rust-postgres PR752. A few minor changes are required: - PgLsn::UNDEFINED -> PgLsn::from(0) - PgTimestamp -> SystemTime	2021-05-10 15:24:55 -07:00
Eric Seppanen	d26b76fe7c	cargo fmt	2021-05-07 13:11:44 -07:00
Eric Seppanen	df5a55c445	add workspace_hack crate Our builds can be a little inconsistent, because Cargo doesn't deal well with workspaces where there are multiple crates which have different dependencies that select different features. As a workaround, copy what other big rust projects do: add a workspace_hack crate. This crate just pins down a set of dependencies and features that satisfies all of the workspace crates. The benefits are: - running `cargo build` from one of the workspace subdirectories now works without rebuilding anything. - running `cargo install` works (without rebuilding anything). - making small dependency changes is much less likely to trigger large dependency rebuilds.	2021-05-07 13:08:31 -07:00
Heikki Linnakangas	e5e5c3e067	Tidy up the `parse_relfilename` function. A few things that Eric commented on at PR #96: - Use thiserror to simplify the implemention of FilePathError - Add unit tests - Fix a few complaints from clippy	2021-05-07 11:01:34 +03:00
Heikki Linnakangas	b7575582b8	Add comments to the Repository/Timeline traits. Let's try to have comments on every public function. This doesn't quite get us there yet, but close.	2021-05-06 23:02:11 +03:00
Heikki Linnakangas	77fd24b950	Fix a few clippy warnings. By either accepting clippy's suggestion, or by adding an 'allow' directive to silence it.	2021-05-06 21:57:13 +03:00
Heikki Linnakangas	61af9bb889	Move a few functions that have been copy-pasted around to shared module.	2021-05-06 21:57:10 +03:00
Heikki Linnakangas	a68f60415b	Change a few remaining functions to use the Lsn datatype for LSNs.	2021-05-06 21:57:07 +03:00
Heikki Linnakangas	e7ca580922	Improve comments.	2021-05-06 21:57:04 +03:00
Heikki Linnakangas	33d126ecbe	Tidy up usage of a few constants from PostgreSQL headers.	2021-05-06 21:57:01 +03:00
anastasia	15db0d1d6f	refactor walreciever and restore_local_repo	2021-05-06 12:58:08 +03:00
Heikki Linnakangas	29f122009a	Don't restart WAL streaming in the middle of a record. I think this was changed inadvertently by commit `2c308da4d2`. Change it back. Fixes https://github.com/zenithdb/zenith/issues/98	2021-05-06 11:34:28 +03:00
Heikki Linnakangas	bf0a0cb55d	Remove unused struct	2021-05-05 20:14:09 +03:00
Heikki Linnakangas	0fe5abadf5	Remove dead code around tracking first valid LSN. We should track the range of LSNs that are valid in a GetPage@LSN request somehow, but currently this is just dead code. Remove, until we get around to actually implement it. https://github.com/zenithdb/zenith/issues/95 tracks that.	2021-05-05 17:29:10 +03:00
Heikki Linnakangas	8e57c2e413	Provide more context to a panic. I just bumped into this panic, but couldn't reproduce. Not sure what happened, but let's provide more context.	2021-05-05 15:47:11 +03:00
Heikki Linnakangas	4dd63821bd	Improve trace log messages in page server	2021-05-05 10:39:28 +03:00
Heikki Linnakangas	eeec1a3dcb	Refactor the way truncations are handled. Currently, truncation is implemented in the RocksDB repository by storing a special sentinel entry for each page that was truncated away. Hide that implementation detail better in the abstract Repository interface, so that caller doesn't need to construct the special sentinel WAL record. While we're at it, refactor the CacheEntryContent struct to an enum.	2021-05-05 10:39:28 +03:00
Heikki Linnakangas	b484b896b6	Refactor the functionality page_cache.rs. This moves things around: - The PageCache is split into two structs: Repository and Timeline. A Repository holds multiple Timelines. In order to get a page version, you must first get a reference to the Repository, then the Timeline in the repository, and finally call the get_page_at_lsn() function on the Timeline object. This sounds complicated, but because each connection from a compute node, and each WAL receiver, only deals with one timeline at a time, the callers can get the reference to the Timeline object once and hold onto it. The Timeline corresponds most closely to the old PageCache object. - Repository and Timeline are now abstract traits, so that we can support multiple implementations. I don't actually expect us to have multiple implementations for long. We have the RocksDB implementation now, but as soon as we have a different implementation that's usable, I expect that we will retire the RocksDB implementation. But I think this abstraction works as good documentation in any case: it's now easier to see what the interface for storing and loading pages from the repository is, by looking at the Repository/Timeline traits. They abstract traits are in repository.rs, and the RocksDB implementation of them is in repository/rocksdb.rs. - page_cache.rs is now a "switchboard" to get a handle to the repository. Currently, the page server can only handle one repository at a time, so there isn't much there, but in the future we might do multi-tenancy there.	2021-05-05 10:37:36 +03:00
Eric Seppanen	2e0d45d092	Switch to upstream rust-s3 The local fork of rust-s3 has some code to support Google Cloud, but that PR no longer applies upstream, and will need significant changes before it can be re-submitted. In the meantime, we might as well just use the most similar upstream release. The benefit of switching is that it fixes a feature-resolution bug that was causing us to build 24 more crates than needed (mostly async-std and its dependencies).	2021-05-04 12:02:00 -07:00
Eric Seppanen	ce646ea845	use tokio::try_join instead of futures::try_join We don't use the `futures` crate much. Remove one of only two references to it (tokio has the identical macro).	2021-05-03 18:46:10 -07:00
Eric Seppanen	a3818dee58	pin dependencies to versions If there isn't any version specified for a dependency crate, Cargo may choose a newer version. This could happen when Cargo.lock is updated ("cargo update") but can also happen unexpectedly when adding or changing other dependencies. This can allow API-breaking changes to be picked up, breaking the build. To prevent this, specify versions for all dependencies. Cargo is still allowed to pick newer versions that are (hopefully) non-breaking, by analyzing the semver version number. There are two special cases here: 1. serde_derive::{Serialize, Deserialize} isn't really used any more. It was only a separate crate in the past because of compiler limitations. Nowadays, people turn on the "derive" feature of the serde crate and use serde::{Serialize, Deserialize}. 2. parse_duration is unmaintained and has an open security issue. (gh iss. 87) That issue probably isn't critical for us because of where we use that crate, but it's probably still better to pin the version so we can't get hit with an API-breaking change at an awkward time.	2021-05-03 14:02:10 -07:00
anastasia	1cdeba9db7	[issue #18 ] log module name and position in the file	2021-05-03 15:17:51 +03:00
Konstantin Knizhnik	651a8139f5	Fix bug in transaction_id_set_status_bit	2021-04-30 19:24:00 +03:00
Konstantin Knizhnik	eea6f0898e	Restore CLOG from snapshot	2021-04-30 14:22:47 +03:00
Heikki Linnakangas	086c0ad829	Remove unused 'apply_pending' field.	2021-04-30 12:44:06 +03:00
Eric Seppanen	b77597bd99	remove old Cargo.lock files When using a cargo workspace (defined by the root Cargo.toml), there is one shared Cargo.lock file at the root.	2021-04-29 10:31:01 -07:00
anastasia	1369145e83	code cleanup	2021-04-29 18:41:42 +03:00
anastasia	b49164a1d4	cargo fmt	2021-04-29 18:41:42 +03:00
anastasia	e7b112aacc	Refactor pg_constants. Move them to postgres_ffi/	2021-04-29 18:41:42 +03:00
Eric Seppanen	975b2d12dc	cargo fmt	2021-04-28 10:01:58 -07:00
Heikki Linnakangas	41a3772e90	Replace pgbuild.sh with a Makefile This allows building both Zenith and PostgreSQL in one command. The command is 'make' Reviewed-by: Arseny Sher <sher-ars@yandex.ru>	2021-04-28 16:54:45 +03:00
anastasia	421d586953	code cleanup for XLogRecord decoding	2021-04-28 13:56:27 +03:00
anastasia	ef37eb96b9	refactor XLogRecord reading	2021-04-28 13:56:27 +03:00
anastasia	d311f708b6	handle subtrans in COMMIT/ABORT records	2021-04-28 13:56:27 +03:00
Heikki Linnakangas	c7f54af1f1	Refactor page_cache <-> walredo interface. Make the caller of request_redo() responsible for gathering the WAL records to redo, and for storing the reconstructed page image back in the page cache. This leaves the WAL redo manager purely responsible for dealing with the postgres child process, removing its dependency on the PageCache.	2021-04-27 21:43:56 +03:00
Heikki Linnakangas	cff671c1bd	Remove duplicated LSN fields from the page cache. Having multiple copies of the same values is a source of confusion. Commit `da9bf5dc63` fixed one race condition caused by that, for example. See also discussion at https://github.com/zenithdb/zenith/issues/57#issuecomment-824393470 This changes SeqWait.advance() to return the old number, and not panic if you try to move the value backwards. The caller should check for that and act accordingly.	2021-04-27 10:32:39 +03:00
Eric Seppanen	4acdcbe90f	clippy cleanup #3 Fix issues raised by clippy. Mostly trivial ones, though some allow 4-5 lines of code to be reduced to 1.	2021-04-26 12:35:35 -07:00
Eric Seppanen	fdf6829de5	cargo fmt	2021-04-26 09:36:22 -07:00
anastasia	b361558a8a	fix typo in transaction replay code	2021-04-26 18:35:26 +03:00
Konstantin Knizhnik	c59830fd01	Do not restart wal-redo-postgres	2021-04-26 17:57:29 +03:00
Heikki Linnakangas	f617115467	Remove obsolete comment on async usage in the page cache	2021-04-26 14:12:57 +03:00
Heikki Linnakangas	3b9e7fc5e6	Use explicit threads. Remove 'async' usage a much as feasible. Async code is harder to debug, and mixing async and non-async code is a recipe for confusion and bugs. There are a couple of exceptions: - The code in walredo.rs, which needs to read and write to the child process simultaneously, still uses async. It's more convenient there. The 'async' usage is carefully limited to just the functions that communicate with the child process. - Code in walreceiver.rs that uses tokio-postgres to do streaming replication. We have to use async there, because tokio-postgres is async. Most rust-postgres functionality has non-async wrappers, but not the new replication client code. The async usage is very limited here, too: we use just block_on to call the tokio-postgres functions. The code in 'page_service.rs' now launches a dedicated thread for each connection. This replaces tokio::sync:⌚:channel with std::sync:mpsc in 'seqwait.rs', to make that non-async. It's not a drop-in replacement, though: std::sync::mpsc doesn't support multiple consumers, so we cannot share a channel between multiple waiters. So this removes the code to check if an existing channel can be reused, and creates a new one for each waiter. That created another problem: BTreeMap cannot hold duplicates, so I replaced that with BinaryHeap. Similarly, the tokio::{mpsc, oneshot} channels used between WAL redo manager and PageCache are replaced with std::sync::mpsc. (There is no separate 'oneshot' channel in the standard library.) Fixes github issue #58, and coincidentally also issue #66.	2021-04-26 13:07:51 +03:00

1 2 3

138 Commits