rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-05 20:42:54 +00:00

Author	SHA1	Message	Date
Heikki Linnakangas	e6a7241c3a	Simplify construction of rocksdb keys and values. I'm going nuts with the pattern: let k = iter.key().unwrap(); buf.clear(); buf.extend_from_slice(&k); let key = CacheKey::unpack(&mut buf); Introduce helper functions to convert a CacheKey into BytesMut, and from [u8] into CacheKey. Reduces the boilerplate code a lot. The helper functions create a new BytesMut on each call, whereas the old coding could reuse a single BytesMut, so this could be a bit slower. I haven't tried measuring it, but at least it's not immediately noticeable, and readability is much more imporatant at this point. We can optimize later	2021-05-19 12:33:38 +03:00
Stas Kelvich	709b778904	Show help in CLI when no arguments provided	2021-05-19 12:32:57 +03:00
Heikki Linnakangas	aa8debf4e8	Add test for a relation that's larger than 1 GB. This isn't very exciting with the current RocksDB implementation, because it doesn't care about the PostgreSQL 1 GB segment boundaries at all. But I think we will care about this in the future, and more tests is generally better anyway.	2021-05-19 09:22:17 +03:00
Heikki Linnakangas	1912546e52	Change the meaning of PageServerConf.workdir Commit `746f667311` added the 'workdir' field and the get__path() functions, with the idea that we cd into the directory at page server startup, so that the get__path() functions can always return paths relative to '.', but 'workdir' shows the original path to it. Change it so that 'conf.workdir' is always set to '.', too, and the get__path() functions include 'workdir' in the returned paths. Why? Because that allows writing unit tests without changing the current directory. When I was working on commit `97992226d3`, I initially wrote the test so that it changed the current working directory, just like commit `746f667311` did. But that was problematic, when I tried to add another unit test that also* wants to change the current working dir, because they could then not run concurrently. In fact, they could not even run serially, unless the current directory was carefully reset after the test. So it is better to avoid changing the current directory in tests.	2021-05-19 08:49:16 +03:00
Heikki Linnakangas	a6178c135f	Fix starting page server in non-daemonize mode. Commit `746f667311` moved the "chdir" earlier in the startup sequence, before daemonizing. But it forgot to remove a corresponding chdir call later in the sequence when not in daemonize mode. As a result, if you tried to start the pageserver without the --daemonize option, it always failed with "No such file or directory" error.	2021-05-19 08:49:09 +03:00
Heikki Linnakangas	66bced0f36	Fix leftover comment about async I/O	2021-05-18 20:47:35 +03:00
Patrick Insinger	f954d5c501	pageserver - separate pagestream messages	2021-05-17 17:17:08 -04:00
Heikki Linnakangas	e602807476	Be more lenient with branch names. Notably, the "foo@0/12345678" syntax was not allowed, because '/' is not a word character.	2021-05-17 20:44:00 +03:00
Eric Seppanen	398d522d88	cargo fmt	2021-05-17 09:29:58 -07:00
Stas Kelvich	746f667311	Refactor CLI and CLI<->pageserver interfaces to support remote pageserver This patch started as an effort to support CLI working against remote pageserver, but turned into a pretty big refactoring. * CLI now does not look into repository files directly. New commands 'branch_create' and 'identify_system' were introduced into page_service to support that. * Branch management that was scattered between local_env and zenith/main.rs is moved into pageserver/branches.rs. That code could better fit in Repository/Timeline impl, but I'll leave that for a different patch. * All tests-related code from local_env went into integration_tests/src/lib.rs as an extension to PostgresNode trait. * Paths-generating functions were concentrated around corresponding config types (LocalEnv and PageserverConf).	2021-05-17 19:17:51 +03:00
Heikki Linnakangas	952424b78c	Move save_decoded_record() function to Repository trait. The function doesn't depend on the implementation of the Repository, it only calls the public interface functions.	2021-05-17 15:16:28 +03:00
Konstantin Knizhnik	04dc698d4b	Add support of twophase transactions	2021-05-16 00:03:20 +03:00
Heikki Linnakangas	6b11b4250e	Fix compilation with older rust version. Commit `9ece1e863d` used `slice.fill`, which isn't available until Rust v1.50.0. I have 1.48.0 installed, so it was failing to compile for me. We haven't really standardized on any particular Rust version, and if there's a good feature we need in a recent version, let's bump up the minimum requirement. But this is simple enough to work around.	2021-05-15 01:42:33 +03:00
Konstantin Knizhnik	9ece1e863d	Compute and restore pg_xact, pg_multixact and pg_filenode.map files	2021-05-14 16:35:09 +03:00
Heikki Linnakangas	97992226d3	Add some unit tests for the Repository/Timeline interface.	2021-05-14 12:44:52 +03:00
Heikki Linnakangas	270356ec38	Refactor WalRedoManager for easier testing. Turn WalRedoManager into an abstract trait, so that it can be easily mocked in unit tests. One change here is that the WAL redo manager is no longer tied to a specific zenith timeline. It didn't do anything with that information aside from using it in the dummy datadir's name. We could use any random string for that purpose, it's just to prevent two WAL redo managers from stepping over each other. But this commit actually changes things so that all timelines use the same WAL redo manager, so that's not necessary. We will probably want to maintain a pool of WAL redo processes in the future, but for now let's keep it simple. In the passing, fix some comments.	2021-05-14 12:44:49 +03:00
Heikki Linnakangas	c2db828481	Create RocksDB databases under correct path. We used to create them under .zenith/.zenith/<timelineid>. The double .zenith was clearly not intentional. Change it to .zenith/timelines/<timelineid>. Fixes https://github.com/zenithdb/zenith/issues/127	2021-05-14 12:44:44 +03:00
anastasia	38c4b6f02f	Move postgres code related to zenith pageserver to contrib/zenith. - vendor/postgres changes - Respective changes in RUST code: upload shared library, use new GUC names. - Add contrib build to Makefile.	2021-05-13 16:23:21 +03:00
Eric Seppanen	6ff3f1b9fd	don't open log files multiple times Multiple fds writing to the same file doesn't work. One fd will overwrite the output of the other fd. We were opening log files three times (stdout, stderr, and slog). The symptoms can be seen when the program panics; the final file will have truncated or lost messages. After this change, all messages are preserved. If panicking and logging are concurrent (and they definitely can be), some of the messages may be interleaved in slightly inconvenient ways. File::try_clone() is essentially `dup` underneath, meaning the two will share the same file offset.	2021-05-13 00:32:39 -07:00
Patrick Insinger	4c5e23d014	pageserver - fix ParameterStatus write call	2021-05-12 20:59:04 -04:00
Patrick Insinger	99d80aba52	use pageserver for pg list command	2021-05-12 12:34:03 +03:00
Konstantin Knizhnik	2f2dff4c8d	Merge with main brnach	2021-05-12 10:46:01 +03:00
Konstantin Knizhnik	22e7fcbf2d	Handle visbility map updates in WAL redo	2021-05-12 10:38:43 +03:00
Patrick Insinger	49d1921a28	page_server - add python api tests	2021-05-11 14:16:22 -04:00
Patrick Insinger	d8e509d29e	page_service - use anyhow for error handling	2021-05-11 14:11:10 -04:00
Patrick Insinger	d5bfe84d9e	cargo fmt	2021-05-11 12:35:09 -04:00
Arseny Sher	8fff26ad49	Make Repository API return abstract dyn Timeline. + minor cargo fmt cleanup	2021-05-11 15:27:23 +03:00
Heikki Linnakangas	5f4e32f505	Require valid WAL streaming point. If timeline doesn't have a valid "last valid LSN", refuse WAL streaming. The previous behavior was to start streaming from the very beginning of time. That was needed to support bootstrapping the page server with no data at all (see commit `bd606ab37a`), but we no longer do that.	2021-05-11 11:12:14 +03:00
Heikki Linnakangas	fb71c85a79	Implement std::fmt::Display for RelTag, for debug messages.	2021-05-11 10:55:51 +03:00
Eric Seppanen	0cbb3798da	try using serde to do all the serialization in wal_service This version validates on every call that our result is exactly the same as the previous result. NodeId is a strange corner case: one field is serialized little-endian and one field is serialized big-endian. Hopefully we can fix that in the future.	2021-05-10 16:21:05 -07:00
Eric Seppanen	1767208563	remove tokio-postgres from dependencies	2021-05-10 15:24:55 -07:00
Eric Seppanen	d25656797c	switch pageserver to blocking postgres interface	2021-05-10 15:24:55 -07:00
Eric Seppanen	4b46693c81	adapt to new upstream tokio-postgres replication interface Switch over to a newer version of rust-postgres PR752. A few minor changes are required: - PgLsn::UNDEFINED -> PgLsn::from(0) - PgTimestamp -> SystemTime	2021-05-10 15:24:55 -07:00
Eric Seppanen	d26b76fe7c	cargo fmt	2021-05-07 13:11:44 -07:00
Eric Seppanen	df5a55c445	add workspace_hack crate Our builds can be a little inconsistent, because Cargo doesn't deal well with workspaces where there are multiple crates which have different dependencies that select different features. As a workaround, copy what other big rust projects do: add a workspace_hack crate. This crate just pins down a set of dependencies and features that satisfies all of the workspace crates. The benefits are: - running `cargo build` from one of the workspace subdirectories now works without rebuilding anything. - running `cargo install` works (without rebuilding anything). - making small dependency changes is much less likely to trigger large dependency rebuilds.	2021-05-07 13:08:31 -07:00
Heikki Linnakangas	e5e5c3e067	Tidy up the `parse_relfilename` function. A few things that Eric commented on at PR #96: - Use thiserror to simplify the implemention of FilePathError - Add unit tests - Fix a few complaints from clippy	2021-05-07 11:01:34 +03:00
Heikki Linnakangas	b7575582b8	Add comments to the Repository/Timeline traits. Let's try to have comments on every public function. This doesn't quite get us there yet, but close.	2021-05-06 23:02:11 +03:00
Heikki Linnakangas	77fd24b950	Fix a few clippy warnings. By either accepting clippy's suggestion, or by adding an 'allow' directive to silence it.	2021-05-06 21:57:13 +03:00
Heikki Linnakangas	61af9bb889	Move a few functions that have been copy-pasted around to shared module.	2021-05-06 21:57:10 +03:00
Heikki Linnakangas	a68f60415b	Change a few remaining functions to use the Lsn datatype for LSNs.	2021-05-06 21:57:07 +03:00
Heikki Linnakangas	e7ca580922	Improve comments.	2021-05-06 21:57:04 +03:00
Heikki Linnakangas	33d126ecbe	Tidy up usage of a few constants from PostgreSQL headers.	2021-05-06 21:57:01 +03:00
anastasia	15db0d1d6f	refactor walreciever and restore_local_repo	2021-05-06 12:58:08 +03:00
Heikki Linnakangas	29f122009a	Don't restart WAL streaming in the middle of a record. I think this was changed inadvertently by commit `2c308da4d2`. Change it back. Fixes https://github.com/zenithdb/zenith/issues/98	2021-05-06 11:34:28 +03:00
Heikki Linnakangas	bf0a0cb55d	Remove unused struct	2021-05-05 20:14:09 +03:00
Heikki Linnakangas	0fe5abadf5	Remove dead code around tracking first valid LSN. We should track the range of LSNs that are valid in a GetPage@LSN request somehow, but currently this is just dead code. Remove, until we get around to actually implement it. https://github.com/zenithdb/zenith/issues/95 tracks that.	2021-05-05 17:29:10 +03:00
Heikki Linnakangas	8e57c2e413	Provide more context to a panic. I just bumped into this panic, but couldn't reproduce. Not sure what happened, but let's provide more context.	2021-05-05 15:47:11 +03:00
Heikki Linnakangas	4dd63821bd	Improve trace log messages in page server	2021-05-05 10:39:28 +03:00
Heikki Linnakangas	eeec1a3dcb	Refactor the way truncations are handled. Currently, truncation is implemented in the RocksDB repository by storing a special sentinel entry for each page that was truncated away. Hide that implementation detail better in the abstract Repository interface, so that caller doesn't need to construct the special sentinel WAL record. While we're at it, refactor the CacheEntryContent struct to an enum.	2021-05-05 10:39:28 +03:00
Heikki Linnakangas	b484b896b6	Refactor the functionality page_cache.rs. This moves things around: - The PageCache is split into two structs: Repository and Timeline. A Repository holds multiple Timelines. In order to get a page version, you must first get a reference to the Repository, then the Timeline in the repository, and finally call the get_page_at_lsn() function on the Timeline object. This sounds complicated, but because each connection from a compute node, and each WAL receiver, only deals with one timeline at a time, the callers can get the reference to the Timeline object once and hold onto it. The Timeline corresponds most closely to the old PageCache object. - Repository and Timeline are now abstract traits, so that we can support multiple implementations. I don't actually expect us to have multiple implementations for long. We have the RocksDB implementation now, but as soon as we have a different implementation that's usable, I expect that we will retire the RocksDB implementation. But I think this abstraction works as good documentation in any case: it's now easier to see what the interface for storing and loading pages from the repository is, by looking at the Repository/Timeline traits. They abstract traits are in repository.rs, and the RocksDB implementation of them is in repository/rocksdb.rs. - page_cache.rs is now a "switchboard" to get a handle to the repository. Currently, the page server can only handle one repository at a time, so there isn't much there, but in the future we might do multi-tenancy there.	2021-05-05 10:37:36 +03:00

1 2 3 4

162 Commits