This is a big async -> sync conversion. Most of it is straightforward:
remove `async` and `.await`, and swap in the corresponding std modules.
I didn't find a thread-blocking version of `Notify` so I wrote one, and
then realized that there was already a Mutex being used there, so I
deleted my Notify and just used Condvar instead.
There is one part that seems odd to me: in `handle_start_replication`,
the previous code did a non-blocking read in one place. std's TcpStream
has no try_read(), so I fell back on manually flipping the socket to
non-blocking mode and then back again. This seems pretty gross, but I'm
not sure exactly what to replace it with: a background thread?
Extracting the fd and running select() on it to first test whether it's
readable?
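For illustration, the workaround looks roughly like this (try_read_once
is a hypothetical name, not what the code uses):

    use std::io::{ErrorKind, Read};
    use std::net::TcpStream;

    /// One non-blocking read attempt: flip the socket to non-blocking,
    /// read, and flip it back. Returns Ok(None) if no data was available.
    fn try_read_once(stream: &mut TcpStream, buf: &mut [u8]) -> std::io::Result<Option<usize>> {
        stream.set_nonblocking(true)?;
        let result = match stream.read(buf) {
            Ok(n) => Ok(Some(n)),
            Err(e) if e.kind() == ErrorKind::WouldBlock => Ok(None),
            Err(e) => Err(e),
        };
        stream.set_nonblocking(false)?;
        result
    }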
Switch over to a newer version of the rust-postgres PR752 branch. A few
minor changes are required, as sketched below:
- PgLsn::UNDEFINED -> PgLsn::from(0)
- PgTimestamp -> SystemTime
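A minimal sketch of both adjustments, assuming PgLsn still lives in
postgres_types on that branch:

    use postgres_types::PgLsn;
    use std::time::SystemTime;

    fn example() {
        // PgLsn::UNDEFINED is gone; LSN 0 ("0/0") serves the same purpose.
        let _lsn = PgLsn::from(0);
        // Replication timestamps are now plain SystemTime values.
        let _ts = SystemTime::now();
    }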
Our builds can be a little inconsistent, because Cargo doesn't deal well
with workspaces where multiple crates pull in the same dependencies with
different feature selections: building the whole workspace and building
a single crate can resolve different feature sets, which forces
rebuilds. As a workaround, copy what other big rust projects do: add a
workspace_hack crate.
This crate just pins down a set of dependencies and features that
satisfies all of the workspace crates.
The benefits are:
- running `cargo build` from one of the workspace subdirectories now
works without rebuilding anything.
- running `cargo install` works (without rebuilding anything).
- making small dependency changes is much less likely to trigger large
dependency rebuilds.
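Concretely, workspace_hack is just a crate whose Cargo.toml depends on
the union of the features the workspace needs; a sketch (the real file
pins many more crates, and these entries are only illustrative):

    # workspace_hack/Cargo.toml
    [package]
    name = "workspace_hack"
    version = "0.1.0"

    [dependencies]
    # Union of the features that the workspace crates select:
    serde = { version = "1.0", features = ["derive"] }
    bytes = { version = "1.0", features = ["serde"] }

Each workspace crate then adds a path dependency on workspace_hack, so
every build resolves to the same feature set.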
A few things that Eric commented on in PR #96:
- Use thiserror to simplify the implementation of FilePathError (see
the sketch below)
- Add unit tests
- Fix a few complaints from clippy
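For reference, a minimal sketch of the thiserror approach; the actual
variants of FilePathError may differ:

    use thiserror::Error;

    // thiserror derives the Display and Error impls from the
    // attributes, replacing the hand-written boilerplate.
    #[derive(Error, Debug)]
    pub enum FilePathError {
        #[error("invalid filename: '{0}'")]
        InvalidFileName(String),
        #[error(transparent)]
        Io(#[from] std::io::Error),
    }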
We should somehow track the range of LSNs that are valid in a
GetPage@LSN request, but currently this is just dead code. Remove it
until we get around to actually implementing it.
https://github.com/zenithdb/zenith/issues/95 tracks that.
Currently, truncation is implemented in the RocksDB repository by storing
a special sentinel entry for each page that was truncated away. Hide that
implementation detail better in the abstract Repository interface, so
that the caller doesn't need to construct the special sentinel WAL record.
While we're at it, refactor the CacheEntryContent struct to an enum.
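Roughly, the refactoring replaces a struct of mutually exclusive Option
fields with an enum along these lines (variant names and payload types
are illustrative):

    /// Sketch of the refactored entry content.
    enum CacheEntryContent {
        /// A materialized page image.
        PageImage(Vec<u8>),
        /// A WAL record to replay on top of an older page version.
        WalRecord(Vec<u8>),
        /// Sentinel for a page truncated away; now internal to the
        /// RocksDB implementation instead of constructed by callers.
        Truncation,
    }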
This moves things around:
- The PageCache is split into two structs: Repository and Timeline. A
Repository holds multiple Timelines. In order to get a page version,
you must first get a reference to the Repository, then the Timeline
in the repository, and finally call the get_page_at_lsn() function
on the Timeline object. This sounds complicated, but because each
connection from a compute node, and each WAL receiver, only deals
with one timeline at a time, the callers can get the reference to
the Timeline object once and hold onto it. The Timeline corresponds
most closely to the old PageCache object.
- Repository and Timeline are now abstract traits, so that we can
support multiple implementations. I don't actually expect us to have
multiple implementations for long. We have the RocksDB
implementation now, but as soon as we have a different
implementation that's usable, I expect that we will retire the
RocksDB implementation. But I think this abstraction works as good
documentation in any case: it's now easier to see what the interface
for storing and loading pages from the repository is, by looking at
the Repository/Timeline traits. The abstract traits are in
repository.rs, and the RocksDB implementation of them is in
repository/rocksdb.rs (a rough sketch of the traits follows this
list).
- page_cache.rs is now a "switchboard" to get a handle to the
repository. Currently, the page server can only handle one
repository at a time, so there isn't much there, but in the future
we might do multi-tenancy there.
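For orientation, a rough sketch of the shape of the traits; the
placeholder types and signatures here are illustrative, and the real
definitions live in repository.rs:

    use std::sync::Arc;

    // Placeholder types, standing in for the real ones elsewhere in the tree.
    type ZTimelineId = u64;
    type Lsn = u64;
    type BufferTag = (u32, u32);
    type Page = Vec<u8>;
    type Result<T> = std::result::Result<T, Box<dyn std::error::Error>>;

    trait Repository {
        /// Look up a Timeline; callers fetch this once and hold onto it.
        fn get_timeline(&self, timelineid: ZTimelineId) -> Result<Arc<dyn Timeline>>;
    }

    trait Timeline {
        /// Fetch a page image as of the given LSN. This corresponds
        /// most closely to the old PageCache entry point.
        fn get_page_at_lsn(&self, tag: BufferTag, lsn: Lsn) -> Result<Page>;
    }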
We're mostly not testing Python code, so verbose Python tracebacks are
unhelpful. Add --tb=short to the pytest args to cut down on the noise.
To override this during testing, set the "extra_params" parameter on the
circleci job to "--tb=auto" or "--tb=long".
The local fork of rust-s3 has some code to support Google Cloud, but
that PR no longer applies upstream, and will need significant changes
before it can be re-submitted.
In the meantime, we might as well just use the most similar upstream
release. The benefit of switching is that it fixes a feature-resolution
bug that was causing us to build 24 more crates than needed (mostly
async-std and its dependencies).
The rust cache is growing dramatically. Change the cache key to start
over.
The weird "v98" was something I'd intended to reset before landing the
circleci config. Do the sane thing and start over at v01. The intent is
that we just increment the number each time something gets broken.
Since we now call the syscall directly, read_pidfile can parse the pid
into an integer.
We also verify the pid is >= 1, because kill interprets 0 and negative
values as process groups (and -1 as every process we may signal), which
goes straight to crazytown.
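A sketch of the checks, with error handling simplified to plain
strings (the real function's signature may differ):

    use std::path::Path;

    type Pid = i32;

    fn read_pidfile(path: &Path) -> Result<Pid, String> {
        let contents = std::fs::read_to_string(path)
            .map_err(|e| format!("failed to read pidfile {}: {}", path.display(), e))?;
        let pid: Pid = contents
            .trim()
            .parse()
            .map_err(|e| format!("pidfile {} does not contain a valid pid: {}", path.display(), e))?;
        // kill(2) treats 0 and negative pids as process groups (and -1
        // as "everything we can signal"), so reject anything below 1.
        if pid < 1 {
            return Err(format!("pidfile {} contained bogus pid {}", path.display(), pid));
        }
        Ok(pid)
    }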
For reference, the CircleCI resource classes:
- default (medium): 2 CPUs, 4GB RAM
- xlarge: 8 CPUs, 16GB RAM
Some build jobs are getting killed with signal 9. I'm guessing this is
an OOM condition, so bump those jobs to xlarge.
I found I had a few other .zenith directories hanging around in odd
places. I doubt we intended those directories to collect in multiple
locations, so only hide the one in the git root directory.
If there isn't any version specified for a dependency crate, Cargo may
choose a newer version. This could happen when Cargo.lock is updated
("cargo update") but can also happen unexpectedly when adding or
changing other dependencies. This can allow API-breaking changes to be
picked up, breaking the build.
To prevent this, specify versions for all dependencies. Cargo is still
allowed to pick newer versions that are (hopefully) non-breaking, by
analyzing the semver version number.
There are two special cases here:
1. serde_derive::{Serialize, Deserialize} isn't really used any more. It
was only a separate crate in the past because of compiler limitations.
Nowadays, people turn on the "derive" feature of the serde crate and
use serde::{Serialize, Deserialize} (see the sketch below).
2. parse_duration is unmaintained and has an open security issue
(GitHub issue #87). That issue probably isn't critical for us because
of where we use that crate, but it's still better to pin the version so
we can't get hit with an API-breaking change at an awkward time.
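To illustrate the first special case: with serde's "derive" feature
enabled, the derive macros come straight from the serde crate, so the
separate serde_derive dependency can be dropped:

    // Cargo.toml: serde = { version = "1.0", features = ["derive"] }
    use serde::{Deserialize, Serialize};

    #[derive(Serialize, Deserialize)]
    struct Example {
        name: String,
        value: u64,
    }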
It's quite hard to get python2 to exit gracefully when the code was
intended for python3, because the interpreter will raise a SyntaxError
before running a single line of code. Thankfully, pytest supports a
version check in its .ini config, so that should gracefully handle both
wrong-pytest-version and wrong-python-version.
Also document the woes of trying to run the pytest version shipped by
e.g. Debian or Ubuntu.
Fetching the postgres submodule is one of the more expensive steps of
the build. Doing a shallow clone ("--depth 1") should save some time and
a lot of network bandwidth.
This does the postgres & rust builds, caches the results, and preserves
the outputs in a "workspace" for downstream test jobs (which can run in
parallel).
Pytest jobs are parameterized, so adding new pytest-based tests requires
only adding a new job to the "workflows" section at the end.
This could use some optimization:
- The "apt-get install" step is quite slow.
- The rust build step will always happen, even if only unrelated
changes are present (e.g. only a python test file was modified)
- Saving/restoring the rust cache (/target) is very slow (it contains
1.3GB of data)
- Saving the workspace is very slow.
- The "install" step is ugly; postgres and rust artifacts could take a
much better form.
Use pytest to manage background services, paths, and environment
variables.
Benefits:
- Tests are a little easier to write.
- Cleanup is more reliable. You can CTRL-C a test and it will still shut
down gracefully. If you manually start a conflicting process, the test
fixtures will detect this and abort at startup.
- Don't need to worry about remembering '--test-threads=1'
- Output of sub-processes can be captured to files.
- Test fixtures configure everything to operate under a single test
output directory, making it easier to capture logs in CI.
- Detects all the necessary paths if run from the git root, but can also
run from arbitrary paths by setting environment variables.
There is also a deliberately broken test (test_broken.py) that can be
used to test whether the test fixtures properly clean up after
themselves. It won't run by default; the comment at the top explains how
to enable it.
Remove the check that enforces running from the git root directory.
Discover the zenith binary path from current_exe().
Look for postgres in $POSTGRES_BIN or $CWD/tmp_install.
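A sketch of the new discovery logic; the helper names here are
illustrative:

    use std::env;
    use std::path::PathBuf;

    // The zenith binary's own location gives us the binary directory.
    fn zenith_bin_dir() -> std::io::Result<PathBuf> {
        let mut exe = env::current_exe()?;
        exe.pop(); // drop the executable name, keep its directory
        Ok(exe)
    }

    // Postgres comes from $POSTGRES_BIN if set, otherwise ./tmp_install
    // under the current working directory.
    fn postgres_install_dir() -> std::io::Result<PathBuf> {
        match env::var_os("POSTGRES_BIN") {
            Some(dir) => Ok(PathBuf::from(dir)),
            None => Ok(env::current_dir()?.join("tmp_install")),
        }
    }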