rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-20 06:30:43 +00:00

Author	SHA1	Message	Date
Konstantin Knizhnik	14168c7aa7	Increase downtime timeout to avoid address already in use error and fix checking for elapsed time	2021-04-28 17:24:31 +03:00
anastasia	7a8501d12f	[issue #73 ] fix race in test_acceptors_unavailability test	2021-04-28 17:24:31 +03:00
anastasia	34d55b09a3	[issue #73 ] fix wal_acceptor merge problem caused by `3fea78d6`	2021-04-28 17:24:31 +03:00
Heikki Linnakangas	41a3772e90	Replace pgbuild.sh with a Makefile This allows building both Zenith and PostgreSQL in one command. The command is 'make' Reviewed-by: Arseny Sher <sher-ars@yandex.ru>	2021-04-28 16:54:45 +03:00
Konstantin Knizhnik	bbec5a13bd	Extract appname from startup package	2021-04-28 15:26:08 +03:00
anastasia	421d586953	code cleanup for XLogRecord decoding	2021-04-28 13:56:27 +03:00
anastasia	ef37eb96b9	refactor XLogRecord reading	2021-04-28 13:56:27 +03:00
anastasia	d311f708b6	handle subtrans in COMMIT/ABORT records	2021-04-28 13:56:27 +03:00
Heikki Linnakangas	c7f54af1f1	Refactor page_cache <-> walredo interface. Make the caller of request_redo() responsible for gathering the WAL records to redo, and for storing the reconstructed page image back in the page cache. This leaves the WAL redo manager purely responsible for dealing with the postgres child process, removing its dependency on the PageCache.	2021-04-27 21:43:56 +03:00
Heikki Linnakangas	44a85d9176	Put back 'pgbuild.sh', which was removed accidentally. Oops, I deleted it accidentally in commit `96beffb3c5`. Put it back.	2021-04-27 15:33:38 +03:00
Heikki Linnakangas	96beffb3c5	Add tests for the `Lsn::fetch_max` function.	2021-04-27 13:43:39 +03:00
Heikki Linnakangas	cff671c1bd	Remove duplicated LSN fields from the page cache. Having multiple copies of the same values is a source of confusion. Commit `da9bf5dc63` fixed one race condition caused by that, for example. See also discussion at https://github.com/zenithdb/zenith/issues/57#issuecomment-824393470 This changes SeqWait.advance() to return the old number, and not panic if you try to move the value backwards. The caller should check for that and act accordingly.	2021-04-27 10:32:39 +03:00
Eric Seppanen	4acdcbe90f	clippy cleanup #3 Fix issues raised by clippy. Mostly trivial ones, though some allow 4-5 lines of code to be reduced to 1.	2021-04-26 12:35:35 -07:00
Eric Seppanen	fdf6829de5	cargo fmt	2021-04-26 09:36:22 -07:00
anastasia	b361558a8a	fix typo in transaction replay code	2021-04-26 18:35:26 +03:00
Konstantin Knizhnik	c59830fd01	Do not restart wal-redo-postgres	2021-04-26 17:57:29 +03:00
Konstantin Knizhnik	636194406f	Dump log files in case of regress_tests failure	2021-04-26 17:04:26 +03:00
Konstantin Knizhnik	3b09a74f58	Implement offloading of old WAL files to S3 in walkeeper	2021-04-26 16:23:00 +03:00
Heikki Linnakangas	f617115467	Remove obsolete comment on async usage in the page cache	2021-04-26 14:12:57 +03:00
Heikki Linnakangas	4f529b7d4a	Remove unused function.	2021-04-26 13:54:06 +03:00
Heikki Linnakangas	bc652e965e	Save old 'async' version of SeqWait, in case we need it later. It is currently unused, and is not built as part of 'cargo build', but seems like a shame to throw it away completely.	2021-04-26 13:30:10 +03:00
Heikki Linnakangas	3b9e7fc5e6	Use explicit threads. Remove 'async' usage a much as feasible. Async code is harder to debug, and mixing async and non-async code is a recipe for confusion and bugs. There are a couple of exceptions: - The code in walredo.rs, which needs to read and write to the child process simultaneously, still uses async. It's more convenient there. The 'async' usage is carefully limited to just the functions that communicate with the child process. - Code in walreceiver.rs that uses tokio-postgres to do streaming replication. We have to use async there, because tokio-postgres is async. Most rust-postgres functionality has non-async wrappers, but not the new replication client code. The async usage is very limited here, too: we use just block_on to call the tokio-postgres functions. The code in 'page_service.rs' now launches a dedicated thread for each connection. This replaces tokio::sync:⌚:channel with std::sync:mpsc in 'seqwait.rs', to make that non-async. It's not a drop-in replacement, though: std::sync::mpsc doesn't support multiple consumers, so we cannot share a channel between multiple waiters. So this removes the code to check if an existing channel can be reused, and creates a new one for each waiter. That created another problem: BTreeMap cannot hold duplicates, so I replaced that with BinaryHeap. Similarly, the tokio::{mpsc, oneshot} channels used between WAL redo manager and PageCache are replaced with std::sync::mpsc. (There is no separate 'oneshot' channel in the standard library.) Fixes github issue #58, and coincidentally also issue #66.	2021-04-26 13:07:51 +03:00
Konstantin Knizhnik	5292b502f3	Check regression test exit status	2021-04-26 11:06:31 +03:00
Konstantin Knizhnik	abcecc992e	[refer #67 ] Replace File.write with File.write_all	2021-04-26 09:30:03 +03:00
Eric Seppanen	96b6f350a7	add test cases for Lsn math and AtomicLsn	2021-04-25 19:37:02 -07:00
Eric Seppanen	648755a25e	add Lsn::block_offset, remaining_in_block, calc_padding Replace open-coded math with member fns.	2021-04-25 19:37:02 -07:00
Eric Seppanen	1c775bdcac	Drop LSNs from PageCacheStats There's no clear way to sum LSNs across timelines, so just remove them for now.	2021-04-25 19:37:02 -07:00
Eric Seppanen	07d0241076	add AtomicLsn AtomicLsn is a wrapper around AtomicU64 that has load() and store() members that are cheap (on x86, anyway) and can be safely used in any context. This commit uses AtomicLsn in the page cache, and fixes up some downstream code that manually implemented LSN formatting. There's also a bugfix to the logging in wait_lsn, which prints the wrong lsn value.	2021-04-25 19:37:02 -07:00
Eric Seppanen	d760446053	remove Lsn::sub in favor of sub_checked There is only one place doing subtraction, and it had a manually implemented check.	2021-04-25 19:37:02 -07:00
Eric Seppanen	01e239afa3	apply Lsn type everywhere Use the `Lsn` type everywhere that I can find u64 being used to represent an LSN.	2021-04-25 19:37:02 -07:00
Eric Seppanen	f62ce4bcf7	make seqwait generic SeqWait can use any type that is Ord + Debug + Copy. Debug is not strictly necessary, but allows us to keep the panic message if a caller wants the sequence number to go backwards.	2021-04-25 19:37:02 -07:00
Eric Seppanen	3d3eb0ed16	add Lsn type This type is a zero-cost wrapper for a u64, meant to help code communicate with precision what that value means. It implements Display and Debug. Display "{}" will format as "1234ABCD:5678CDEF" while Debug will format as Lsn{1234567890}.	2021-04-25 19:37:02 -07:00
Konstantin Knizhnik	da9bf5dc63	Store atomic last_valid_lsn after seqwait_lsn.advance	2021-04-25 14:11:31 +03:00
Eric Seppanen	1cb9b5523b	cargo fmt	2021-04-24 16:03:44 -07:00
Konstantin Knizhnik	968cd8f20c	Do not delete versions in GC	2021-04-24 23:52:50 +03:00
Konstantin Knizhnik	3e007b0eb9	Do not delete versions in GC	2021-04-24 22:32:22 +03:00
Heikki Linnakangas	5e0cc89de8	Re-group functions in page_cache.rs, and add comments.	2021-04-24 17:54:31 +03:00
Heikki Linnakangas	0fc05569e0	Improve comments in page_cache.rs. Explain the mix of async and other functions in the page cache.	2021-04-24 17:54:28 +03:00
Heikki Linnakangas	021462da3e	Refactor put_wal_record() so that it doesn't need to be marked 'async'. It was only marked as async because it calls relsize_get(), but relsize_get() will in fact never block when it's called with the max LSN value, like put_wal_record() does. Refactor to avoid marking put_wal_record() as 'async'.	2021-04-24 17:54:26 +03:00
Heikki Linnakangas	93d7d2ae2a	Refactor pagecache <-> Wal redo communication After the rocksdb patch (commit `6aa38d3f7d`), the CacheEntry struct was used only momentarily in the communication between the page_cache and the walredo modules. It was in fact not stored in any cache anymore. For clarity, refactor the communication. There is now a WalRedoManager struct, with `request_redo` function, that can be used to request WAL replay of a particular page. It sends a request to a queue like before, but the queue has been replaced with tokio::sync::mpsc. Previously, the resulting page image was stored directly in the CacheEntry, and the requestor was notified using a condition variable. Now, the requestor includes a 'oneshot' channel in the request, and the WAL redo manager sends the response there.	2021-04-24 12:24:04 +03:00
Eric Seppanen	fe79082e29	require documentation in seqwait.rs	2021-04-23 15:01:22 -07:00
Eric Seppanen	6dfe196c40	add .zenith to .gitignore	2021-04-23 14:19:24 -07:00
Eric Seppanen	8beaf76c85	SeqWait: don't do wakeups under the lock Clippy pointed out that `drop(waiters)` didn't do anything, because there was a misplaced ";" causing `waiters` to be a unit type `()`. This change makes it do what was intended: the lock should be dropped first, then the wakeups should be processed.	2021-04-23 14:16:34 -07:00
Konstantin Knizhnik	499b4f7eba	Log garbage collection statistics	2021-04-23 18:02:58 +03:00
Konstantin Knizhnik	52ee3a2bac	Support CREATE DATABASE command	2021-04-23 17:03:56 +03:00
anastasia	b64bd2a8af	handle XLOG_DBASE_CREATE in waldecoder	2021-04-23 14:06:09 +03:00
anastasia	573f1ada83	[issue #56 ] Fix race at postgres instance + walreceiver start. Uses postgres/vendor issue_56_rebased branch.	2021-04-23 13:35:30 +03:00
Konstantin Knizhnik	904ccbdb70	Merge pull request #62 from zenithdb/dump_log_files Wait WAL receiver to start	2021-04-23 12:45:59 +03:00
Konstantin Knizhnik	59b23fef64	Wait for WAL receiver to start	2021-04-23 12:40:29 +03:00
Konstantin Knizhnik	0eaff5aa7f	Fix pageserver.log path	2021-04-23 11:37:28 +03:00

1 2 3 4 5 ...

257 Commits