rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-06 21:12:55 +00:00

Author	SHA1	Message	Date
Patrick Insinger	cff4572774	Avoid race in `get_layer_for_write` Implement the changes suggested in a comment, create `get_layer_for_read_locked` so that `get_layer_for_write` doesn't have to drop the LayerMap lock when searching for the predecessor.	2021-09-14 11:24:24 -07:00
Dmitry Rodionov	84008a2560	factor out common logging initialisation routine This contains a lowest common denominator of pageserver and safekeeper log initialisation routines. It uses daemonize flag to decide where to stream log messages. In case daemonize is true log messages are forwarded to file. Otherwise streaming to stdout is used. Usage of stdout for log output is the default in docker side of things, so make it easier to browse our logs via builtin docker commands.	2021-09-14 18:09:14 +03:00
Dmitry Ivanov	6b7f3bc78c	Add inter-repo CI job to CircleCI configuration This job will be responsible for triggering remote CI pipeline in zenithdb/console repository. That way, we'll always know when a PR to zenithdb/zenith breaks the cloud console app.	2021-09-14 16:56:04 +03:00
Arseny Sher	a68c23448a	Skip the bootstrap hole in safekeeper's find_end_of_wal. Otherwise restart of safekeeper before the first segment is filled makes it report 0 as flushed LSN. To this end, tweak find_end_of_wal_segment to allow starting from given LSN, not only from the start of the segment. While here, make it less panicky.	2021-09-13 22:46:04 +03:00
Dmitry Rodionov	9043f45489	removes protobuf dependency (brought by prometheus default features)	2021-09-13 15:57:41 +03:00
Heikki Linnakangas	6afd99c73f	Fix misc typos in comments.	2021-09-13 12:31:04 +03:00
nkotlyarov	18b5165b22	Update README.md typo	2021-09-12 15:35:18 +03:00
Arseny Sher	6dc66eefb6	bump vendor/postgres	2021-09-11 06:10:10 +03:00
Arseny Sher	0aec60938a	Make flush_lsn reported by safekeepers point to record boundary. Otherwise we produce corrupted record holes in WAL during compute node restart in case there was an unfinished record from the old compute, as these reports advance commit_lsn -- reliably persisted part of WAL. ref #549. Mostly by @knizhnik. I adjusted to make sure proposer always starts streaming since record beginning so we don't need special quirks for decoding in safekeeper.	2021-09-11 06:10:10 +03:00
Patrick Insinger	7c62a57e54	initialize tenant_mgr after daemonizing Ran into problems launching the WAL redo process on OS X after 4b73ad. Launching the `initdb` process was met with "bad file descriptor" errors. Using dtrace, I found shortly after calling `posix_spawn` for `initdb`, `kevent` was returning this error. I haven't dug super deep to see if the daemonization itself is the problem, but this commit fixes it for me. My hunch is that some file descriptors used when the Tokio runtime is initailzed become invalid in the daemon process.	2021-09-10 13:00:39 +03:00
Heikki Linnakangas	59e7ca585d	Minor fixes	2021-09-10 12:43:11 +03:00
anastasia	3dea06b825	Update layered_repository/README.md	2021-09-10 12:43:11 +03:00
Heikki Linnakangas	ab33614ab1	Forbid adding WAL to the repository after advancing last record LSN. When you advance last record LSN, all changes up to that LSN should be imported into repository. We have been a bit sloppy about that when it comes to the checkpoint information that we also store in the repository. In WAL receiver, for example, we would receive a WAL record, advance last record LSN, and only then update the checkpoint relish at the same LSN. Reorder that so that you advance the last record LSN only after updating the checkpoint relish. It hasn't apparently caused any problems so far, but let's be tidy. Tighten the check for that in get_layer_for_write(), so that it checks for 'lsn > last_record_lsn' rather than 'lsn >= last_record_lsn'.	2021-09-10 10:59:09 +03:00
Heikki Linnakangas	03dff207db	Remove start_lsn arg from `create_empty_repository`. Always use lsn(0) as the initial last_record_lsn. It is updated soon after creating the timeline anyway, after loading the bootstrap data, so it doesn't stay long in that state. I was a bit worried about using a special value like 0, but it's actually nice that you can distinguish it from any real LSN value. The unit tests have been using Lsn(0) as the initial start LSN all along.	2021-09-10 10:24:35 +03:00
Heikki Linnakangas	6a8785379a	Add explicit 'wait_lsn' calls before get_page_at_lsn and such calls. Move the responsibility to wait for the WAL to arrive to the callers, and remove the wait_lsn() calls from the Timeline::get_page_at_lsn() and friends. We were not totally consistent before, list_rels() was missing the wait_lsn() call for example. Closes https://github.com/zenithdb/zenith/issues/521	2021-09-10 09:56:11 +03:00
Heikki Linnakangas	507177b42e	Refactor code to handle incoming page requests.	2021-09-09 18:48:46 +03:00
anastasia	b79754d06e	list_rels() and list_nonrels() refactoring: move shared code to list_relishes() function.	2021-09-09 16:05:32 +03:00
anastasia	674807eee1	Add test for dropped reltaions. Fix list_rels() and list_nonrels() functions	2021-09-09 16:05:32 +03:00
Konstantin Knizhnik	30c0343727	Use layer start_lsn instead of *entry_lsn as LSN to continue WAL record traversal at next layer (#573 ) refer #532	2021-09-09 15:15:50 +03:00
Dmitry Rodionov	4fae115dc2	propagate pageserver http error messages to zenith cli	2021-09-08 17:32:59 +03:00
anastasia	3d17255400	Add comment to 'pg stop' changes	2021-09-08 14:12:00 +03:00
anastasia	5488ce8834	Change CLI command 'pg stop' to avoid races in tests. Stop postgres immediately only when destroy option is used. Otherwise, use default shutdown mode (fast).	2021-09-08 14:12:00 +03:00
Max Sharnoff	d7313bb85c	Switch tokio-postgres dependency to git repo The other crates in this repository use zenithdb/rust-postgres as a dependency for the related items, instead of the crates.io versions. Switching to using that for the proxy as well removes an additional three dependencies when we compile. (319 -> 316)	2021-09-07 19:49:03 -07:00
Dmitry Rodionov	4b73ada26e	fix connection error appeared on zenith start by binding sockets before daemonization also use less annoying error reporting by not printing full error messages for connect errors in first several connection retries closes #507	2021-09-07 20:50:27 +03:00
Dmitry Rodionov	b4ecae33e4	add incremental tracking of logical timeline size In order to exclude problems with synchronizing disk and memory logical size is not stored in metadata on disk. It is calculated on timeline "start" by scanning the contents of layered repo and then size is maintained via an atomic variable. This patch also adds new endpoint to pageserver http api: branch detail. It allows retrieval of a particular branch info by its name. Size info is also added to the response of the endpoint and used in tests.	2021-09-07 18:25:15 +03:00
Patrick Insinger	1b9e49eb60	pageserver - update `unload()` comment Update comment to reflect changes made in 5ac4a2 and 98f496	2021-09-07 08:19:42 -07:00
Heikki Linnakangas	7a03e32dd5	Use Rust shorthand range syntax	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	018a606987	Refactor code in LayerMap, for readability - Reorder the structs and functions - Delegate many of the operations in LayerMap to SegEntry. For example, `LayerMap::insert_open` now looks up the right SegEntry struct, and then calls `SegEntry::insert_open` on it. - Use HashMap::entry() function with or_default() to implement the lookups with less code	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	26782851a9	Rename OpenSegEntry to OpenLayerEntry That's more appropriate: it's a struct that holds a Layer, not a segment.	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	04ee1d5977	Add test for managing old open segments in binary heap. I thought this test would trigger the bug fixed previous commit, but it did not. More tests are nice in any case.	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	6245702c7c	Comment fixes	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	9098f2159d	Fix comparison routines of OpenSegEntry Commit `66929ad6fb` added a 'generation' number to open segments stored in the layer map, to distinguish old layers from layers that were added to the map during checkpoint processing. But it neglected the OpenSegEntry::cmp() function. It seems that the cmp() function is never used by BinaryHeap, so this didn't cause any user-visible bugs (I tried adding a panic() to the cmp() function and it didn't fire). But it's clearly wrong and we need to fix it, anyway.	2021-09-07 18:10:07 +03:00
Kirill Bulatov	292bdaa6a7	Update documentation to note some Postgres specifics	2021-09-07 17:48:41 +03:00
anastasia	6f0c065743	preserve filediff artifacts in CI	2021-09-07 16:58:21 +03:00
anastasia	94c50e3e90	Fix check_restored_datadir_content(). Call 'basebackup' command directly, instead of relying on CLI	2021-09-07 16:58:21 +03:00
Konstantin Knizhnik	f83108002b	Revert "Bump postgres version" This reverts commit `511873aaed`.	2021-09-07 15:06:43 +03:00
Konstantin Knizhnik	511873aaed	Bump postgres version	2021-09-07 15:05:08 +03:00
anastasia	eb3fd7a8da	print diff for mismatching files in check_restored_datadir_content()	2021-09-06 18:21:23 +03:00
Konstantin Knizhnik	a3214e982d	Transaction commit redo handler should set TRANSACTION_STATUS_COMMITTED status for subtransactions, not TRANSACTION_STATUS_SUB_COMMITTED Closes #535	2021-09-06 18:21:23 +03:00
anastasia	1e172230ce	Add test funciton to compare files in compute nodes to catch bugs in SLRU replay. Compare files in existing compute node's pgdata with fresh basebackup at the same lsn. We expect that content is identical, except tmp files Use it after some tests.	2021-09-06 18:21:23 +03:00
Arseny Sher	51d36b9930	bump vendor/postgres	2021-09-06 13:06:20 +03:00
Arseny Sher	d1f0b1eda4	Adapt safekeepers to --sync-safekeepers walproposer mode. 1) Do epoch switch without record from new epoch, immediately after recovery -- --sync-safekeepers mode doesn't generate new records. 2) Fix commit_lsn advancement by taking into account wal we have locally -- setting it further is incorrect. 3) Report it back to walproposer so he knows when sync is done. 4) Remove system id check as it is unknown in sync mode. And make logging slightly better. ref #439	2021-09-06 13:06:20 +03:00
Stas Kelvich	ed4eed0a19	Make use of `postgres --sync-safekeepers` in tests and CLI. Change control plane code to call `postgres --sync-safekeepers` before compute node start when safekeepers are enabled. Now `pg create` will create an empty data directory with the proper config file. Subsequent `pg start` will run `sync-safekeepers` and will call basebackup with the resulting LSN. Also change few tests to accommodate this new behavior.	2021-09-06 13:06:20 +03:00
Konstantin Knizhnik	2cf3a70be5	Add description of Zenith changes in Postgres core (#533 ) * Add description of Zenith changes in Postgres core * Update README.md	2021-09-03 19:48:26 +03:00
Kirill Bulatov	6d42ea47bf	Check rusage return code	2021-09-03 17:29:23 +03:00
Konstantin Knizhnik	b227c63edf	Set proper xl_prev in basebackup, when possible. In a passing fix two minor issues with basabackup: * check that we can't create branches with pre-initdb LSN's * normalize branch LSN's that are pointing to the segment boundary patch by @knizhnik closes #506	2021-09-03 14:58:59 +03:00
anastasia	45c09c1cdd	Add LayerMap.dump() funciton for debugging. Print timelineid in layer dumps	2021-09-03 11:00:38 +03:00
anastasia	66dcaa4e01	Rename put_unlink() to drop_relish() in Timeline trait. Rename put_unlink() to drop_segment() in Layer trait.	2021-09-03 11:00:38 +03:00
anastasia	a7de53d4c4	Improve comments for Layer trait.	2021-09-03 11:00:38 +03:00
anastasia	fabf5ec664	Don't use term 'snapshot' to describe layers	2021-09-03 11:00:38 +03:00

1 2 3 4 5 ...

909 Commits