rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2025-12-27 16:12:56 +00:00

Author	SHA1	Message	Date
Konstantin Knizhnik	f817985f2b	Do not throw error on attempt to drop unexisted relation	2021-09-23 12:11:06 +03:00
Konstantin Knizhnik	2e94e1428e	Remove import_timeline_wal function	2021-09-23 12:11:06 +03:00
Konstantin Knizhnik	9a486ca109	Do not write WAL at pageserver	2021-09-23 12:11:06 +03:00
Max Sharnoff	90ef661673	Fix rustc & clippy warnings for nightly (2021-09-19) (#629 ) Fix clippy warnings for nightly (2021-09-19)	2021-09-22 11:24:43 -07:00
Dmitry Rodionov	579b5ee944	exclude labels formatting for every operation in LOGICAL_TIMELINE_SIZE gauge metric	2021-09-22 18:03:48 +03:00
Heikki Linnakangas	a91eeb1c65	Buffer the writes when writing a layer to disk. Significantly reduces the CPU time spent on libc::write.	2021-09-21 16:54:29 +03:00
Patrick Insinger	ea4c3639e3	Include layer metadata in layer summary chapters Include all data stored in layer filenames and the tenant+timeline IDs inside a summary chapter. Use this chapter in the `dump_layerfile` utility.	2021-09-20 07:57:51 -07:00
Heikki Linnakangas	745627c8ca	Remove unused FE/BE ControlFile message. It's a remnant of some old tests in Zenith, but isn't used anymore. It doesn't exist in PostgreSQL.	2021-09-17 20:06:04 +03:00
Heikki Linnakangas	540973eac4	Don't get confused on request of latest page version with very old LSN. If the 'latest' flag in the client request is true, the client wants the latest page version regardless of the LSN in the request. The LSN is just a hint in that case, indicating that the page hasn't been modified since since that LSN. The LSN can be very old, so it's possible that the page server has already garbage collected away the layer at that LSN. We tried to fetch the old layer and errored out if that happened. To fix, always fetch the data as of last-record-LSN, if 'latest' is set in the client request. We now only use the LSN to wait if the requested LSN hasn't been received and processed yet. Fixes https://github.com/zenithdb/zenith/issues/567	2021-09-17 18:56:05 +03:00
Heikki Linnakangas	ad5f16f724	Improve the protocol between Postgres and page server. - Use different message formats for different kinds of response messages. - Add an Error message, for passing errors from page server to Postgres. Previously, we would respond to 'exists' request with 'false', and to 'nblocks' request with 0, if an error happened. Fix those to return an error message to the client. GetPage requests had a mechanism to return an error, but it was just a flag with no error message. - Add a flag to requests, to indicate that we actually want the latest page version on the timeline, and the LSN is just a hint that we know that there haven't been any modifications since that LSN. The flag isn't used for anything yet, but I'm planning to use it to fix https://github.com/zenithdb/zenith/issues/567	2021-09-17 16:38:14 +03:00
Kirill Bulatov	1d5abf1253	Initial version of the relish storage	2021-09-17 15:30:22 +03:00
Max Sharnoff	3743344e64	Add `get_timeline_for_tenant()` to `tenant_mgr` (#615 ) Most of the previous usages of get_repository_for_tenant were followed by immediately getting a timeline in that repository, without keeping it around for longer. The new `get_timeline_for_tenant` function implements that same behavior, but in one line.	2021-09-16 10:38:21 -07:00
Kirill Bulatov	7dda9f2894	Fix clippy lints and enable clippy checking in CI	2021-09-16 15:09:16 +03:00
anastasia	8de41f1d70	Change checkpoint_distance type to u64	2021-09-16 12:33:50 +03:00
anastasia	6984d33b4e	Run GC and checkpointer separate threads. Add checkpoint_period configuration parameter	2021-09-16 12:33:50 +03:00
anastasia	98d4f9cea5	Add checkpoint_distance config parameter. - Change hardcoded OLDEST_INMEM_DISTANCE value to pageserver config option checkpoint_distance. - Get rid of 'force' flag in checkpoint_internal(). Use checkpoint_distance=0 instead.	2021-09-16 12:33:50 +03:00
Patrick Insinger	25b7d424ab	Prevent frozen InMemoryLayer races Instead of panicking when a race happens, retry the operation after getting a new layer.	2021-09-15 20:50:51 -07:00
Patrick Insinger	a5bd306db9	Ensure InMemoryLayer predecessor updated correctly When the new open InMemoryLayer predecessor is updated, ensure it was pointing to the old frozen layer.	2021-09-15 16:04:49 -07:00
Patrick Insinger	0cbee4a416	Don't hold lock on LayerMap while writing to disk	2021-09-15 16:04:49 -07:00
Patrick Insinger	91ff09151d	Remove disk IO from `InMemoryLayer::freeze` Move the creation of Image and Delta layers from `InMemoryLayer::freeze()` to `InMemoryLayer::write_to_disk`.	2021-09-15 16:04:49 -07:00
Patrick Insinger	fea5954b18	Change filling gap println! to trace!	2021-09-15 14:22:04 -07:00
Max Sharnoff	b11b0bb088	bin_ser: reject trailing bytes by default (#587 ) Changes `LeSer`/`BeSer::des`. Also adds a new `des_prefix` function to keep a way to allow trailing bytes.	2021-09-15 11:48:19 -07:00
Kirill Bulatov	3ab60ce76f	Unify tokio deps and bump cargo resolver version	2021-09-15 16:00:08 +03:00
Dmitry Rodionov	4ebe643d0c	Support parallel test running for python tests Support is done via pytest-xdist plugin. To use the feature add -n<concurrency> to pytest invocation e.g. pytest -n8 to run 8 tests in parallel. Changes in code are mostly about ports assigning. Previously port for pageserver was hardcoded without the ability to override through zenith cli and ports for started compute nodes were calculated twice, in zenith cli and in test code. Now zenith cli supports port arguments for pageserver and compute nodes to be passed explicitly. Tests are modified in such a way that each worker gets a non overlapping port range which can be configured and now contains 100 ports. These ports are distributed to test services (pageserver, wal acceptors, compute nodes) so they can work independently.	2021-09-15 14:02:15 +03:00
Patrick Insinger	d150f3ce8c	Detect writes on frozen InMemoryLayers Data written to frozen layers is lost. It will not appear in on-disk structures or in successor InMemoryLayers. Here we detect this race, and fail. I think this race is rare, but this should make it easier to track down when it happens.	2021-09-14 11:44:48 -07:00
Patrick Insinger	cff4572774	Avoid race in `get_layer_for_write` Implement the changes suggested in a comment, create `get_layer_for_read_locked` so that `get_layer_for_write` doesn't have to drop the LayerMap lock when searching for the predecessor.	2021-09-14 11:24:24 -07:00
Dmitry Rodionov	84008a2560	factor out common logging initialisation routine This contains a lowest common denominator of pageserver and safekeeper log initialisation routines. It uses daemonize flag to decide where to stream log messages. In case daemonize is true log messages are forwarded to file. Otherwise streaming to stdout is used. Usage of stdout for log output is the default in docker side of things, so make it easier to browse our logs via builtin docker commands.	2021-09-14 18:09:14 +03:00
Heikki Linnakangas	6afd99c73f	Fix misc typos in comments.	2021-09-13 12:31:04 +03:00
Arseny Sher	0aec60938a	Make flush_lsn reported by safekeepers point to record boundary. Otherwise we produce corrupted record holes in WAL during compute node restart in case there was an unfinished record from the old compute, as these reports advance commit_lsn -- reliably persisted part of WAL. ref #549. Mostly by @knizhnik. I adjusted to make sure proposer always starts streaming since record beginning so we don't need special quirks for decoding in safekeeper.	2021-09-11 06:10:10 +03:00
Patrick Insinger	7c62a57e54	initialize tenant_mgr after daemonizing Ran into problems launching the WAL redo process on OS X after 4b73ad. Launching the `initdb` process was met with "bad file descriptor" errors. Using dtrace, I found shortly after calling `posix_spawn` for `initdb`, `kevent` was returning this error. I haven't dug super deep to see if the daemonization itself is the problem, but this commit fixes it for me. My hunch is that some file descriptors used when the Tokio runtime is initailzed become invalid in the daemon process.	2021-09-10 13:00:39 +03:00
Heikki Linnakangas	59e7ca585d	Minor fixes	2021-09-10 12:43:11 +03:00
anastasia	3dea06b825	Update layered_repository/README.md	2021-09-10 12:43:11 +03:00
Heikki Linnakangas	ab33614ab1	Forbid adding WAL to the repository after advancing last record LSN. When you advance last record LSN, all changes up to that LSN should be imported into repository. We have been a bit sloppy about that when it comes to the checkpoint information that we also store in the repository. In WAL receiver, for example, we would receive a WAL record, advance last record LSN, and only then update the checkpoint relish at the same LSN. Reorder that so that you advance the last record LSN only after updating the checkpoint relish. It hasn't apparently caused any problems so far, but let's be tidy. Tighten the check for that in get_layer_for_write(), so that it checks for 'lsn > last_record_lsn' rather than 'lsn >= last_record_lsn'.	2021-09-10 10:59:09 +03:00
Heikki Linnakangas	03dff207db	Remove start_lsn arg from `create_empty_repository`. Always use lsn(0) as the initial last_record_lsn. It is updated soon after creating the timeline anyway, after loading the bootstrap data, so it doesn't stay long in that state. I was a bit worried about using a special value like 0, but it's actually nice that you can distinguish it from any real LSN value. The unit tests have been using Lsn(0) as the initial start LSN all along.	2021-09-10 10:24:35 +03:00
Heikki Linnakangas	6a8785379a	Add explicit 'wait_lsn' calls before get_page_at_lsn and such calls. Move the responsibility to wait for the WAL to arrive to the callers, and remove the wait_lsn() calls from the Timeline::get_page_at_lsn() and friends. We were not totally consistent before, list_rels() was missing the wait_lsn() call for example. Closes https://github.com/zenithdb/zenith/issues/521	2021-09-10 09:56:11 +03:00
Heikki Linnakangas	507177b42e	Refactor code to handle incoming page requests.	2021-09-09 18:48:46 +03:00
anastasia	b79754d06e	list_rels() and list_nonrels() refactoring: move shared code to list_relishes() function.	2021-09-09 16:05:32 +03:00
anastasia	674807eee1	Add test for dropped reltaions. Fix list_rels() and list_nonrels() functions	2021-09-09 16:05:32 +03:00
Konstantin Knizhnik	30c0343727	Use layer start_lsn instead of *entry_lsn as LSN to continue WAL record traversal at next layer (#573 ) refer #532	2021-09-09 15:15:50 +03:00
Dmitry Rodionov	4b73ada26e	fix connection error appeared on zenith start by binding sockets before daemonization also use less annoying error reporting by not printing full error messages for connect errors in first several connection retries closes #507	2021-09-07 20:50:27 +03:00
Dmitry Rodionov	b4ecae33e4	add incremental tracking of logical timeline size In order to exclude problems with synchronizing disk and memory logical size is not stored in metadata on disk. It is calculated on timeline "start" by scanning the contents of layered repo and then size is maintained via an atomic variable. This patch also adds new endpoint to pageserver http api: branch detail. It allows retrieval of a particular branch info by its name. Size info is also added to the response of the endpoint and used in tests.	2021-09-07 18:25:15 +03:00
Patrick Insinger	1b9e49eb60	pageserver - update `unload()` comment Update comment to reflect changes made in 5ac4a2 and 98f496	2021-09-07 08:19:42 -07:00
Heikki Linnakangas	7a03e32dd5	Use Rust shorthand range syntax	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	018a606987	Refactor code in LayerMap, for readability - Reorder the structs and functions - Delegate many of the operations in LayerMap to SegEntry. For example, `LayerMap::insert_open` now looks up the right SegEntry struct, and then calls `SegEntry::insert_open` on it. - Use HashMap::entry() function with or_default() to implement the lookups with less code	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	26782851a9	Rename OpenSegEntry to OpenLayerEntry That's more appropriate: it's a struct that holds a Layer, not a segment.	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	04ee1d5977	Add test for managing old open segments in binary heap. I thought this test would trigger the bug fixed previous commit, but it did not. More tests are nice in any case.	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	6245702c7c	Comment fixes	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	9098f2159d	Fix comparison routines of OpenSegEntry Commit `66929ad6fb` added a 'generation' number to open segments stored in the layer map, to distinguish old layers from layers that were added to the map during checkpoint processing. But it neglected the OpenSegEntry::cmp() function. It seems that the cmp() function is never used by BinaryHeap, so this didn't cause any user-visible bugs (I tried adding a panic() to the cmp() function and it didn't fire). But it's clearly wrong and we need to fix it, anyway.	2021-09-07 18:10:07 +03:00
Konstantin Knizhnik	a3214e982d	Transaction commit redo handler should set TRANSACTION_STATUS_COMMITTED status for subtransactions, not TRANSACTION_STATUS_SUB_COMMITTED Closes #535	2021-09-06 18:21:23 +03:00
Arseny Sher	d1f0b1eda4	Adapt safekeepers to --sync-safekeepers walproposer mode. 1) Do epoch switch without record from new epoch, immediately after recovery -- --sync-safekeepers mode doesn't generate new records. 2) Fix commit_lsn advancement by taking into account wal we have locally -- setting it further is incorrect. 3) Report it back to walproposer so he knows when sync is done. 4) Remove system id check as it is unknown in sync mode. And make logging slightly better. ref #439	2021-09-06 13:06:20 +03:00

1 2 3 4 5 ...

409 Commits