rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-08 14:02:55 +00:00

Author	SHA1	Message	Date
Kirill Bulatov	3ab60ce76f	Unify tokio deps and bump cargo resolver version	2021-09-15 16:00:08 +03:00
Dmitry Rodionov	4ebe643d0c	Support parallel test running for python tests Support is done via pytest-xdist plugin. To use the feature add -n<concurrency> to pytest invocation e.g. pytest -n8 to run 8 tests in parallel. Changes in code are mostly about ports assigning. Previously port for pageserver was hardcoded without the ability to override through zenith cli and ports for started compute nodes were calculated twice, in zenith cli and in test code. Now zenith cli supports port arguments for pageserver and compute nodes to be passed explicitly. Tests are modified in such a way that each worker gets a non overlapping port range which can be configured and now contains 100 ports. These ports are distributed to test services (pageserver, wal acceptors, compute nodes) so they can work independently.	2021-09-15 14:02:15 +03:00
Patrick Insinger	d150f3ce8c	Detect writes on frozen InMemoryLayers Data written to frozen layers is lost. It will not appear in on-disk structures or in successor InMemoryLayers. Here we detect this race, and fail. I think this race is rare, but this should make it easier to track down when it happens.	2021-09-14 11:44:48 -07:00
Patrick Insinger	cff4572774	Avoid race in `get_layer_for_write` Implement the changes suggested in a comment, create `get_layer_for_read_locked` so that `get_layer_for_write` doesn't have to drop the LayerMap lock when searching for the predecessor.	2021-09-14 11:24:24 -07:00
Dmitry Rodionov	84008a2560	factor out common logging initialisation routine This contains a lowest common denominator of pageserver and safekeeper log initialisation routines. It uses daemonize flag to decide where to stream log messages. In case daemonize is true log messages are forwarded to file. Otherwise streaming to stdout is used. Usage of stdout for log output is the default in docker side of things, so make it easier to browse our logs via builtin docker commands.	2021-09-14 18:09:14 +03:00
Heikki Linnakangas	6afd99c73f	Fix misc typos in comments.	2021-09-13 12:31:04 +03:00
Arseny Sher	0aec60938a	Make flush_lsn reported by safekeepers point to record boundary. Otherwise we produce corrupted record holes in WAL during compute node restart in case there was an unfinished record from the old compute, as these reports advance commit_lsn -- reliably persisted part of WAL. ref #549. Mostly by @knizhnik. I adjusted to make sure proposer always starts streaming since record beginning so we don't need special quirks for decoding in safekeeper.	2021-09-11 06:10:10 +03:00
Patrick Insinger	7c62a57e54	initialize tenant_mgr after daemonizing Ran into problems launching the WAL redo process on OS X after 4b73ad. Launching the `initdb` process was met with "bad file descriptor" errors. Using dtrace, I found shortly after calling `posix_spawn` for `initdb`, `kevent` was returning this error. I haven't dug super deep to see if the daemonization itself is the problem, but this commit fixes it for me. My hunch is that some file descriptors used when the Tokio runtime is initailzed become invalid in the daemon process.	2021-09-10 13:00:39 +03:00
Heikki Linnakangas	59e7ca585d	Minor fixes	2021-09-10 12:43:11 +03:00
anastasia	3dea06b825	Update layered_repository/README.md	2021-09-10 12:43:11 +03:00
Heikki Linnakangas	ab33614ab1	Forbid adding WAL to the repository after advancing last record LSN. When you advance last record LSN, all changes up to that LSN should be imported into repository. We have been a bit sloppy about that when it comes to the checkpoint information that we also store in the repository. In WAL receiver, for example, we would receive a WAL record, advance last record LSN, and only then update the checkpoint relish at the same LSN. Reorder that so that you advance the last record LSN only after updating the checkpoint relish. It hasn't apparently caused any problems so far, but let's be tidy. Tighten the check for that in get_layer_for_write(), so that it checks for 'lsn > last_record_lsn' rather than 'lsn >= last_record_lsn'.	2021-09-10 10:59:09 +03:00
Heikki Linnakangas	03dff207db	Remove start_lsn arg from `create_empty_repository`. Always use lsn(0) as the initial last_record_lsn. It is updated soon after creating the timeline anyway, after loading the bootstrap data, so it doesn't stay long in that state. I was a bit worried about using a special value like 0, but it's actually nice that you can distinguish it from any real LSN value. The unit tests have been using Lsn(0) as the initial start LSN all along.	2021-09-10 10:24:35 +03:00
Heikki Linnakangas	6a8785379a	Add explicit 'wait_lsn' calls before get_page_at_lsn and such calls. Move the responsibility to wait for the WAL to arrive to the callers, and remove the wait_lsn() calls from the Timeline::get_page_at_lsn() and friends. We were not totally consistent before, list_rels() was missing the wait_lsn() call for example. Closes https://github.com/zenithdb/zenith/issues/521	2021-09-10 09:56:11 +03:00
Heikki Linnakangas	507177b42e	Refactor code to handle incoming page requests.	2021-09-09 18:48:46 +03:00
anastasia	b79754d06e	list_rels() and list_nonrels() refactoring: move shared code to list_relishes() function.	2021-09-09 16:05:32 +03:00
anastasia	674807eee1	Add test for dropped reltaions. Fix list_rels() and list_nonrels() functions	2021-09-09 16:05:32 +03:00
Konstantin Knizhnik	30c0343727	Use layer start_lsn instead of *entry_lsn as LSN to continue WAL record traversal at next layer (#573 ) refer #532	2021-09-09 15:15:50 +03:00
Dmitry Rodionov	4b73ada26e	fix connection error appeared on zenith start by binding sockets before daemonization also use less annoying error reporting by not printing full error messages for connect errors in first several connection retries closes #507	2021-09-07 20:50:27 +03:00
Dmitry Rodionov	b4ecae33e4	add incremental tracking of logical timeline size In order to exclude problems with synchronizing disk and memory logical size is not stored in metadata on disk. It is calculated on timeline "start" by scanning the contents of layered repo and then size is maintained via an atomic variable. This patch also adds new endpoint to pageserver http api: branch detail. It allows retrieval of a particular branch info by its name. Size info is also added to the response of the endpoint and used in tests.	2021-09-07 18:25:15 +03:00
Patrick Insinger	1b9e49eb60	pageserver - update `unload()` comment Update comment to reflect changes made in 5ac4a2 and 98f496	2021-09-07 08:19:42 -07:00
Heikki Linnakangas	7a03e32dd5	Use Rust shorthand range syntax	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	018a606987	Refactor code in LayerMap, for readability - Reorder the structs and functions - Delegate many of the operations in LayerMap to SegEntry. For example, `LayerMap::insert_open` now looks up the right SegEntry struct, and then calls `SegEntry::insert_open` on it. - Use HashMap::entry() function with or_default() to implement the lookups with less code	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	26782851a9	Rename OpenSegEntry to OpenLayerEntry That's more appropriate: it's a struct that holds a Layer, not a segment.	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	04ee1d5977	Add test for managing old open segments in binary heap. I thought this test would trigger the bug fixed previous commit, but it did not. More tests are nice in any case.	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	6245702c7c	Comment fixes	2021-09-07 18:10:07 +03:00
Heikki Linnakangas	9098f2159d	Fix comparison routines of OpenSegEntry Commit `66929ad6fb` added a 'generation' number to open segments stored in the layer map, to distinguish old layers from layers that were added to the map during checkpoint processing. But it neglected the OpenSegEntry::cmp() function. It seems that the cmp() function is never used by BinaryHeap, so this didn't cause any user-visible bugs (I tried adding a panic() to the cmp() function and it didn't fire). But it's clearly wrong and we need to fix it, anyway.	2021-09-07 18:10:07 +03:00
Konstantin Knizhnik	a3214e982d	Transaction commit redo handler should set TRANSACTION_STATUS_COMMITTED status for subtransactions, not TRANSACTION_STATUS_SUB_COMMITTED Closes #535	2021-09-06 18:21:23 +03:00
Arseny Sher	d1f0b1eda4	Adapt safekeepers to --sync-safekeepers walproposer mode. 1) Do epoch switch without record from new epoch, immediately after recovery -- --sync-safekeepers mode doesn't generate new records. 2) Fix commit_lsn advancement by taking into account wal we have locally -- setting it further is incorrect. 3) Report it back to walproposer so he knows when sync is done. 4) Remove system id check as it is unknown in sync mode. And make logging slightly better. ref #439	2021-09-06 13:06:20 +03:00
Konstantin Knizhnik	b227c63edf	Set proper xl_prev in basebackup, when possible. In a passing fix two minor issues with basabackup: * check that we can't create branches with pre-initdb LSN's * normalize branch LSN's that are pointing to the segment boundary patch by @knizhnik closes #506	2021-09-03 14:58:59 +03:00
anastasia	45c09c1cdd	Add LayerMap.dump() funciton for debugging. Print timelineid in layer dumps	2021-09-03 11:00:38 +03:00
anastasia	66dcaa4e01	Rename put_unlink() to drop_relish() in Timeline trait. Rename put_unlink() to drop_segment() in Layer trait.	2021-09-03 11:00:38 +03:00
anastasia	a7de53d4c4	Improve comments for Layer trait.	2021-09-03 11:00:38 +03:00
anastasia	fabf5ec664	Don't use term 'snapshot' to describe layers	2021-09-03 11:00:38 +03:00
Heikki Linnakangas	1686715ad0	Partial fix for issue with extending relation with a gap. This should fix the sporadic regression test failures we've been seeing lately with "no base img found" errors. This fixes the common case, but one corner case is still not handled: If a relation is extended across a segment boundary, leaving a gap block in the segment preceding the segment containing the target block, the preceding segment will not be padded with zeros correctly. This adds a test case for that, but it's commented out. See github issue https://github.com/zenithdb/zenith/issues/500	2021-09-02 22:01:46 +03:00
Dmitry Rodionov	bc709561b6	fix clippy warnings	2021-09-02 18:54:44 +03:00
Kirill Bulatov	0e4cbe0165	Fix some typos	2021-09-02 17:27:18 +03:00
Heikki Linnakangas	66929ad6fb	Fix infinite loop with forced repository checkpoint. To fix, break out of the loop when you reach an in-memory layer that was created after the checkpoint started. To do that, add a "generation" counter into the layer map. Fixes https://github.com/zenithdb/zenith/issues/494	2021-09-02 15:41:40 +03:00
Heikki Linnakangas	c3cbb56ff8	Refactor Layer::get_page_reconstruct_data function Previously, the InMemoryLayer and DeltaLayer implementations of get_page_reconstruct_data would recursively call the predecessor layer's get_page_reconstruct_data function. Refactor so that we iterate in the caller instead. Make get_page_reconstruct_data() return the predecessor layer along with the continuation LSN, so that the caller can iterate. IMO this makes the logic more clear, although this is more lines of code.	2021-09-02 14:22:29 +03:00
Heikki Linnakangas	81479b0218	Rename 'InMemoryLayer::img_layer' field. DeltaLayer uses the name `predecessor` for the same thing. Use the same name in InMemoryLayer. The 'img_layer' name was misleading, as the predecessor layer is not necessarily an image layer. Currently, the 'freeze' function always creates a new image layer, but it wouldn't have to be that way. Also, when you create a new branch, at the branch point the predecessor layer can be a delta layer on the ancestor branch.	2021-09-02 14:22:26 +03:00
Stas Kelvich	59c19d6e18	Rework basebackup. * add lsn argument * do not expose wait_lsn, wait inside list_nonrels() * fix parameters parsing * expose get_last_record_rlsn() to atomically read (last,prev) pair More work is needed to correctly handle basebackup@old_lsn but current approach already allows to fix test_restart_compute	2021-09-02 12:06:12 +03:00
Stas Kelvich	8c07a36fda	Remove last_valid_lsn tracking in wal_receiver. There are two main reasons for that: a) Latest unfinished record may disapper after compute node restart, so let's try not leak volatile part of the WAL into the repository. Always use last_valid_record instead. That change requires different getPage@LSN logic in postgres -- we need to ask LSN's that point to some complete record instead of GetFlushRecPtr() that can point in the middle of the record. That was already done by @knizhnik to deal with the same problem during the work on `postgres --sync-safekeepers`. Postgres will use LSN's aligned on 0x8 boundary in get_page requests, so we also need to be sure that last_valid_record is aligned. b) Switch to get_last_record_lsn() in basebackup@no_lsn. When compute node is running without safekeepers and streams WAL directly to pageserver it is important to match basebackup LSN and LSN of replication start. Before this commit basebackup@no_lsn was waiting for last_valid_lsn and walreceiver started replication with last_record_lsn, which can be less. So replication was failing since compute node doesn't have requested WAL.	2021-09-02 12:06:12 +03:00
Heikki Linnakangas	d7bebd8074	Add 'dump_layerfile' utility for debugging. Seems handy for getting a quick idea of what's stored in an image or delta layer file. Example output on a file after runnnig pgbench for a while: % ./target/debug/dump_layerfile pgbench_layers/pg_control_checkpoint_0_00000000016B914A ----- image layer for checkpoint.0 at 0/16B914A ---- non-blocky (88 bytes) % ./target/debug/dump_layerfile pgbench_layers/pg_xact_0000_0_000000000412FD40 ----- image layer for pg_xact/0000.0 at 0/412FD40 ---- (1) blocks % ./target/debug/dump_layerfile pgbench_layers/rel_1663_14236_1247_0_0_00000000016B914A_000000000412FD40 \| head -n 20 ----- delta layer for 1663/14236/1247.0 0/16B914A-0/412FD40 ---- --- relsizes --- 0/16B914A: 14 0/16CA559: 15 --- page versions --- blk 13 at 0/16BB1D2: rec 8162 bytes will_init: true HEAP INSERT blk 14 at 0/16CA559: rec 8241 bytes will_init: true XLOG FPI blk 14 at 0/16CA637: rec 215 bytes will_init: true HEAP INSERT blk 14 at 0/16DF14F: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16DF3A7: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E0637: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E088F: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E5F9F: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E620F: rec 215 bytes will_init: false HEAP INSERT	2021-09-01 12:20:16 -07:00
Patrick Insinger	5ac3cb1c72	TLS for postgres_backend and proxy Add TLS support to `postgres_backend`. Implement this support in `proxy`. Other applications must opt-in and provide a `rustls::ServerConfig`.	2021-09-01 10:29:19 -07:00
Dmitry Rodionov	812160ba16	fix XLOG_MULTIXACT_ZERO_MEM_PAGE wal parsing closes #453	2021-09-01 17:02:14 +03:00
Stas Kelvich	91d605f781	Revert accidental commit: "[refer #506 ] Enforce that xl_prev<curr_lsn for created branch" This reverts commit `aae39ecf57`.	2021-09-01 16:30:09 +03:00
Konstantin Knizhnik	aae39ecf57	[refer #506 ] Enforce that xl_prev<curr_lsn for created branch	2021-09-01 16:23:42 +03:00
anastasia	8b3a293bb0	Use postgres_ffi bindings instead of custom type definitions. Move several functions to postgres_ffi crate	2021-09-01 16:11:44 +03:00
Dmitry Rodionov	989ab7e883	move several functions which replicate ones from postgresql to postgres_ffi crate	2021-09-01 16:11:44 +03:00
anastasia	e9d2181e17	Remove obsolete comment	2021-09-01 15:02:37 +03:00
anastasia	8a05d6dde0	Fix 'unrecognized filename in timeline' warning	2021-09-01 15:02:32 +03:00

1 2 3 4 5 ...

387 Commits