rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-07 05:22:56 +00:00

Author	SHA1	Message	Date
Konstantin Knizhnik	59ea3973a4	Set hint bits in pageserver	2021-09-10 18:27:34 +03:00
Konstantin Knizhnik	08bc808043	Create branch just to run tests	2021-09-07 15:12:39 +03:00
Konstantin Knizhnik	ba563ee93e	Revert "Bump postgres version" This reverts commit `511873aaed`.	2021-09-07 15:12:39 +03:00
anastasia	194b33ac3b	print diff for mismatching files in check_restored_datadir_content()	2021-09-07 15:12:39 +03:00
Konstantin Knizhnik	a190c0eb88	Transaction commit redo handler should set TRANSACTION_STATUS_COMMITTED status for subtransactions, not TRANSACTION_STATUS_SUB_COMMITTED Closes #535	2021-09-07 15:12:39 +03:00
anastasia	2b5405ac6e	Add test funciton to compare files in compute nodes to catch bugs in SLRU replay. Compare files in existing compute node's pgdata with fresh basebackup at the same lsn. We expect that content is identical, except tmp files Use it after some tests.	2021-09-07 15:12:39 +03:00
Arseny Sher	1d75c827a0	Adapt safekeepers to --sync-safekeepers walproposer mode. 1) Do epoch switch without record from new epoch, immediately after recovery -- --sync-safekeepers mode doesn't generate new records. 2) Fix commit_lsn advancement by taking into account wal we have locally -- setting it further is incorrect. 3) Report it back to walproposer so he knows when sync is done. 4) Remove system id check as it is unknown in sync mode. And make logging slightly better. ref #439	2021-09-07 15:12:39 +03:00
Stas Kelvich	e1e43f13df	Make use of `postgres --sync-safekeepers` in tests and CLI. Change control plane code to call `postgres --sync-safekeepers` before compute node start when safekeepers are enabled. Now `pg create` will create an empty data directory with the proper config file. Subsequent `pg start` will run `sync-safekeepers` and will call basebackup with the resulting LSN. Also change few tests to accommodate this new behavior.	2021-09-07 15:12:39 +03:00
Konstantin Knizhnik	b2e0490d5e	Add description of Zenith changes in Postgres core (#533 ) * Add description of Zenith changes in Postgres core * Update README.md	2021-09-07 15:12:39 +03:00
Kirill Bulatov	1d3c86e17a	Check rusage return code	2021-09-07 15:12:39 +03:00
Konstantin Knizhnik	e8c22488b9	Set proper xl_prev in basebackup, when possible. In a passing fix two minor issues with basabackup: * check that we can't create branches with pre-initdb LSN's * normalize branch LSN's that are pointing to the segment boundary patch by @knizhnik closes #506	2021-09-07 15:12:39 +03:00
anastasia	9c1dbe3783	Add LayerMap.dump() funciton for debugging. Print timelineid in layer dumps	2021-09-07 15:12:39 +03:00
anastasia	1365f8c703	Rename put_unlink() to drop_relish() in Timeline trait. Rename put_unlink() to drop_segment() in Layer trait.	2021-09-07 15:12:39 +03:00
anastasia	df4ce15456	Improve comments for Layer trait.	2021-09-07 15:12:39 +03:00
anastasia	9ed4db273d	Don't use term 'snapshot' to describe layers	2021-09-07 15:12:39 +03:00
Heikki Linnakangas	21cf4a3e11	Include # of bytes written in pgbench benchmark result Now that the page server collects this metric (since commit `212920e47e`), let's include it in the performance test results The new metric looks like this: performance/test_perf_pgbench.py . [100%] --------------- Benchmark results ---------------- test_pgbench.init: 6.784 s test_pgbench.pageserver_writes: 466 MB <---- THIS IS NEW test_pgbench.5000_xacts: 8.196 s test_pgbench.size: 163 MB =============== 1 passed in 21.00s ===============	2021-09-07 15:12:39 +03:00
Heikki Linnakangas	2c10224c9a	Partial fix for issue with extending relation with a gap. This should fix the sporadic regression test failures we've been seeing lately with "no base img found" errors. This fixes the common case, but one corner case is still not handled: If a relation is extended across a segment boundary, leaving a gap block in the segment preceding the segment containing the target block, the preceding segment will not be padded with zeros correctly. This adds a test case for that, but it's commented out. See github issue https://github.com/zenithdb/zenith/issues/500	2021-09-07 15:12:39 +03:00
Patrick Insinger	c33faf98d1	zenith_utils - box BidiStream::Tls variant Clippy warns that one variant is 40 bytes and the other is 568 bytes. Box the larger variant to avoid this warning	2021-09-07 15:12:39 +03:00
Dmitry Rodionov	95453bc4af	fix clippy warnings	2021-09-07 15:12:39 +03:00
Kirill Bulatov	3a37877edc	Fix some typos	2021-09-07 15:12:39 +03:00
Heikki Linnakangas	2145ec5fe8	Fix infinite loop with forced repository checkpoint. To fix, break out of the loop when you reach an in-memory layer that was created after the checkpoint started. To do that, add a "generation" counter into the layer map. Fixes https://github.com/zenithdb/zenith/issues/494	2021-09-07 15:12:39 +03:00
Konstantin Knizhnik	49d14cbde7	Create branch just to run tests	2021-09-07 13:32:45 +03:00
Heikki Linnakangas	c3cbb56ff8	Refactor Layer::get_page_reconstruct_data function Previously, the InMemoryLayer and DeltaLayer implementations of get_page_reconstruct_data would recursively call the predecessor layer's get_page_reconstruct_data function. Refactor so that we iterate in the caller instead. Make get_page_reconstruct_data() return the predecessor layer along with the continuation LSN, so that the caller can iterate. IMO this makes the logic more clear, although this is more lines of code.	2021-09-02 14:22:29 +03:00
Heikki Linnakangas	81479b0218	Rename 'InMemoryLayer::img_layer' field. DeltaLayer uses the name `predecessor` for the same thing. Use the same name in InMemoryLayer. The 'img_layer' name was misleading, as the predecessor layer is not necessarily an image layer. Currently, the 'freeze' function always creates a new image layer, but it wouldn't have to be that way. Also, when you create a new branch, at the branch point the predecessor layer can be a delta layer on the ancestor branch.	2021-09-02 14:22:26 +03:00
Dmitry Rodionov	3c5452da88	add tenant id tracking to safekeeper Previously timelines were namespaced only by ZTimelineId, so this patch adds ZTenant id to the key of a hashtable closes #381	2021-09-02 12:57:39 +03:00
Stas Kelvich	59c19d6e18	Rework basebackup. * add lsn argument * do not expose wait_lsn, wait inside list_nonrels() * fix parameters parsing * expose get_last_record_rlsn() to atomically read (last,prev) pair More work is needed to correctly handle basebackup@old_lsn but current approach already allows to fix test_restart_compute	2021-09-02 12:06:12 +03:00
Stas Kelvich	8c07a36fda	Remove last_valid_lsn tracking in wal_receiver. There are two main reasons for that: a) Latest unfinished record may disapper after compute node restart, so let's try not leak volatile part of the WAL into the repository. Always use last_valid_record instead. That change requires different getPage@LSN logic in postgres -- we need to ask LSN's that point to some complete record instead of GetFlushRecPtr() that can point in the middle of the record. That was already done by @knizhnik to deal with the same problem during the work on `postgres --sync-safekeepers`. Postgres will use LSN's aligned on 0x8 boundary in get_page requests, so we also need to be sure that last_valid_record is aligned. b) Switch to get_last_record_lsn() in basebackup@no_lsn. When compute node is running without safekeepers and streams WAL directly to pageserver it is important to match basebackup LSN and LSN of replication start. Before this commit basebackup@no_lsn was waiting for last_valid_lsn and walreceiver started replication with last_record_lsn, which can be less. So replication was failing since compute node doesn't have requested WAL.	2021-09-02 12:06:12 +03:00
Stas Kelvich	ddd2c83c64	Change test_restart_compute to expose safekeeper problems. Make this test look like 'test_compute_restart.sh' by @ololobus, which was surprisingly good for checking safekeepers behavior. This test adds an intermediate compute node start with bulk select that causes a lot of FPI's and select itself wouldn't wait for all that WAL to be replicated. So if we kill compute node right after that we end up with lagging safekeepers with VCL != flush_lsn. And starting new node from that state takes special care. Also, run and print `pg_controldata` output after each compute node start to eyeball lsn/checkpoint info of basebackup. This commit only adds test without fixing the problem.	2021-09-02 12:06:12 +03:00
Kirill Bulatov	212920e47e	Collect and expose I/O disk write metrics	2021-09-02 11:33:00 +03:00
Kirill Bulatov	291c2c9a1b	Test readme typo fix	2021-09-02 11:33:00 +03:00
Heikki Linnakangas	d7bebd8074	Add 'dump_layerfile' utility for debugging. Seems handy for getting a quick idea of what's stored in an image or delta layer file. Example output on a file after runnnig pgbench for a while: % ./target/debug/dump_layerfile pgbench_layers/pg_control_checkpoint_0_00000000016B914A ----- image layer for checkpoint.0 at 0/16B914A ---- non-blocky (88 bytes) % ./target/debug/dump_layerfile pgbench_layers/pg_xact_0000_0_000000000412FD40 ----- image layer for pg_xact/0000.0 at 0/412FD40 ---- (1) blocks % ./target/debug/dump_layerfile pgbench_layers/rel_1663_14236_1247_0_0_00000000016B914A_000000000412FD40 \| head -n 20 ----- delta layer for 1663/14236/1247.0 0/16B914A-0/412FD40 ---- --- relsizes --- 0/16B914A: 14 0/16CA559: 15 --- page versions --- blk 13 at 0/16BB1D2: rec 8162 bytes will_init: true HEAP INSERT blk 14 at 0/16CA559: rec 8241 bytes will_init: true XLOG FPI blk 14 at 0/16CA637: rec 215 bytes will_init: true HEAP INSERT blk 14 at 0/16DF14F: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16DF3A7: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E0637: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E088F: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E5F9F: rec 215 bytes will_init: false HEAP INSERT blk 14 at 0/16E620F: rec 215 bytes will_init: false HEAP INSERT	2021-09-01 12:20:16 -07:00
Patrick Insinger	5ac3cb1c72	TLS for postgres_backend and proxy Add TLS support to `postgres_backend`. Implement this support in `proxy`. Other applications must opt-in and provide a `rustls::ServerConfig`.	2021-09-01 10:29:19 -07:00
Dmitry Rodionov	812160ba16	fix XLOG_MULTIXACT_ZERO_MEM_PAGE wal parsing closes #453	2021-09-01 17:02:14 +03:00
Stas Kelvich	91d605f781	Revert accidental commit: "[refer #506 ] Enforce that xl_prev<curr_lsn for created branch" This reverts commit `aae39ecf57`.	2021-09-01 16:30:09 +03:00
Konstantin Knizhnik	aae39ecf57	[refer #506 ] Enforce that xl_prev<curr_lsn for created branch	2021-09-01 16:23:42 +03:00
anastasia	8b3a293bb0	Use postgres_ffi bindings instead of custom type definitions. Move several functions to postgres_ffi crate	2021-09-01 16:11:44 +03:00
Dmitry Rodionov	989ab7e883	move several functions which replicate ones from postgresql to postgres_ffi crate	2021-09-01 16:11:44 +03:00
anastasia	e9d2181e17	Remove obsolete comment	2021-09-01 15:02:37 +03:00
anastasia	8a05d6dde0	Fix 'unrecognized filename in timeline' warning	2021-09-01 15:02:32 +03:00
Heikki Linnakangas	b45d5368b0	Don't create an image layer for dropped relations. I noticed that the timeline directory contained files like this: pg_xact_0000_0_000000000169C3C2_00000000016BB399 pg_xact_0000_0_00000000016BB399 pg_xact_0000_0_00000000016BB399_00000000016BDD06 pg_xact_0000_0_00000000016BDD06 pg_xact_0000_0_00000000016BDD06_00000000016C63AA pg_xact_0000_0_00000000016C63AA pg_xact_0000_0_00000000016C63AA_0000000001765226_DROPPED pg_xact_0000_0_0000000001765226 pg_xact_0001_0_00000000016BB77E_00000000016BDD06 pg_xact_0001_0_00000000016BDD06 pg_xact_0001_0_00000000016BDD06_0000000001765226_DROPPED pg_xact_0001_0_0000000001765226 Note how there is an image file after each DROPPED file. It's a waste of time and space to materialize an image of the file at the point where it's dropped, no one is going to request pages on a dropped relation. And it's a correctness issue too: list_rels() and list_nonrels() will not consider the relation as unlinked, unless the latest layer indicates so, and there is no concept of a dropped image layer. That was causing test_clog_truncate test to fail, when I adjusted the checkpointer to force a checkpoint more aggressively. There are a bunch more issues related to dropped rels and branching, see https://github.com/zenithdb/zenith/issues/502. Hence this doesn't completely fix the issue I saw with test_clog_truncate either. But it's a start.	2021-09-01 09:42:18 +03:00
Max Sharnoff	625abf3c52	Bump vendor/postgres for walproposer cleanup ref zenithdb/postgres#69	2021-08-31 13:09:16 -07:00
anastasia	c0ace1efff	Bump vendor/postgres to use relsize cache.	2021-08-31 14:10:50 +03:00
Kirill Bulatov	03a09b7827	Replace old git urls with the current ones	2021-08-30 23:51:47 +03:00
Heikki Linnakangas	63d0a865f4	Update and move comment. The comment talked about the WAL redo thread, but commit `6e22a8f709` refactored that away. The problem the comment describes probably still exists, so keep the comment, but update the wording.	2021-08-30 20:35:08 +03:00
Patrick Insinger	5ac4a27042	image_layer - read images directly from disk Avoid slurping entire image files into memory. For blocky segments, we write the bytes directly to a bookfile chapter. The blocks are a fixed size, which allows for random access.	2021-08-30 10:34:36 -07:00
Patrick Insinger	7c7e89e2ea	layered_repo - atomic last/prev record_lsn Make a new type that stores both Lsns. Use an RwLock for thread safety.	2021-08-30 09:40:13 -07:00
Patrick Insinger	561bf2c510	circleci - fix test summary	2021-08-30 09:18:49 -07:00
Patrick Insinger	98f49671c1	delta_layer - read page versions from disk split the page versions into two chapters: PAGE_VERSION_METAS - a rust BTreeMap from (block #, lsn) -> page & WAL byte ranges in PAGE_VERSIONS_CHAPTER PAGE_VERSIONS_CHAPTER - raw page images and serialized WAL records	2021-08-30 09:12:38 -07:00
anastasia	78963ad104	Issue #411 . Support drop database in pageserver. Use put_unlink for FileNodeMap relishes. Always store FileNodeMap as materialized page images (no wal records).	2021-08-30 17:29:29 +03:00
anastasia	27442c3daa	Add test for DROP DATABASE command	2021-08-30 17:29:29 +03:00

1 2 3 4 5 ...

775 Commits