rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2025-12-27 08:09:58 +00:00

Author	SHA1	Message	Date
Dmitry Rodionov	99e0f07a1d	review adjustments, fancy enum for builder, minor cleanups	2022-02-23 08:33:50 +03:00
Dmitry Rodionov	5d490babf8	add node id to pageserver This adds node id parameter to pageserver configuration. Also I use a simple builder to construct pageserver config struct to avoid setting node id to some temporary invalid value. Some of the changes in test fixtures are needed to split init and start operations for envrionment.	2022-02-23 08:33:50 +03:00
anastasia	1a4682a04a	Add 'walreceiver-after-ingest' failpoint. Use sleep at this point to imitate slow walreceiver.	2022-02-22 13:56:21 +03:00
Stas Kelvich	207286f2b8	Actualize branching parts of openapi spec. Previous version of spec caused parsing errors in generated clients as return type is object not array, also one field was missing. In a passing set `format: hex` on ancestor_id too as value conforms to that format.	2022-02-18 20:22:21 +02:00
Dmitry Rodionov	9cce430430	remove several obsolete management api commands from pageserver's libpq api these commands are now available via http api	2022-02-17 11:26:28 +03:00
Kirill Bulatov	5563ff123f	Reuse tenant-timeline id struct from utils	2022-02-15 17:45:23 +02:00
Heikki Linnakangas	9632c352ab	Avoid having multiple records for the same page and LSN. If a heap UPDATE record modified two pages, and both pages needed to have their VM bits cleared, and the VM bits were located on the same VM page, we would emit two ZenithWalRecord::ClearVisibilityMapFlags records for the same VM page. That produced warnings like this in the pageserver log: Page version Wal(ClearVisibilityMapFlags { heap_blkno: 18, flags: 3 }) of rel 1663/13949/2619_vm blk 0 at 2A/346046A0 already exists To fix, change ClearVisibilityMapFlags so that it can update the bits for both pages as one operation. This was already covered by several python tests, so no need to add a new one. Fixes #1125. Co-authored-by: Konstantin Knizhnik <knizhnik@zenith.tech>	2022-02-15 14:26:16 +02:00
Kirill Bulatov	7c1c7702d2	Code review fixes	2022-02-10 08:33:22 -05:00
Kirill Bulatov	6eef401602	Move routerify behind zenith_utils	2022-02-10 08:33:22 -05:00
Kirill Bulatov	c5b5905ed3	Remove parking_lot dependency from workspace	2022-02-10 08:33:22 -05:00
Kirill Bulatov	76b74349cb	Bump pageserver dependencies	2022-02-10 08:33:22 -05:00
Kirill Bulatov	b67cddb303	Implement EphemeralFile flush in a least dangerous way	2022-02-05 22:02:59 -05:00
Kirill Bulatov	3ed156a5b6	Add a CLI tool to manipulate remote storage blob files	2022-02-05 15:48:08 -05:00
Heikki Linnakangas	2d93b129a0	Avoid eprintln() in pageserver and walkeeper. Use log::error!() instead. I spotted a few of these "connection error" lines in the logs, without timestamps and the other stuff we print for all other log messages.	2022-02-05 17:59:31 +02:00
Dmitry Rodionov	5df21e1058	remove Timeline::start_lsn in favor of ancestor_lsn	2022-01-28 12:31:15 +03:00
Konstantin Knizhnik	f58a22d07e	Freeze layers at the same end LSN (#1182 ) * Freeze vectors at the same end LSN * Fix calculation of last LSN for inmem layer * Do not advance disk_consistent_lsn is no open layer was evicted * Fix calculation of freeze_end_lsn * Let start_lsn be larger than oldest_pending_lsn * Rename 'oldest_pending_lsn' and 'last_lsn', add comments. * Fix future_layerfiles test * Update comments conserning olest_lsn * Update comments conserning olest_lsn Co-authored-by: Heikki Linnakangas <heikki@zenith.tech>	2022-01-27 18:21:00 +03:00
Konstantin Knizhnik	79f0e44a20	Gc cutoff rwlock (#1139 ) * Reproduce github issue #1047. * Use RwLock to protect gc_cuttof_lsn * Eeduce number of updates in test_gc_aggressive * Change test_prohibit_get_page_at_lsn_for_garbage_collected_pages test * Change test_prohibit_get_page_at_lsn_for_garbage_collected_pages * Lock latest_gc_cutoff_lsn in all operations accessing storage to prevent race conditions with GC * Remove random sleep between wait_for_lsn and get_page_at_lsn * Initialize latest_gc_cutoff with initdb_lsn and remove separate check that lsn >= initdb_lsn * Update test_prohibit_branch_creation_on_pre_initdb_lsn test Co-authored-by: Heikki Linnakangas <heikki@zenith.tech>	2022-01-27 14:41:16 +03:00
anastasia	5abe2129c6	Extend replication protocol with ZentihFeedback message to pass current_timeline_size to compute node Put standby_status_update fields into ZenithFeedback and send them as one message. Pass values sizes together with keys in ZenithFeedback message.	2022-01-27 11:20:45 +03:00
Dmitry Rodionov	63dd7bce7e	bandaid to avoid concurrent timeline downloading until proper refactoring/fix	2022-01-26 19:54:09 +03:00
Dmitry Rodionov	37c440c5d3	Introduce first version of tenant migraiton between pageservers This patch includes attach/detach http endpoints in pageservers. Some changes in callmemaybe handling inside safekeeper and an integrational test to check migration with and without load. There are still some rough edges that will be addressed in follow up patches	2022-01-24 17:20:15 +03:00
Konstantin Knizhnik	7bc1274a03	Fix comparison with disk_consistent_lsn in newer_image_layer_exists (#1167 )	2022-01-24 12:19:18 +03:00
Konstantin Knizhnik	e209764877	Do not delete layers beyand cutoff LSN (#1128 ) * Do not delete layers beyand cutoff LSN * Update pageserver/src/layered_repository/layer_map.rs Co-authored-by: Heikki Linnakangas <heikki.linnakangas@iki.fi> Co-authored-by: Heikki Linnakangas <heikki.linnakangas@iki.fi>	2022-01-24 10:42:40 +03:00
Dmitry Rodionov	026eb64a83	Use python lib to mock s3	2022-01-20 18:42:47 +02:00
Kirill Bulatov	45124856b1	Better S3 remote storage logging	2022-01-20 18:42:47 +02:00
Kirill Bulatov	38c6f6ce16	Allow specifying custom endpoint in s3	2022-01-20 18:42:47 +02:00
Dmitry Ivanov	d3542c34f1	Refactoring: use anyhow::Context's methods where possible	2022-01-19 16:33:48 +03:00
Heikki Linnakangas	dab30c27b6	Refactor thread management and shutdown This introduces a new module to handle thread creation and shutdown. All page server threads are now registered in a global hash map, and there's a function to request individual threads to shut down gracefully. Thread shutdown request is signalled to the thread with a flag, as well as a Future that can be used to wake up async operations if shutdown is requested. Use that facility to have the libpq listener thread respond to pageserver shutdown, based on Kirill's earlier prototype (https://github.com/zenithdb/zenith/pull/1088). That addresses https://github.com/zenithdb/zenith/issues/1036, previously the libpq listener thread would not exit until one more connection arrives. This also eliminates a resource leak in the accept() loop. Previously, we added the JoinHanlde of each new thread to a vector but old handles for threads that had already exited were never removed.	2022-01-14 18:36:10 +02:00
Heikki Linnakangas	bad1dd9759	Don't panic if spawning a new WAL receiver thread fails. The panic would kill the page service thread. That's not too bad, but still let's try to handle it more gracefully.	2022-01-14 18:02:34 +02:00
Heikki Linnakangas	d29836d0d5	Don't panic if spawning a thread to handle a connection fails. Log the error and continue. Hopefully it's a transient failure. This might have been happening in staging earlier, when the safekeeper had a problem where it opened connections very frequently to issue "callmemaybe" commands. If you launch too many threads too fast, you might run out of file descriptors or something. It's not totally clear what happened, but with commit, at least the page server will continue to run and accept new connections, if a transient error happens.	2022-01-14 18:02:30 +02:00
Heikki Linnakangas	adb0b3dada	Include backtrace in error messages in the log. 'anyhow' crate can include a backtrace in all errors, when the 'backtrace' feature is enabled. Enable it, and change the places that used '{:#}' or '{}' to '{:?}', so that the backtrace is printed.	2022-01-14 10:10:17 +02:00
Heikki Linnakangas	19aaa91f6d	Timeline IDs are not globally unique, fix some code that assumed that. A timeline ID is only guaranteed to be unique for a particular tenant, so you need to use tenant ID + timeline ID as the key, rather than just timeline ID. The safekeeper currently makes the same assumption, and we should fix that too, but this commit just addresses this one case in the page server. In the passing, reorder some function arguments to be more consistent.	2022-01-13 18:45:30 +02:00
Konstantin Knizhnik	404aab9373	Use mutex to prevent concurrent checkpoints (#1115 ) * Use mutex to prevent concurrent checkpoints * Fix comment	2022-01-13 17:48:24 +03:00
Konstantin Knizhnik	bc6db2c10e	Implement IO metrics in VirtualFile (#1112 ) * Implement IO metrics in VirtualFile * Do not group virtual file close statistics by tenantid/timelineid * Add comments concenring close metrics	2022-01-13 17:36:53 +03:00
Konstantin Knizhnik	f70a5cad61	Fix releasing of timelines lock (#1100 ) refer #1087	2022-01-12 15:05:08 +03:00
Kirill Bulatov	4b3b19f444	Support prefixes when working with s3 buckets	2022-01-11 15:44:50 +02:00
Kirill Bulatov	8ab4c8a050	Code review fixes	2022-01-11 15:44:23 +02:00
Kirill Bulatov	65c851a451	Test pageserver's timeline http methods z	2022-01-11 15:44:23 +02:00
Kirill Bulatov	23cf2fa984	Properly shutdown storage sync loop	2022-01-11 15:44:23 +02:00
Kirill Bulatov	384b2a91fa	Pass generic pageserver params through zenith cli	2022-01-11 15:44:23 +02:00
Konstantin Knizhnik	2fd4c390cb	Do not hold timelines lock during GC (#1089 ) * Do not hold timelines lock during GC refer #1087 * Add gc_cs mutex for preveting creation of new timelines during GC * Make clippy happy * Use Mutex<()> instead of Mutex<i32> for GC critical section	2022-01-10 14:41:15 +03:00
Patrick Insinger	191d9d2b74	par_fsync - use VirtualFile	2022-01-04 20:40:57 -08:00
Patrick Insinger	24c8dab86f	pageserver - parallelize checkpoint fsyncs	2022-01-04 20:40:57 -08:00
Heikki Linnakangas	55a4cf64a1	Refactor WAL record handling. Introduce the concept of a "ZenithWalRecord", which can be a Postgres WAL record that is replayed with the Postgres WAL redo process, or a built-in type that is handled entirely by pageserver code. Replace the special code to replay Postgres XACT commit/abort records with new Zenith WAL records. A separate zenith WAL record is created for each modified CLOG page. This allows removing the 'main_data_offset' field from stored PostgreSQL WAL records, which saves some memory and some disk space in delta layers. Introduce zenith WAL records for updating bits in the visibility map. Previously, when e.g. a heap insert cleared the VM bit, we duplicated the heap insert WAL record for the affected VM page. That was very wasteful. The heap WAL record could be massive, containing a full page image in the worst case. This addresses github issue #941.	2022-01-04 11:26:37 +02:00
Heikki Linnakangas	722667f189	Add test case for performance issue #941 . The first COPY generates about 230 MB of write I/O, but the second COPY, after deleting most of the rows and vacuuming the rows away, generates 370 MB of writes. Both COPYs insert the same amount of data, so they should generate roughly the same amount of I/O. This commit doesn't try to fix the issue, just adds a test case to demonstrate it. Add a new 'checkpoint' command to the pageserver API. Previously, we've used 'do_gc' for that, but many tests, including this new one, really only want to perform a checkpoint and don't care about GC. For now, I only used the command in the new test, though, and didn't convert any existing tests to use it.	2022-01-04 11:26:37 +02:00
Konstantin Knizhnik	1c47fbae81	Do not write image layers during enforced checkpoint (#1057 ) * Do not write image layers during enforced checkpoint refer #1056 * Add Flush option to CheckpointConfig refer #1057	2022-01-01 19:08:09 +03:00
Dmitry Rodionov	c910132d4b	Fix wal receiver shutdown This patch allows to shutdown wal receiver when there are no messages and wal receiver is blocked inside tokio-postgres. In this case it cannot check the shutdown flag. This patch switches to use async interface of tokio-postgres directly without sync wrappers. It opens the possibility to use tokio::select! between the phsycal_stream.next() and a shutdown channel readiness to interrupt replication process. Also this allows to shutdown only particular wal receiver without using global shutdown_requested flag.	2021-12-29 14:42:29 +03:00
Kirill Bulatov	f0afd08667	Fix zenith init defaults	2021-12-28 00:21:48 +02:00
Kirill Bulatov	b494ac1ea0	Remove redundant pageserver cli params	2021-12-27 18:38:54 +02:00
Arseny Sher	a163650a99	Refactor Postgres command parsing in safekeeper. Do it separately with SafekeeperPostgresCommand enum as a result. Since query is always C string, switch postgres_backend process_query argument from Bytes to &str. Make passing ztli/ztenant id in safekeeper connection string optional; this is needed for upcoming intra-safekeeper heartbeat cmd which is not bound to any timeline.	2021-12-24 15:48:13 +03:00
anastasia	980f5f8440	Propagate remote_consistent_lsn to safekeepers. Change meaning of lsns in HOT_STANDBY_FEEDBACK: flush_lsn = disk_consistent_lsn, apply_lsn = remote_consistent_lsn Update compute node backpressure configuration respectively. Update compute node configuration: set 'synchronous_commit=remote_write' in setup without safekeepers. This way compute node doesn't have to wait for data checkpoint on pageserver. This doesn't guarantee data durability, but we only use this setup for tests, so it's fine.	2021-12-24 15:32:54 +03:00

1 2 3 4 5 ...

573 Commits