rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-23 16:10:37 +00:00

Author	SHA1	Message	Date
Heikki Linnakangas	791eebefe2	Silence clippy warning	2022-11-24 19:20:17 +01:00
Heikki Linnakangas	50b686c3e4	rustfmt	2022-11-24 19:20:17 +01:00
Heikki Linnakangas	39b10696e9	Fix unit tests. `activate` is now more strict and errors out if the tenant is already Active.	2022-11-24 19:20:17 +01:00
Heikki Linnakangas	264b0ada9f	Handle concurrent detach and attach more gracefully. If tenant detach is requested while the tenant is still in Attaching state, we set the state to Paused, but when the attach completed, it changed it to Active again, and worse, it started the background jobs. To fix, rewrite the set_state() function so that when you activate a tenant that is already in Paused state, it stays in Paused state and we don't start the background loops.	2022-11-24 19:20:17 +01:00
Heikki Linnakangas	78338f7b94	Remove background_jobs_enabled, move code from tenant_mgr.rs to tenant.rs	2022-11-24 19:20:17 +01:00
Heikki Linnakangas	0d533ce840	Test detach while attach is still in progress	2022-11-24 19:20:17 +01:00
Christian Schwarz	978f1879b9	fix typo in storage_sync.rs module comment Co-authored-by: Joonas Koivunen <joonas@neon.tech>	2022-11-24 19:17:41 +01:00
Christian Schwarz	c863d679f8	storage_sync: update module doc comment to reflect our changes	2022-11-23 11:46:03 -05:00
Christian Schwarz	c61731a31f	metric pageserver_remote_upload_queue_unfinished_tasks: add labels for file & op kind, and check for them in the test Fails right now because turns out we don't actually generate layer removal tasks with the current test code. That will be the next commit.	2022-11-22 12:13:38 -05:00
Heikki Linnakangas	d1b92e976a	When a new layer file is created in compaction, also upload it. I can't believe this was missing..	2022-11-22 16:35:31 +02:00
Heikki Linnakangas	d6c1a9aa18	Fix formatting	2022-11-22 15:02:09 +02:00
Heikki Linnakangas	3c4680b718	Turn upload_queue_items_metric into a regular function	2022-11-22 14:45:22 +02:00
Christian Schwarz	6b7cbec9b3	WIP: add test for storage_sync upload retries	2022-11-22 05:49:46 -05:00
Christian Schwarz	6de9a33c05	metric REMOTE_UPLOAD_QUEUE_UNFINISHED_TASKS	2022-11-22 05:49:46 -05:00
Christian Schwarz	77f883c95c	prometheus metrics for storage_sync2 Extracted from https://github.com/neondatabase/neon/pull/2595 Use pub(crate) to highlight unused metrics.	2022-11-22 05:49:25 -05:00
Heikki Linnakangas	e6557f4f91	Silence test failures, where we operate on a tenant before it's loaded Saw a failure like this, from 'test_tenants_attached_after_download' and 'test_tenant_redownloads_truncated_file_on_startup': > test_runner/fixtures/neon_fixtures.py:1064: in verbose_error > res.raise_for_status() > /github/home/.cache/pypoetry/virtualenvs/neon-_pxWMzVK-py3.9/lib/python3.9/site-packages/requests/models.py:1021: in raise_for_status > raise HTTPError(http_error_msg, response=self) > E requests.exceptions.HTTPError: 404 Client Error: Not Found for url: http://localhost:18150/v1/tenant/2334c9c113a82b5dd1651a0a23c53448/timeline > > The above exception was the direct cause of the following exception: > test_runner/regress/test_tenants_with_remote_storage.py:185: in test_tenants_attached_after_download > restored_timelines = client.timeline_list(tenant_id) > test_runner/fixtures/neon_fixtures.py:1148: in timeline_list > self.verbose_error(res) > test_runner/fixtures/neon_fixtures.py:1070: in verbose_error > raise PageserverApiException(msg) from e > E fixtures.neon_fixtures.PageserverApiException: NotFound: Tenant 2334c9c113a82b5dd1651a0a23c53448 is not active. Current state: Loading These tests starts the pageserver, wait until assert_no_in_progress_downloads_for_tenant says that has_downloads_in_progress is false, and then call timeline_list on the tenant. But has_downloads_in_progress was only returned as true when the tenant was being attached, not when it was being loaded at pageserver startup. Change tenant_status API endpoint (/v1/tenant/:tenant_id) so that it returns has_downloads_in_progress=true also for tenants that are still in Loading state.	2022-11-21 20:50:42 +01:00
Heikki Linnakangas	e8db20eb26	On connection from compute, wait for tenant to become Active. If a connection from compute arrives while a tenant is still in Loading state, wait for it to become Active instead of throwing an error to the client. This should fix the errors from test_gc_cutoff test that repeatedly restarts the pageserver and immediately tries to connect to it.	2022-11-21 20:50:42 +01:00
Heikki Linnakangas	7552e2d25f	Enable passing FAILPOINTS at startup. - Pass through FAILPOINTS environment variable to the pageserver in "neon_local pageserver start" command - On startup, list any failpoints that were set with FAILPOINTS to the log - Add optional "extra_env_vars" argument to the NeonPageserver.start() function in the python fixture, so that you can pass FAILPOINTS None of the tests use this functionality yet; that comes in a separate commit. closes https://github.com/neondatabase/neon/pull/2865	2022-11-21 20:50:42 +01:00
Heikki Linnakangas	e4c9b83a39	Merge remote-tracking branch 'origin/main' into HEAD	2022-11-20 02:31:21 +02:00
Heikki Linnakangas	eed99b7251	Silence clippy warning	2022-11-19 18:26:18 +02:00
Christian Schwarz	f564dff0e3	make test_tenant_detach_smoke fail reproducibly Add failpoint that triggers the race condition. Skip test until we'll land the fix from https://github.com/neondatabase/neon/pull/2851 with https://github.com/neondatabase/neon/pull/2785	2022-11-18 17:15:34 +01:00
Christian Schwarz	d783889a1f	timeline: explicit tracking of flush loop state: NotStarted, Running, Exited This allows us to error out in the case where we request flush but the flush loop is not running. Before, we would only track whether it was started, but not when it exited. Better to use an enum with 3 states than a 2-state bool because then the error message can answer the question whether we ever started the flush loop or not.	2022-11-18 17:15:34 +01:00
Christian Schwarz	66f8f686a0	run manual gc in a task_mgr task to prevent race with detach This fixes flaky test_tenant_detach_smoke.	2022-11-18 12:15:14 +02:00
Christian Schwarz	919f2b261a	make test_tenant_detach_smoke fail reproducibly Add failpoint that triggers the race condition.	2022-11-18 12:15:14 +02:00
Christian Schwarz	9d273c840a	timeline: explicit tracking of flush loop state: NotStarted, Running, Exited This allows us to error out in the case where we request flush but the flush loop is not running. Before, we would only track whether it was started, but not when it exited. Better to use an enum with 3 states than a 2-state bool because then the error message can answer the question whether we ever started the flush loop or not.	2022-11-18 12:15:14 +02:00
Heikki Linnakangas	328ec1ce24	Print a more full error message, with stack trace, on GC failure. In a CI run, I got a test failure because of this error in the log, from the test_get_tenant_size_with_multiple_branches test: ERROR gc_loop{tenant_id=f1630516d4b526139836ced93be0c878}: Gc failed, retrying in 2s: No such file or directory (os error 2) There are known race conditions between GC and timeline deletion, which surely caused that error. But if we didn't know the cause, it would be pretty hard to debug without a stack trace.	2022-11-18 11:44:00 +02:00
Dmitry Rodionov	6600e1f896	initialize upload queue before starting download operations	2022-11-17 23:53:31 +02:00
Dmitry Rodionov	348369414b	fix incorrect metadata update Previously in some cases local metadata was confused with remote one and there was a check, that we write locally only if remote metadata has greater disk_consistent_lsn. So because they were equal we didnt write anything. For attach scenario this ended up in not writing metadata at all. Rearrange code so we decide on proper metadata value earlier on and initialize timeline with correct one without need to update it late in the initialization process in .reconsile_with_remote	2022-11-17 23:53:31 +02:00
Christian Schwarz	3890acaf7f	stop using Option for UploadQueueInitialized::{latest_metadata,last_uploaded_consistent_lsn}	2022-11-17 21:34:16 +02:00
Christian Schwarz	f537a7a873	add explainer comments regarding UploadQueueInitialized::{latest_files,latest_metadata,last_uploaded_consistent_lsn}	2022-11-17 21:34:16 +02:00
Christian Schwarz	71bc45a21b	storage_sync: track upload queue initialization state using enum & fix last_uploaded_consistent_lsn initialization for empty remote storage As pointed out in `b8488e70a9 (r1024319620)` the following is wrong for the case where the remote storage is empty: metadata = whatever the local-ONLY metadata is ... upload_queue.latest_metadata = Some(metadata.clone()); upload_queue.last_uploaded_consistent_lsn = Some(metadata.disk_consistent_lsn()); The reason why it's wrong is that we return last_uploaded_consistent_lsn to safekeepers. So, we'd be returning an Lsn that is not yet uploaded to S3.	2022-11-17 21:34:16 +02:00
Christian Schwarz	decef74503	don't start background jobs if tenant has not timelines Before this change, test_pageserver_with_empty_tenants was failing at: assert loaded_tenant["state"] == { "Active": {"background_jobs_running": False} }, "Tenant {tenant_with_empty_timelines_dir} with empty timelines dir should be active and ready for timeline creation" because background_jobs_running was True instead of False. Personally I think we should simply always start the background loops and not bother, but let's punt this until after we've merged this PR.	2022-11-17 11:22:02 +02:00
Dmitry Rodionov	b8488e70a9	run clippy/fmt	2022-11-16 17:25:04 +02:00
Christian Schwarz	16fdd104ac	bring back HTTP API `has_in_progress_downloads` and `awaits_download` field, derived from TenantState	2022-11-16 14:57:26 +02:00
Christian Schwarz	bb6dbd2f43	crash-safe and resumable tenant attach This change introduces a marker file $repo/tenants/$tenant_id/attaching that is present while a tenant is in Attaching state. When pageserver restarts, we use it to resume the tenant attach operation. Before this change, a crash during tenant attach would result in one of the following: 1. crash upon restart due to missing metadata file (IIRC) 2. "successful" loading of the tenant with a subset of timelines	2022-11-16 14:57:26 +02:00
Dmitry Rodionov	1839ce0545	properly merge remote metadata with local one	2022-11-16 14:42:15 +02:00
Dmitry Rodionov	8e04f0455e	add a bunch of .context calls	2022-11-16 14:42:15 +02:00
Dmitry Rodionov	6839773538	handle temporary files during layer map loading	2022-11-16 14:42:15 +02:00
Dmitry Rodionov	2a96c4cfcd	start walreceiver after reconcile_with_remote	2022-11-16 14:42:15 +02:00
Christian Schwarz	027cf22663	fix layer download during reconcile_with_remote reconcile_with_remote, in this PR, is supposed to download all the layer files synchronously. I don't know why, but, download_missing was 1. not doing the download at all for DeltaLayer 2. not using the right RelativePath for image layer This patch fixes both.	2022-11-16 14:42:15 +02:00
Christian Schwarz	f4daa877b5	load_remote_timeline already has an #[instrument]	2022-11-16 14:42:15 +02:00
Christian Schwarz	ed28ced3bc	schedule_index_upload: remove unused Option() around metadata param	2022-11-16 14:42:15 +02:00
Christian Schwarz	d7c120574b	dedup download_missing() calls	2022-11-16 14:42:15 +02:00
Dmitry Rodionov	c9188ffa67	fix wrong path handling in reconcile_with_remote, refine spans	2022-11-16 14:42:15 +02:00
Dmitry Rodionov	c631fa1f50	fix test_gc_cutoff test Also improve it so it fails earlier if something is not working because otherwise it was failing because of the timeout. And if timeout was big enough test can even pass	2022-11-16 14:42:15 +02:00
Dmitry Rodionov	795c3ca131	Port per-tenant upload queue and startup changes from #2595 This is a part of https://github.com/neondatabase/neon/pull/2595. It takes out switch to per tenant upload queue and changes to pageserver startup sequence because these two are highly interleaved with each other. I'm still not happy with the size of the diff, but splitting it even more will probably consume even more time. Ideally we should do it, but this patch isis already a step forward and should be easier to get this patch in yet still quite difficult. Mainly because of the size and fixes for existing concerns which will extend the diff even further Co-authored-by: Heikki Linnakangas <heikki@neon.tech>	2022-11-16 14:42:15 +02:00
Joonas Koivunen	1d105727cb	perf: simple walredo bench (#2816 ) adds a simple walredo bench to allow some comparison of the walredo throughput. Cc: #1339, #2778	2022-11-16 11:13:56 +02:00
Heikki Linnakangas	46d30bf054	Check for errors in pageserver log after each test. If there are any unexpected ERRORs or WARNs in pageserver.log after test finishes, fail the test. This requires whitelisting the errors that are expected in each test, and there's also a few common errors that are printed by most tests, which are whitelisted in the fixture itself. With this, we don't need the special abort() call in testing mode, when compaction or GC fails. Those failures will print ERRORs to the logs, which will be picked up by this new mechanisms. A bunch of errors are currently whitelisted that we probably shouldn't be emitting in the first place, but fixing those is out of scope for this commit, so I just left FIXME comments on them.	2022-11-15 18:47:28 +02:00
Heikki Linnakangas	e44e4a699b	Downgrade log message, if client terminates COPY during basebackup import It's more or less expected from pageserver's point of view. Change the error kind to ConnectionReset, so that it gets logged at INFO level instead of ERROR.	2022-11-15 18:47:28 +02:00
Heikki Linnakangas	dbe5b52494	Avoid some vector-growing overhead. I saw this in 'perf' profile of a sequential scan: > - 31.93% 0.21% compute request pageserver [.] <pageserver::walredo::PostgresRedoManager as pageserver::walredo::WalRedoManager>::request_redo > - 31.72% <pageserver::walredo::PostgresRedoManager as pageserver::walredo::WalRedoManager>::request_redo > - 31.26% pageserver::walredo::PostgresRedoManager::apply_batch_postgres > + 7.64% <std::process::ChildStdin as std::io::Write>::write > + 6.17% nix::poll::poll > + 3.58% <std::process::ChildStderr as std::io::Read>::read > + 2.96% std::sync::condvar::Condvar::notify_one > + 2.48% std::sys::unix::locks::futex::Condvar::wait > + 2.19% alloc::raw_vec::RawVec<T,A>::reserve::do_reserve_and_handle > + 1.14% std::sys::unix::locks::futex::Mutex::lock_contended > 0.67% __rust_alloc_zeroed > 0.62% __stpcpy_ssse3 > 0.56% std::sys::unix::locks::futex::Mutex::wake Note the 'do_reserve_handle' overhead. That's caused by having to grow the buffer used to construct the WAL redo request. This commit eliminates that overhead. It's only about 2% of the overall CPU usage, but every little helps. Also reuse the temp buffer when reading records from a DeltaLayer, and call Vec::reserve to avoid growing a buffer when reading a blob across pages. I saw a reduction from 2% to 1% of CPU spent in do_reserve_and_handle in that codepath, but that's such a small change that it could be just noise. Seems like it shouldn't hurt though.	2022-11-12 18:52:25 +02:00

1 2 3 4 5 ...

1033 Commits