rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-07-07 06:00:38 +00:00

Author	SHA1	Message	Date
Christian Schwarz	fd7e3fd82f	Merge WITH CONFLICTS commit '72832b32140a78db7612af626d7c69079d73f445' into yuchen/direct-io-delta-image-layer-write Conflicts: pageserver/src/tenant/blob_io.rs - minor stuff Also I noticed some earlier merge went through cleanly but the `generate_tombstone_image_layer` layer writer didn't have the right arugments, so, failed to compile. Fixed in this merge commit.	2025-04-11 16:24:35 +02:00
Christian Schwarz	ddf6ba75c2	Merge 2025-04-09 main commit 'd11f23a3419a5b8eef62bc5736a4dd9d413bdab8' into yuchen/direct-io-delta-image-layer-write	2025-04-11 16:18:59 +02:00
Christian Schwarz	f017382b2b	Merge 2025-04-09 main commit 'e7502a3d637932a59ee502ababb1df3d0e3bca26' into yuchen/direct-io-delta-image-layer-write	2025-04-11 16:18:48 +02:00
Christian Schwarz	d0cb1a93dc	Merge 2025-04-09 main commit 'ef8101a9be3ce80d104943238a7d608561432189' into yuchen/direct-io-delta-image-layer-write	2025-04-11 16:18:34 +02:00
Christian Schwarz	140b47dc5a	Merge 2025-04-09 main commit 'a6ff8ec3d47963616d9cef07421d9319db958e8a' into yuchen/direct-io-delta-image-layer-write	2025-04-11 16:17:54 +02:00
Christian Schwarz	de1c392082	Merge 2025-04-07 main commit '486872dd28d538817599f29b045be025d1e3f43a' into yuchen/direct-io-delta-image-layer-write	2025-04-11 16:17:32 +02:00
Christian Schwarz	c5c60e156e	Merge WITH CONFLICTS 2025-03-18 main commit '9fb77d6cdd0894ec4e93b4fe3a576655cfad3b2e' into yuchen/direct-io-delta-image-layer-write The previous merge commit was the commit before, so, all these conflicts are the conflicts that arise from this PR and 97fb77 which is the commit that added cancellation sensitivity to flush task infinite retries. Conflicts: pageserver/src/tenant/remote_timeline_client/download.rs - different return type pageserver/src/virtual_file/owned_buffers_io/write.rs - added TODO that needs to be fixed before merge about retrying final write. I want a different API than this shutdown() thing we have rn pageserver/src/virtual_file/owned_buffers_io/write/flush.rs Most of the churn came from the need to propagate cancellation token. And churn in tests from having to propagate upwards the FlushTaskError instead of the std::io::Error we were propagating upwards before.	2025-04-11 16:13:44 +02:00
Christian Schwarz	9256935e1b	fix download usage of buffered writer (using pad + set_len strategy) this fixes tenant::timeline::tests::test_heatmap_generation	2025-04-11 13:51:56 +02:00
Christian Schwarz	647c881878	fix for vectored_blob_io::tests::test_really_big_array	2025-04-11 13:31:51 +02:00
Christian Schwarz	d1277b8259	I have a hypothesis for what the issue is with the vectored_blob_io::tests::test_really_big_array	2025-04-11 13:26:39 +02:00
Christian Schwarz	53b837d507	put in a note on blob_io writer not needed to do owned buffers io anymore	2025-04-11 13:25:25 +02:00
Christian Schwarz	f5d69e97c4	remark: vectored_blob_io::tests::test_really_big_array is failing since before I started merging from main	2025-04-11 12:56:07 +02:00
Christian Schwarz	e79beb0720	turns out we can delete all the seek-related APIs as well	2025-04-11 12:29:22 +02:00
Christian Schwarz	dfc364e4f4	remove non-absolute-position write APIs from VirtualFile	2025-04-11 11:57:09 +02:00
Christian Schwarz	9222995c4f	REVIEW more the shutdown API	2025-04-10 11:16:35 +02:00
Christian Schwarz	6f25c976f6	REVIEW: undo the `mutable->tail` rename to minimize conflicts with next commit Changes to be committed: modified: pageserver/src/tenant/ephemeral_file.rs modified: pageserver/src/virtual_file/owned_buffers_io/write.rs	2025-04-10 09:02:45 +02:00
Christian Schwarz	dd3178836d	REVIEW: minor nits	2025-04-10 08:58:06 +02:00
Christian Schwarz	2a29b3de89	Merge 2025-03-18 main commit '99639c26b49a0d6d546fd' into yuchen/direct-io-delta-image-layer-write	2025-04-09 19:40:14 +02:00
Christian Schwarz	91aff7b842	Merge WITH CONFLICTS 2025-03-11 main commit '158db414bf881fb358494e3215d192c8fa420a53' into yuchen/dire ct-io-delta-image-layer-write Conflicts: pageserver/src/virtual_file.rs pageserver/src/virtual_file/owned_buffers_io/write/flush.rs	2025-04-09 19:39:56 +02:00
Christian Schwarz	f078d7e1a9	Merge WITH CONFLICTS 2025-03-11 main commit '7c462b3417ecd3ae3907f3480f3b8a8c99fc6d7b' into yuchen/dire ct-io-delta-image-layer-write Conflicts: pageserver/src/tenant/blob_io.rs	2025-04-09 19:39:12 +02:00
Christian Schwarz	537eb334f2	Merge WITH CONFLICTS 2025-02-25 main commit '920040e40240774219b6607f1f8ef74478dc4b29' into yuchen/dire ct-io-delta-image-layer-write Conflicts: pageserver/src/tenant/blob_io.rs pageserver/src/tenant/block_io.rs pageserver/src/tenant/disk_btree.rs pageserver/src/tenant/storage_layer/delta_layer.rs pageserver/src/tenant/storage_layer/image_layer.rs pageserver/src/virtual_file/owned_buffers_io/write.rs	2025-04-09 19:38:20 +02:00
Christian Schwarz	e37cbc1a50	make clippy pass	2025-04-09 19:33:35 +02:00
Conrad Ludgate	72832b3214	chore: fix clippy lints from nightly-2025-03-16 (#11273 ) I like to run nightly clippy every so often to make our future rust upgrades easier. Some notable changes: * Prefer `next_back()` over `last()`. Generic iterators will implement `last()` to run forward through the iterator until the end. * Prefer `io::Error::other()`. * Use implicit returns One case where I haven't dealt with the issues is the now [more-sensitive "large enum variant" lint](https://github.com/rust-lang/rust-clippy/pull/13833). I chose not to take any decisions around it here, and simply marked them as allow for now.	2025-04-09 15:04:42 +00:00
Vlad Lazar	d11f23a341	pageserver: refactor read path for multi LSN batching support (#11463 ) ## Problem We wish to improve pageserver batching such that one batch can contain requests for pages at different LSNs. The current shape of the code doesn't lend itself to the change. ## Summary of changes Refactor the read path such that the fringe gets initialized upfront. This is where the multi LSN change will plug in. A couple other small changes fell out of this. There should be NO behaviour change here. If you smell one, shout! I recommend reviewing commits individually (intentionally made them as small as possible). Related: https://github.com/neondatabase/neon/issues/10765	2025-04-09 13:17:02 +00:00
Dmitrii Kovalkov	e7502a3d63	pageserver: return 412 PreconditionFailed in get_timestamp_of_lsn if timestamp is not found (#11491 ) ## Problem Now `get_timestamp_of_lsn` returns `404 NotFound` if there is no clog pages for given LSN, and it's difficult to distinguish from other 404 errors. A separate status code for this error will allow the control plane to handle this case. - Closes: https://github.com/neondatabase/neon/issues/11439 - Corresponding PR in control plane: https://github.com/neondatabase/cloud/pull/27125 ## Summary of changes - Return `412 PreconditionFailed` instead of `404 NotFound` if no timestamp is fond for given LSN. I looked briefly through the current error handling code in cloud.git and the status code change should not affect anything for the existing code. Change from the corresponding PR also looks fine and should work with the current PS status code. Additionally, here is OK to merge it from control plane team: https://github.com/neondatabase/neon/issues/11439#issuecomment-2789327552 --------- Co-authored-by: John Spray <john@neon.tech>	2025-04-09 13:16:15 +00:00
Heikki Linnakangas	ef8101a9be	refactor: Split "communicator" routines to a separate source file (#11459 ) pagestore_smgr.c had grown pretty large. Split into two parts, such that the smgr routines that PostgreSQL code calls stays in pagestore_smgr.c, and all the prefetching logic and other lower-level routines related to communicating with the pageserver are moved to a new source file, "communicator.c". There are plans to replace communicator parts with a new implementation. See https://github.com/neondatabase/neon/pull/10799. This commit doesn't implement any of the new things yet, but it is good preparation for it. I'm imagining that the new implementation will approximately replace the current "communicator.c" code, exposing roughly the same functions to pagestore_smgr.c. This commit doesn't change any functionality or behavior, or make any other changes to the existing code: It just moves existing code around.	2025-04-09 12:28:59 +00:00
Arpad Müller	d2825e72ad	Add is_stopping check around critical macro in walreceiver (#11496 ) The timeline stopping state is set much earlier than the cancellation token is fired, so by checking for the stopping state, we can prevent races with timeline shutdown where we issue a cancellation error but the cancellation token hasn't been fired yet. Fix #11427.	2025-04-09 12:17:45 +00:00
Erik Grinaker	a6ff8ec3d4	storcon: change default stripe size to 16 MB (#11168 ) ## Problem The current stripe size of 256 MB is a bit large, and can cause load imbalances across shards. A stripe size of 16 MB appears more reasonable to avoid hotspots, although we don't see evidence of this in benchmarks. Resolves https://github.com/neondatabase/cloud/issues/25634. Touches https://github.com/neondatabase/cloud/issues/21870. ## Summary of changes * Change the default stripe size to 16 MB. * Remove `ShardParameters::DEFAULT_STRIPE_SIZE`, and only use `pageserver_api::shard::DEFAULT_STRIPE_SIZE`. * Update a bunch of tests that assumed a certain stripe size.	2025-04-09 08:41:38 +00:00
Dmitrii Kovalkov	cf62017a5b	storcon: add https metrics for pageservers/safekeepers (#11460 ) ## Problem Storcon will not start up if `use_https` is on and there are some pageservers or safekeepers without https port in the database. Metrics "how many nodes with https we have in DB" will help us to make sure that `use_https` may be turned on safely. - Part of https://github.com/neondatabase/cloud/issues/25526 ## Summary of changes - Add `storage_controller_https_pageserver_nodes`, `storage_controller_safekeeper_nodes` and `storage_controller_https_safekeeper_nodes` Prometheus metrics.	2025-04-09 08:33:49 +00:00
Erik Grinaker	c610f3584d	test_runner: tweak `test_create_snapshot` compaction (#11495 ) ## Problem With the recent improvements to L0 compaction responsiveness, `test_create_snapshot` now ends up generating 10,000 layer files (compared to 1,000 in previous snapshots). This increases the snapshot size by 4x, and significantly slows down tests. ## Summary of changes Increase the target layer size from 128 KB to 256 KB, and the L0 compaction threshold from 1 to 5. This reduces the layer count from about 10,000 to 1,000.	2025-04-09 06:52:49 +00:00
Konstantin Knizhnik	c9ca8b7c4a	One more fix for unlogged build support in DEBUG_COMPARE_LOCAL (#11474 ) ## Problem Support of unlogged build in DEBUG_COMPARE_LOCAL. Neon SMGR treats present of local file as indicator of unlogged relations. But it doesn't work in DEBUG_COMPARE_LOCAL mode. ## Summary of changes Use INIT_FORKNUM as indicator of unlogged file and create this file while unlogged index build. Co-authored-by: Konstantin Knizhnik <knizhnik@neon.tech>	2025-04-09 05:14:29 +00:00
Erik Grinaker	7679b63a2c	pageserver: persist stripe size in tenant manifest for tenant_import (#11181 ) ## Problem `tenant_import`, used to import an existing tenant from remote storage into a storage controller for support and debugging, assumed `DEFAULT_STRIPE_SIZE` since this can't be recovered from remote storage. In #11168, we are changing the stripe size, which will break `tenant_import`. Resolves #11175. ## Summary of changes * Add `stripe_size` to the tenant manifest. * Add `TenantScanRemoteStorageShard::stripe_size` and return from `tenant_scan_remote` if present. * Recover the stripe size during`tenant_import`, or fall back to 32768 (the original default stripe size). * Add tenant manifest compatibility snapshot: `2025-04-08-pgv17-tenant-manifest-v1.tar.zst` There are no cross-version concerns here, since unknown fields are ignored during deserialization where relevant.	2025-04-08 20:43:27 +00:00
Erik Grinaker	d177654e5f	gitignore: add `/artifact_cache` (#11493 ) ## Problem This is generated e.g. by `test_historic_storage_formats`, and causes VSCode to list all the contained files as new. ## Summary of changes Add `/artifact_cache` to `.gitignore`.	2025-04-08 16:57:10 +00:00
Alex Chi Z.	a09c933de3	test(pageserver): add conditional append test record (#11476 ) ## Problem For future gc-compaction tests when we support https://github.com/neondatabase/neon/issues/10395 ## Summary of changes Add a new type of neon test WAL record that is conditionally applied (i.e., only when image == the specified value). We can use this to mock the situation where we lose some records in the middle, firing an error, and see how gc-compaction reacts to it. Signed-off-by: Alex Chi Z <chi@neon.tech>	2025-04-08 16:08:44 +00:00
Mikhail Kot	6138d61592	Object storage proxy (#11357 ) Service targeted for storing and retrieving LFC prewarm data. Can be used for proxying S3 access for Postgres extensions like pg_mooncake as well. Requests must include a Bearer JWT token. Token is validated using a pemfile (should be passed in infra/). Note: app is not tolerant to extra trailing slashes, see app.rs `delete_prefix` test for comments. Resolves: https://github.com/neondatabase/cloud/issues/26342 Unrelated changes: gate a `rename_noreplace` feature and disable it in `remote_storage` so as `object_storage` can be built with musl	2025-04-08 14:54:53 +00:00
Roman Zaynetdinov	a7142f3bc6	Configure rsyslog for logs export using the spec (#11338 ) - Work on https://github.com/neondatabase/cloud/issues/24896 - Cplane part https://github.com/neondatabase/cloud/pull/26808 Instead of reconfiguring rsyslog via an API endpoint [we have agreed](https://neondb.slack.com/archives/C04DGM6SMTM/p1743513810964509?thread_ts=1743170369.865859&cid=C04DGM6SMTM) to have a new `logs_export_host` field as part of the compute spec. --------- Co-authored-by: Tristan Partin <tristan@neon.tech>	2025-04-08 14:03:09 +00:00
Dmitrii Kovalkov	7791a49dd4	fix(tests): improve test_scrubber_tenant_snapshot stability (#11471 ) ## Problem `test_scrubber_tenant_snapshot` is flaky with `request was dropped` errors. More details are in the issue. - Closes: https://github.com/neondatabase/neon/issues/11278 ## Summary of changes - Disable shard scheduling during pageservers restart - Add `reconcile_until_idle` in the end of the test	2025-04-08 10:03:38 +00:00
dependabot[bot]	8a6d0dccaa	build(deps): bump tokio from 1.38.0 to 1.38.2 in /test_runner/pg_clients/rust/tokio-postgres in the cargo group across 1 directory (#11478 ) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-04-08 10:01:15 +00:00
Heikki Linnakangas	7ffcbfde9a	refactor: Move LFC function prototypes to separate header file (#11458 ) Also, move the call to the lfc_init() function. It was weird to have it in libpagestore.c, when libpagestore.c otherwise had nothing to do with the LFC. Move it directly into _PG_init()	2025-04-08 09:03:56 +00:00
Konstantin Knizhnik	b2a0b2e9dd	Skip hole tags in local_cache view (#11454 ) ## Problem If the local file cache is shrunk, so that we punch some holes in the underlying file, the local_cache view displays the holes incorrectly. See https://github.com/neondatabase/neon/issues/10770 ## Summary of changes Skip hole tags in the local_cache view. --------- Co-authored-by: Konstantin Knizhnik <knizhnik@neon.tech>	2025-04-08 03:52:50 +00:00
Alex Chi Z.	0875dacce0	fix(pageserver): more aggressively yield in gc-compaction, degrade errors to warnings (#11469 ) ## Problem Fix various small issues discovered during gc-compaction rollout. ## Summary of changes - Log level changes: if errors are from gc-compaction, fire a warning instead of errors or critical errors. - Yield to L0 compaction more aggressively. Instead of checking every 1k keys, we check on every key. Sometimes a single key reconstruct takes a long time. --------- Signed-off-by: Alex Chi Z <chi@neon.tech>	2025-04-07 21:19:06 +00:00
Erik Grinaker	99d8788756	pageserver: improve tenant manifest lifecycle (#11328 ) ## Problem Currently, the tenant manifest is only uploaded if there are offloaded timelines. The checks are also a bit loose (e.g. only checks number of offloaded timelines). We want to start using the manifest for other things too (e.g. stripe size). Resolves #11271. ## Summary of changes This patch ensures that a tenant manifest always exists. The lifecycle is: * During preload, fetch the existing manifest, if any. * During attach, upload a tenant manifest if it differs from the preloaded one (or does not exist). * Upload a new manifest as needed, if it differs from the last-known manifest (ignoring version number). * On splits, pre-populate the manifest from the parent. * During Pageserver physical GC, remove old manifests but keep the latest 2 generations. This will cause nearly all existing tenants to upload a new tenant manifest on their first attach after this change. Attaches are concurrency-limited in the storage controller, so we expect this will be fine. Also updates `make_broken` to automatically log at `INFO` level when the tenant has been cancelled, to avoid spurious error logs during shutdown.	2025-04-07 19:10:36 +00:00
Erik Grinaker	26c5c7e942	pageserver: set `Stopping` state on attach cancellation (#11462 ) ## Problem If a tenant is cancelled (e.g. due to Pageserver shutdown) during attach, it is set to `Broken`. This results both in error log spam and 500 responses for clients -- shutdown is supposed to return 503 responses which can be retried. This becomes more likely to happen with #11328, where we perform tenant manifest downloads/uploads during attach. ## Summary of changes Set tenant state to `Stopping` when attach fails and the tenant is cancelled, downgrading the log messages to INFO. This introduces two variants of `Stopping` -- with and without a caller barrier -- where the latter is used to signal attach cancellation.	2025-04-07 17:56:56 +00:00
Arpad Müller	8a2b19f467	Allow potential warning in test_storcon_create_delete_sk_down (#11466 ) Since merging #11400 and addition of `test_storcon_create_delete_sk_down`, we've seen an error occur multiple times. https://github.com/neondatabase/neon/pull/11400#issuecomment-2782528369	2025-04-07 16:52:54 +00:00
Arpad Müller	486872dd28	Add support to specify auth token via --auth-token-path (#11443 ) Before we specified the JWT via `SAFEKEEPER_AUTH_TOKEN`, but env vars are quite public, both in procfs as well as the unit files. So add a way to put the auth token into a file directly. context: https://neondb.slack.com/archives/C033RQ5SPDH/p1743692566311099	2025-04-07 16:12:04 +00:00
Alex Chi Z.	d37e90f430	fix(pageserver): allow shard ancestor compaction to be cancelled (#11452 ) ## Problem https://github.com/neondatabase/neon/issues/11330 https://github.com/neondatabase/neon/issues/11358 ## Summary of changes Looking at the staging log, a few tenants right after shard split are stuck on shutdown because they are running shard ancestor compaction. The compaction does not respect the cancellation token. Signed-off-by: Alex Chi Z <chi@neon.tech>	2025-04-07 16:01:21 +00:00
Konstantin Knizhnik	8eb701d706	Save FSM/VM pages on normal shutdown (#11449 ) ## Problem See https://neondb.slack.com/archives/C03QLRH7PPD/p1743746717119179 We wallow FSM/VM pages when they are written to disk to persist them in PS. But it is not happen during shutdown checkpoint, because writing to WAL during checkpoint cause Postgres panic. ## Summary of changes Move `CheckPointBuffers` call to `PreCheckPointGuts` Postgres PRs: https://github.com/neondatabase/postgres/pull/615 https://github.com/neondatabase/postgres/pull/614 https://github.com/neondatabase/postgres/pull/613 https://github.com/neondatabase/postgres/pull/612 --------- Co-authored-by: Konstantin Knizhnik <knizhnik@neon.tech>	2025-04-07 13:56:55 +00:00
Conrad Ludgate	85a515c176	update tokio for RUSTSEC-2025-0023 (#11464 )	2025-04-07 13:33:56 +00:00
Christian Schwarz	aa88279681	fix(storcon/http): node status API returns serialized runtime object (#11461 ) The Serialize impl on the `Node` type is for the `/debug` endpoint only. Committed APIs should use the `NodeDescribeResponse`. Refs - fixes https://github.com/neondatabase/neon/issues/11326 - found while working on admin UI change https://github.com/neondatabase/cloud/pull/26207	2025-04-07 12:23:40 +00:00
Heikki Linnakangas	b2a670c765	refactor: Use same prototype for neon_read_at_lsn on all PG versions (#11457 ) The 'neon_read' function needs to have a different prototype on PG < 16, because it's part of the smgr interface. But neon_read_at_lsn doesn't have that restriction.	2025-04-07 11:04:36 +00:00

1 2 3 4 5 ...

7690 Commits