neon/pageserver at 85445cde14acd100c09a859311db1c28600ea7df - neon

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-15 20:20:38 +00:00

Files

Christian Schwarz 85445cde14 idea: concurrency-limit initial logical size calculation

Before this patch, there was no concurrency limit on initial logical
size computations.

In an experiment with a PS with 20k tenants, 1 timeline each,
all tenants inactive in SKs / not present in storage broker,
all logical size calculations are spawned by MetricsCollection,
i.e., consumption metrics worker.

Before this patch, these timelines would all do their initial logical
size calculation in parallel, leading to extreme thrashing in page cache
and virtual file cache.

With this patch, the virtual file cache thrashing is reduced
signficantly (from 80k `open`-system-calls/second to ~500
`open`-system-calls/second during loading).

This patch uses the existing background tasks semaphore to limit
concurrency, which generally is the right call for background activity.
However, due to logical size's involvement in PageserverFeedback towards
safekeepers, I think we need a priority-boosting mechanism, e.g., if
we're still calculating but walreceiver is actively asking, skip the
semaphore. That's fairly easy to implement, but, want to some feedback
on the general idea first before implementing it.
See also the FIXME in the block comment added in this commit.

NB: when evaluating, keep in mind that consumption metrics worker
persists its interval across restarts; delete the state file on disk
to get predictable (and I believe worst-case in terms of concurrency
during PS restart) behavior.

2023-11-28 15:19:13 +00:00

benches

walredo: make request_redo() an async fn (#5559 )

2023-10-18 11:23:06 +01:00

ctl

fix: cleanup of layers from the future can race with their re-creation (#5890 )

2023-11-23 13:33:41 +00:00

fixtures

walredo: simple tests and bench updates (#3045 )

2023-01-16 18:24:45 +02:00

src

idea: concurrency-limit initial logical size calculation

2023-11-28 15:19:13 +00:00

Cargo.toml

fix: replace all std::PathBufs with camino::Utf8PathBuf (#5352 )

2023-10-04 17:52:23 +03:00