neon/pageserver at release-5373 - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-20 22:50:38 +00:00

Files

Joonas Koivunen 3695a1efa1 metrics: record time to update gc info as a per timeline metric (#7473 )

We know that updating gc info can take a very long time from [recent
incident], and holding `Tenant::gc_cs` affects many per-tenant
operations in the system. We need a direct way to observe the time it
takes. The solution is to add metrics so that we know when this happens:
- 2 new per-timeline metric
- 1 new global histogram

Verified that the buckets are okay-ish in [dashboard]. In our current
state, we will see a lot more of `Inf,` but that is probably okay; at
least we can learn which timelines are having issues.

Can we afford to add these metrics? A bit unclear, see [another
dashboard] with top pageserver `/metrics` response sizes.

[dashboard]:
https://neonprod.grafana.net/d/b7a5a5e2-1276-4bb0-9e3a-b4528adb6eb6/storage-operations-histograms-in-prod?orgId=1&var-datasource=ZNX49CDVz&var-instance=All&var-operation=All&from=now-7d&to=now

[another dashboard]:
https://neonprod.grafana.net/d/MQx4SN-Vk/metric-sizes-on-prod-and-some-correlations?orgId=1

[recent incident]:
https://neondb.slack.com/archives/C06UEMLK7FE/p1713817696580119?thread_ts=1713468604.508969&cid=C06UEMLK7FE

2024-04-29 07:14:53 +03:00

benches

add async walredo mode (disabled-by-default, opt-in via config) (#6548 )

2024-04-15 22:14:42 +02:00

client

Server support for new pagestream protocol version (#7377 )

2024-04-25 20:45:37 +03:00

compaction

Remove async_trait from CompactionDeltaLayer (#7342 )

2024-04-08 14:59:08 +02:00

ctl

pagectl draw-timeline-dir: include layer file name as an SVG comment (#7455 )