neon/pageserver at 545f7e8cd7fcca09134f5c1eb47c8ff323dfad22 - neon

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-09 22:42:57 +00:00

Files

John Spray a43a1ad1df pageserver: fix API-driven secondary downloads possibly colliding with background downloads (#7848 )

## Problem

We've seen some strange behaviors when doing lots of migrations
involving secondary locations. One of these was where a tenant was
apparently stuck in the `Scheduler::running` list, but didn't appear to
be making any progress. Another was a shutdown hang
(https://github.com/neondatabase/cloud/issues/13576).

## Summary of changes

- Fix one issue (probably not the only one) where a tenant in the
`pending` list could proceed to `spawn` even if the same tenant already
had a running task via `handle_command` (this could have resulted in a
weird value of SecondaryProgress)
- Add various extra logging:
- log before as well as after layer downloads so that it would be
obvious if we were stuck in remote storage code (we shouldn't be, it has
built in timeouts)
- log the number of running + pending jobs from the scheduler every time
it wakes up to do a scheduling iteration (~10s) -- this is quite chatty,
but not compared with the volume of logs on a busy pageserver. It should
give us confidence that the scheduler loop is still alive, and
visibility of how many tasks the scheduler thinks are running.

2024-05-23 09:13:55 +01:00

benches

chore!: always use async walredo, warn if sync is configured (#7754 )

2024-05-15 15:04:52 +02:00

client

feat(pagebench): add aux file bench (#7746 )

2024-05-17 20:04:02 +00:00

compaction

Tiered compaction: improvements to the windows (#7787 )

2024-05-16 22:25:19 +02:00

ctl

feat(pageserver): persist aux file policy in index part (#7668 )