neon/control_plane at 59c5b374de8934e76ce7739720fc31547ac9de00 - neon

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-10 15:02:56 +00:00


John Spray f2e5212fed storage controller: background reconcile, graceful shutdown, better logging (#6709)

## Problem

Now that the storage controller is working end to end, we start burning
down the robustness aspects.

## Summary of changes

- Add a background task that periodically calls `reconcile_all`. This
ensures that if earlier operations couldn't succeed (e.g. because a node
was unavailable), we will eventually retry. This is a naive initial
implementation that can start an unlimited number of reconcile tasks;
limiting reconcile concurrency is a later item in #6342.
- Add a number of tracing spans in key locations: each background task,
each reconciler task.
- Add a top level CancellationToken and Gate, and use these to implement
a graceful shutdown that waits for tasks to shut down. This is not
bulletproof yet, because within these tasks we have remote HTTP calls
that aren't wrapped in cancellation/timeouts, but it creates the
structure, and if we don't shut down promptly then k8s will kill us.
- To protect shard splits from background reconciliation, expose the `SplitState`
in memory and use it to guard any APIs that require an attached tenant.
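
The background reconcile loop and the graceful shutdown described above can be sketched roughly as follows. This is a minimal illustration using only the Rust standard library, not the storage controller's actual code: the real change uses an async task with a `CancellationToken` and a `Gate`, whereas here a channel stands in for the cancellation token and a thread join stands in for waiting on the gate. The names `Service`, `Background`, `spawn_background_reconcile`, and the `passes` counter are hypothetical; only `reconcile_all` is named in the commit message, and its body here is a placeholder.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::mpsc::{self, RecvTimeoutError, Sender};
use std::sync::Arc;
use std::thread::{self, JoinHandle};
use std::time::Duration;

/// Hypothetical stand-in for the controller's service state.
pub struct Service {
    /// Counts completed reconcile passes (for illustration only).
    pub passes: AtomicUsize,
}

impl Service {
    /// Placeholder for `reconcile_all`: the real function would retry any
    /// work that earlier operations could not complete.
    pub fn reconcile_all(&self) {
        self.passes.fetch_add(1, Ordering::SeqCst);
    }
}

/// Owns the cancellation channel and the background thread.
pub struct Background {
    cancel: Sender<()>,
    handle: JoinHandle<()>,
}

/// Start a thread that calls `reconcile_all` once per `interval`
/// until it is cancelled.
pub fn spawn_background_reconcile(service: Arc<Service>, interval: Duration) -> Background {
    let (cancel, cancelled) = mpsc::channel::<()>();
    let handle = thread::spawn(move || loop {
        // Waiting on the channel doubles as the sleep between passes, so a
        // cancellation wakes the thread immediately instead of after a sleep.
        match cancelled.recv_timeout(interval) {
            // Interval elapsed with no cancellation: run one pass.
            Err(RecvTimeoutError::Timeout) => service.reconcile_all(),
            // Cancelled, or the handle was dropped: leave the loop.
            Ok(()) | Err(RecvTimeoutError::Disconnected) => break,
        }
    });
    Background { cancel, handle }
}

impl Background {
    /// Graceful shutdown: signal cancellation, then wait for the task to exit.
    pub fn shutdown(self) {
        // A send error only means the thread has already exited.
        let _ = self.cancel.send(());
        self.handle.join().expect("background task panicked");
    }
}
```

A caller would keep the `Background` handle for the life of the process and call `shutdown()` on termination. As in the commit's caveat, this only returns promptly if a single `reconcile_all` pass does; a pass blocked on an uncancellable remote call would delay the join.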

2024-02-16 13:00:53 +00:00

| Name | Last commit | Date |
| --- | --- | --- |
| attachment_service | storage controller: background reconcile, graceful shutdown, better logging (#6709) | 2024-02-16 13:00:53 +00:00 |
| src | libs: refactor ShardCount.0 to private (#6690) | 2024-02-15 21:59:39 +00:00 |
| .gitignore | Revert "Use actual temporary dir for pageserver unit tests" | 2023-01-19 20:16:56 +01:00 |
| Cargo.toml | control_plane: follow up for embedded migrations (#6647) | 2024-02-09 14:26:50 +00:00 |
| safekeepers.conf | Separate mgmt and libpq authentication configs in pageserver. (#3773) | 2023-03-15 13:52:29 +02:00 |
| simple.conf | tests: enable multiple pageservers in neon_local and neon_fixture (#5231) | 2023-09-08 16:19:57 +01:00 |