neon/regress at arpad/compaction_enabled_test - neon

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-07-05 21:20:37 +00:00

Files

John Spray 63213fc814 storage controller: scheduling optimization for sharded tenants (#7181 )

## Problem

- When we scheduled locations, we were doing it without any context
about other shards in the same tenant
- After a shard split, there wasn't an automatic mechanism to migrate
the attachments away from the split location
- After a shard split and the migration away from the split location,
there wasn't an automatic mechanism to pick new secondary locations so
that the end state has no concentration of locations on the nodes where
the split happened.

Partially completes: https://github.com/neondatabase/neon/issues/7139

## Summary of changes

- Scheduler now takes a `ScheduleContext` object that can be populated
with information about other shards
- During tenant creation and shard split, we incrementally build up the
ScheduleContext, updating it for each shard as we proceed.
- When scheduling new locations, the ScheduleContext is used to apply a
soft anti-affinity to nodes where a tenant already has shards.
- The background reconciler task now has an extra phase `optimize_all`,
which runs only if the primary `reconcile_all` phase didn't generate any
work. The separation is that `reconcile_all` is needed for availability,
but optimize_all is purely "nice to have" work to balance work across
the nodes better.
- optimize_all calls into two new TenantState methods called
optimize_attachment and optimize_secondary, which seek out opportunities
to improve placment:
- optimize_attachment: if the node where we're currently attached has an
excess of attached shard locations for this tenant compared with the
node where we have a secondary location, then cut over to the secondary
location.
- optimize_secondary: if the node holding our secondary location has an
excessive number of locations for this tenant compared with some other
node where we don't currently have a location, then create a new
secondary location on that other node.
- a new debug API endpoint is provided to run background tasks
on-demand. This returns a number of reconciliations in progress, so
callers can keep calling until they get a `0` to advance the system to
its final state without waiting for many iterations of the background
task.

Optimization is run at an implicitly low priority by:
- Omitting the phase entirely if reconcile_all has work to do
- Skipping optimization of any tenant that has reconciles in flight
- Limiting the total number of optimizations that will be run from one
call to optimize_all to a constant (currently 2).

The idea of that low priority execution is to minimize the operational
risk that optimization work overloads any part of the system. It happens
to also make the system easier to observe and debug, as we avoid running
large numbers of concurrent changes. Eventually we may relax these
limitations: there is no correctness problem with optimizing lots of
tenants concurrently, and optimizing multiple shards in one tenant just
requires housekeeping changes to update ShardContext with the result of
one optimization before proceeding to the next shard.

2024-03-28 18:48:52 +00:00

data/extension_test/5670669815

Feat/postgres 16 (#4761 )

2023-09-12 15:11:32 +02:00

test_ancestor_branch.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_attach_tenant_config.py

pageserver: check for new image layers based on ingested WAL (#7230 )

2024-03-28 17:44:55 +00:00

test_auth.py

chore: remove repetitive words (#7206 )

2024-03-25 11:43:02 -04:00

test_backpressure.py

Revert "pageserver: use a single tokio runtime (#6555 )" (#7246 )

2024-03-26 15:24:18 +01:00

test_bad_connection.py

Add retry to fetching basebackup (#6537 )

2024-02-01 20:50:04 +00:00

test_basebackup_error.py

Rename "Postgres nodes" in control_plane to endpoints.

2023-04-13 14:34:29 +03:00

test_branch_and_gc.py

tests: log hygiene checks for storage controller (#6710 )

2024-03-19 10:30:33 +00:00

test_branch_behind.py

tests: log hygiene checks for storage controller (#6710 )

2024-03-19 10:30:33 +00:00

test_branching.py

pageserver: remove un-needed "uninit mark" (#5717 )

2024-03-15 17:23:05 +02:00

test_broken_timeline.py

pageserver: remove un-needed "uninit mark" (#5717 )

2024-03-15 17:23:05 +02:00

test_build_info_metric.py

feat: add build_tag env support for set_build_info_metric (#5576 )

2023-10-27 10:47:11 +01:00

test_change_pageserver.py

tests/neon_local: rename "attachment service" -> "storage controller" (#7087 )

2024-03-12 11:36:27 +00:00

test_clog_truncate.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_close_fds.py

tests: enable multiple pageservers in neon_local and neon_fixture (#5231 )

2023-09-08 16:19:57 +01:00

test_compatibility.py

tests: stabilize compat tests (#7227 )

2024-03-25 14:35:24 +00:00

test_config.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_crafted_wal_end.py

test_runner: replace black with ruff format (#6268 )

2024-01-05 15:35:07 +00:00

test_createdropdb.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_createuser.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_ddl_forwarding.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_disk_usage_eviction.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_download_extensions.py

Use test specific directory in test_remote_extensions (#5938 )

2023-11-27 18:57:58 +00:00

test_duplicate_layers.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_explain_with_lfc_stats.py

new test for LFC stats in explain (#6968 )

2024-03-01 14:33:08 +00:00

test_fsm_truncate.py

Rename "Postgres nodes" in control_plane to endpoints.

2023-04-13 14:34:29 +03:00

test_fullbackup.py

tests: Remove unnecessary port config with VanillaPostgres class

2024-02-11 01:34:31 +02:00

test_gc_aggressive.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_hot_standby.py

fixup(#7204 / postgres): revert IsPrimaryAlive checks (#7209 )

2024-03-23 01:01:51 +00:00

test_import.py

pageserver: remove un-needed "uninit mark" (#5717 )

2024-03-15 17:23:05 +02:00

test_large_schema.py

tests: enable multiple pageservers in neon_local and neon_fixture (#5231 )

2023-09-08 16:19:57 +01:00

test_layer_bloating.py

Limit number of AUX files deltas to reduce reconstruct time (#6874 )

2024-02-27 21:18:46 +02:00

test_layer_eviction.py

pageserver: check for new image layers based on ingested WAL (#7230 )

2024-03-28 17:44:55 +00:00

test_layer_writers_fail.py

Move tenant & timeline dir method to NeonPageserver and use them everywhere (#5262 )

2023-09-15 11:17:18 +01:00

test_layers_from_future.py

pageserver: check for new image layers based on ingested WAL (#7230 )

2024-03-28 17:44:55 +00:00

test_lfc_resize.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_lfc_working_set_approximation.py

Testcase for neon extension function approximate_working_set_size() (#6980 )

2024-03-01 13:29:08 +01:00

test_local_file_cache.py

LFC fixes + statistics (#5727 )

2023-11-23 08:59:19 +02:00

test_logging.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_logical_replication.py

fix: drop replication slot causes postgres stuck on exit (#7192 )

2024-03-28 15:24:36 +00:00

test_lsn_mapping.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_migrations.py

Revoke REPLICATION (#7052 )

2024-03-08 22:24:30 +00:00

test_multixact.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_neon_cli.py

tests/neon_local: rename "attachment service" -> "storage controller" (#7087 )

2024-03-12 11:36:27 +00:00

test_neon_extension.py

spec: allow neon extension auto-upgrade + softfail upgrade (#7231 )

2024-03-28 17:22:35 +00:00

test_neon_local_cli.py

fix(test suite): some tests leak child processes (#6497 )

2024-01-26 18:23:53 +00:00

test_neon_superuser.py

fix(test): drop subscription when test completes (#6975 )

2024-03-06 15:52:24 +00:00

test_next_xid.py

Fix calculation of maximal multixact in ingest_multixact_create_record (#6502 )

2024-01-29 07:39:16 +02:00

test_normal_work.py

Rename "Postgres nodes" in control_plane to endpoints.

2023-04-13 14:34:29 +03:00

test_old_request_lsn.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_ondemand_download.py

pageserver: check for new image layers based on ingested WAL (#7230 )

2024-03-28 17:44:55 +00:00

test_pageserver_api.py

tokio-epoll-uring: fallback to std-fs if not available & not explicitly requested (#7120 )

2024-03-15 17:46:04 +00:00

test_pageserver_catchup.py

Rename "Postgres nodes" in control_plane to endpoints.

2023-04-13 14:34:29 +03:00

test_pageserver_generations.py

pageserver: check for new image layers based on ingested WAL (#7230 )

2024-03-28 17:44:55 +00:00

test_pageserver_getpage_throttle.py

feat(per-tenant throttling): exclude throttled time from page_service metrics + regression test (#6953 )

2024-03-05 13:44:00 +00:00

test_pageserver_layer_rolling.py

pageserver: limit total ephemeral layer bytes (#7218 )

2024-03-26 15:45:32 +00:00

test_pageserver_metric_collection.py

pageserver: write consumption metrics to S3 (#7200 )

2024-03-22 14:52:14 +00:00

test_pageserver_reconnect.py

Implement lockless update of pageserver_connstring GUC in shared memory (#6314 )

2024-01-23 07:55:05 +02:00

test_pageserver_restart.py

tests: add basic coverage for sharding (#6380 )

2024-01-26 14:40:47 +00:00

test_pageserver_restarts_under_workload.py

pageserver: improve the shutdown log error (#5792 )

2023-11-07 16:57:26 +00:00

test_pageserver_secondary.py

pageserver: remove bare mgr::get_tenant, mgr::list_tenants (#7237 )

2024-03-26 18:29:08 +00:00

test_parallel_copy.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_pg_regress.py

tests: add basic coverage for sharding (#6380 )

2024-01-26 14:40:47 +00:00

test_physical_replication.py

Track size of FSM fork while applying records at replica (#5901 )

2023-12-05 18:49:24 +02:00

test_pitr_gc.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_proxy_allowed_ips.py

proxy: include client IP in ip deny message (#6854 )

2024-02-21 18:24:59 +01:00

test_proxy_metric_collection.py

refactor(test_consumption_metrics): split for pageserver and proxy (#5324 )

2023-09-16 18:05:35 +03:00

test_proxy_rate_limiter.py

Proxy control plane rate limiter (#5785 )

2023-11-15 09:15:59 +00:00

test_proxy_websockets.py

proxy: add websocket regression tests (#7121 )

2024-03-15 10:21:48 +01:00

test_proxy.py

proxy http cancellation safety (#7117 )

2024-03-14 08:20:56 +00:00

test_read_trace.py

tests: enable multiple pageservers in neon_local and neon_fixture (#5231 )

2023-09-08 16:19:57 +01:00

test_read_validation.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_readonly_node.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_recovery.py

test: fix test_pageserver_recovery flakyness (#7207 )

2024-03-25 09:38:12 +00:00

test_remote_storage.py

pageserver: check for new image layers based on ingested WAL (#7230 )

2024-03-28 17:44:55 +00:00

test_replication_start.py

fixup(#7204 / postgres): revert IsPrimaryAlive checks (#7209 )

2024-03-23 01:01:51 +00:00

test_s3_restore.py

tests/neon_local: rename "attachment service" -> "storage controller" (#7087 )

2024-03-12 11:36:27 +00:00

test_setup.py

python: more linting (#4734 )

2023-07-18 12:56:40 +03:00

test_sharding_service.py

storage controller: be more tolerant of control plane blocking notifications (#7268 )

2024-03-28 17:38:08 +00:00

test_sharding.py

storage controller: scheduling optimization for sharded tenants (#7181 )

2024-03-28 18:48:52 +00:00

test_sni_router.py

tests: split neon_fixtures.py (#4871 )

2023-08-03 17:20:24 +03:00

test_subxacts.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_tenant_conf.py

pageserver: increase DEFAULT_MAX_WALRECEIVER_LSN_WAL_LAG (#6970 )

2024-03-01 16:49:37 +00:00

test_tenant_delete.py

pageserver: cancellation for remote ops in tenant deletion on shutdown (#6105 )

2024-03-15 18:03:49 +00:00

test_tenant_detach.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_tenant_relocation.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_tenant_size.py

pageserver: exclude gc_horizon from synthetic size calculation (#6407 )

2024-03-15 16:07:36 +00:00

test_tenant_tasks.py

propagate lock guard to background deletion task (#4495 )

2023-06-15 17:30:12 +03:00

test_tenants_with_remote_storage.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_tenants.py

pageserver: return 429 on timeline creation in progress (#7225 )

2024-03-26 15:20:05 +00:00

test_threshold_based_eviction.py

tests: add optional cursor to log_contains + fix truthiness issues in callers (#6960 )

2024-03-01 10:45:39 +01:00

test_timeline_delete.py

pageserver: cancellation for remote ops in tenant deletion on shutdown (#6105 )

2024-03-15 18:03:49 +00:00

test_timeline_size.py

Revert "pageserver: use a single tokio runtime (#6555 )" (#7246 )

2024-03-26 15:24:18 +01:00

test_truncate.py

python: more linting (#4734 )

2023-07-18 12:56:40 +03:00

test_twophase.py

tests: Remove "postgres is running on ... branch" messages

2024-02-11 01:34:31 +02:00

test_unlogged.py

Rename "Postgres nodes" in control_plane to endpoints.

2023-04-13 14:34:29 +03:00

test_vm_bits.py

tests: Make test_vm_bit_clear_on_heap_lock more robust again. (#6714 )

2024-02-21 12:36:57 +00:00

test_wal_acceptor_async.py

Make WAL segment init atomic.

2024-01-30 18:05:22 +04:00

test_wal_acceptor.py

Keep walproposer alive until shutdown checkpoint is safe on safekepeers

2024-03-11 23:29:32 +04:00

test_wal_receiver.py

Raise pageserver walreceiver timeouts.

2023-06-19 15:59:38 +04:00

test_wal_restore.py

Allow initdb preservation for broken tenants (#6790 )

2024-02-19 17:27:02 +01:00

test_walredo_not_left_behind_on_detach.py

Move tenant & timeline dir method to NeonPageserver and use them everywhere (#5262 )

2023-09-15 11:17:18 +01:00