neon/regress at bc5ec43056773f4a6742fb64dbff681392b02dd3

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-10 15:02:56 +00:00

Heikki Linnakangas bc5ec43056 Fix flaky physical-size tests in test_timeline_size.py.

These two tests, test_timeline_physical_size_post_compaction and
test_timeline_physical_size_post_gc, assumed that after you have
waited for the WAL from a bulk insertion to arrive, and you run a
cycle of checkpoint and compaction, no new layer files are created.
That matters because if a new layer file is created while we are
calculating the incremental and non-incremental physical sizes, the
two might differ.

However, the tests used a very small checkpoint_distance, so even a
small amount of WAL generated in PostgreSQL could cause a new layer
file to be created. Autovacuum can kick in at any time, and do that.
That caused occasional failures in the test. I was able to reproduce it
reliably by adding a long delay between the incremental and
non-incremental size calculations:

```
--- a/pageserver/src/http/routes.rs
+++ b/pageserver/src/http/routes.rs
@@ -129,6 +129,9 @@ async fn build_timeline_info(
         }
     };
     let current_physical_size = Some(timeline.get_physical_size());
+    if include_non_incremental_physical_size {
+        std::thread::sleep(std::time::Duration::from_millis(60000));
+    }

     let info = TimelineInfo {
         tenant_id: timeline.tenant_id,
```

To fix, disable autovacuum for the table. Autovacuum could still kick
in for other tables, e.g. catalog tables, but that seems less likely
to generate enough WAL to cause a new layer file to be flushed.
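
As a rough illustration of the fix described above, the sketch below builds
SQL that turns autovacuum off for a single table using PostgreSQL's
per-table `autovacuum_enabled` storage parameter. The helper names and the
example table `foo (t text)` are hypothetical and are not taken from
test_timeline_size.py; the actual statement used by the tests is not shown
on this page.

```python
def create_table_without_autovacuum(table: str, columns: str) -> str:
    """Return a CREATE TABLE statement with autovacuum disabled for that table.

    Hypothetical helper for illustration only. `table` and `columns` are
    assumed to be trusted constants written in the test, not user input,
    so plain string formatting is acceptable here.
    """
    # autovacuum_enabled is a per-table storage parameter in PostgreSQL;
    # other tables (including catalog tables) are not affected by it.
    return f"CREATE TABLE {table} ({columns}) WITH (autovacuum_enabled = off)"


def disable_autovacuum(table: str) -> str:
    """Return an ALTER TABLE statement that disables autovacuum on an existing table."""
    return f"ALTER TABLE {table} SET (autovacuum_enabled = off)"


if __name__ == "__main__":
    # Assumed example table; a test would pass these strings to its SQL cursor.
    print(create_table_without_autovacuum("foo", "t text"))
    print(disable_autovacuum("foo"))
```

Because the setting is per table, this matches the caveat above: autovacuum
can still run on other tables and generate some WAL.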

If this continues to be a problem in the future, we could simply retry
the physical size call a few times, if there's a mismatch. A mismatch
could happen every once in a while, but it's very unlikely to happen
more than once or twice in a row.
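
The retry idea above could look roughly like the following sketch. It is not
part of this commit. The two callables stand in for whatever the test uses to
fetch the incremental and non-incremental physical sizes; their names, and
the default of three attempts, are assumptions made for illustration.

```python
from typing import Callable, Tuple


def sizes_match_with_retry(
    get_incremental: Callable[[], int],
    get_non_incremental: Callable[[], int],
    attempts: int = 3,
) -> Tuple[bool, int, int]:
    """Compare the two size calculations, retrying on mismatch.

    Returns (matched, incremental, non_incremental) from the last attempt.
    Both callables are hypothetical stand-ins for the real size queries.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")

    incremental = non_incremental = -1
    for _ in range(attempts):
        # Fetch both values again on every attempt: a layer file flushed
        # between the two calls can make a single comparison disagree.
        incremental = get_incremental()
        non_incremental = get_non_incremental()
        if incremental == non_incremental:
            return True, incremental, non_incremental
    return False, incremental, non_incremental


if __name__ == "__main__":
    # Simulated sizes: the first comparison disagrees, the second agrees.
    inc = iter([100, 120])
    non_inc = iter([120, 120])
    print(sizes_match_with_retry(lambda: next(inc), lambda: next(non_inc)))
```

A test would assert on the first element of the result, so that a single
transient mismatch does not fail the run but a persistent one still does.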

Fixes https://github.com/neondatabase/neon/issues/2212

2022-10-19 23:50:21 +03:00

test_ancestor_branch.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_auth.py

tests: do not set num_safekeepers = 1, it's the default (#2457)

2022-09-15 21:43:51 +03:00

test_backpressure.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_basebackup_error.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_branch_and_gc.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_branch_behind.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_branching.py

Normalize last_record LSN in wal receiver (#2529)

2022-10-06 09:01:56 +03:00

test_broken_timeline.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_build_info_metric.py

Add build info metric to pageserver, safekeeper and proxy (#2596)

2022-10-11 09:54:32 +03:00

test_clog_truncate.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_close_fds.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_compute_ctl.py

Display sync safekeepers output in compute_ctl (#2571)

2022-10-06 13:53:52 +00:00

test_config.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_crafted_wal_end.py

tests: do not set num_safekeepers = 1, it's the default (#2457)

2022-09-15 21:43:51 +03:00

test_createdropdb.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_createuser.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_fsm_truncate.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_fullbackup.py

tests: do not set num_safekeepers = 1, it's the default (#2457)

2022-09-15 21:43:51 +03:00

test_gc_aggressive.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_gc_cutoff.py

Persists latest_gc_cutoff_lsn before performing GC (#2558)

2022-10-19 12:32:03 +03:00

test_import.py

Merge 'local' and 'remote' parts of TimelineInfo into one struct.

2022-10-14 18:37:14 +03:00

test_large_schema.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_lsn_mapping.py

Make get_lsn_by_timestamp available in mgmt API (#2536) (#2560)

2022-10-06 12:42:50 +03:00

test_multixact.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_neon_cli.py

Rename old project name references

2022-09-14 08:14:05 +03:00

test_next_xid.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_normal_work.py

Clean up terms "delete timeline" and "detach tenant".

2022-10-11 17:47:41 +03:00

test_old_request_lsn.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_pageserver_api.py

Merge 'local' and 'remote' parts of TimelineInfo into one struct.

2022-10-14 18:37:14 +03:00

test_pageserver_catchup.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_pageserver_restart.py

Add test that repeatedly kills and restarts the pageserver.

2022-09-06 13:00:40 +03:00

test_parallel_copy.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_pg_regress.py

use pg_version in python tests

2022-09-22 14:15:13 +03:00

test_pitr_gc.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_proxy.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_read_validation.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_readonly_node.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_recovery.py

Move testing pageserver libpq cmds to HTTP api (#2429)

2022-09-20 11:28:12 -07:00

test_remote_storage.py

remove redundant expect_tenant_to_download_timeline

2022-10-18 11:21:48 +03:00

test_setup.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_subxacts.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_tenant_conf.py

Increase default compaction_period setting to 20 s.

2022-10-07 13:55:19 +03:00

test_tenant_detach.py

Clean up terms "delete timeline" and "detach tenant".

2022-10-11 17:47:41 +03:00

test_tenant_relocation.py

Merge 'local' and 'remote' parts of TimelineInfo into one struct.

2022-10-14 18:37:14 +03:00

test_tenant_tasks.py

Rename old project name references

2022-09-14 08:14:05 +03:00

test_tenants_with_remote_storage.py

remove redundant expect_tenant_to_download_timeline

2022-10-18 11:21:48 +03:00

test_tenants.py

Return broken tenants due to non existing timelines dir (#2552) (#2575)

2022-10-12 22:28:39 +03:00

test_timeline_delete.py

Merge 'local' and 'remote' parts of TimelineInfo into one struct.

2022-10-14 18:37:14 +03:00

test_timeline_size.py

Fix flaky physical-size tests in test_timeline_size.py.

2022-10-19 23:50:21 +03:00

test_twophase.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_vm_bits.py

Reorganize python tests.

2022-08-30 18:25:38 +03:00

test_wal_acceptor_async.py

Merge 'local' and 'remote' parts of TimelineInfo into one struct.

2022-10-14 18:37:14 +03:00

test_wal_acceptor.py

Merge 'local' and 'remote' parts of TimelineInfo into one struct.

2022-10-14 18:37:14 +03:00

test_wal_restore.py

use pg_version in python tests

2022-09-22 14:15:13 +03:00