rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-15 04:00:38 +00:00

Author	SHA1	Message	Date
Arthur Petukhovsky	5c904dd5d8	Use ip instead of localhost to not resolve anything	2021-10-18 22:33:36 +03:00
Arseny Sher	de744a44dd	Add /timeline http request to safekeeper returning its status. Which is mainly generational state (terms) and useful LSNs. Also add /status basic healthcheck request which is now used in tests to determine the safekeeper is up; this fixes #726. ref #115	2021-10-14 19:02:38 +03:00
Arthur Petukhovsky	4b87acb1f6	Use logging in python tests (#674 ) * Use logging in python tests * Use f-strings for logs * Don't log test output while running * Use only pytest logging handler * Add more info about pytest logging	2021-10-14 13:10:09 +03:00
Egor Suvorov	23f4c0a742	Rename `wal_acceptor` binary to `safekeeper` (#740 ), stage 1/2 * Rename wal_acceptor binary to safekeeper * Rename wal_acceptor.pid and wal_acceptor.log to safekeeper.pid and safekeeper.log * Change some mentions of WAL acceptor to safekeeper * Dockerfile: alias wal_acceptor to safekeeper temporarily until internal scripts are updated	2021-10-12 22:03:06 +03:00
anastasia	d7c9dd06f4	Implement graceful shutdown at 'pageserver stop': - perform checkpoint for each tenant repository. - wait for the completion of all threads. Add new option 'immediate' to 'pageserver stop' command to terminate the pageserver immediately.	2021-10-11 13:35:01 +03:00
Egor Suvorov	403d9779d9	safekeeper: add initial metrics and HTTP handler (#699 , #541 ) * `wal_acceptor`: add HTTP handler, /metrics endpoint only, no authentication * Two gauges are currently reported: `flush_lsn` and `commit_lsn` * Add `DEFAULT_PG_LISTEN_PORT` and `DEFAULT_PG_LISTEN_PORT` consts for uniformity	2021-10-08 18:55:41 +03:00
Patrick Insinger	b3b8f18f61	tests - fix get_timeline_size signature	2021-10-07 15:38:22 -07:00
Heikki Linnakangas	c660926a06	Refactor duplicated code to get on-disk timeline size in tests. Move it to a common function. In the passing, remove the obsolete check to exclude the 'wal' directory. The 'wal' directory is no more.	2021-10-08 00:34:26 +03:00
Heikki Linnakangas	db4059cd6d	Measure peak memory usage in perf test. Another useful metric to keep an eye on.	2021-10-07 18:03:20 +03:00
Egor Suvorov	05fe39088b	Readme updates based on a fresher Ubuntu installation experience (#627 )	2021-10-05 19:19:25 +03:00
Egor Suvorov	7e190d72a5	Make `pageserver_` prefix for common metric names configurable (#681 )	2021-10-05 19:06:44 +03:00
Arthur Petukhovsky	d6fc74a412	Various fixes for test_sync_safekeepers (#668 ) * Send ProposerGreeting manually in tests * Move test_sync_safekeepers to test_wal_acceptor.py * Capture test_sync_safekeepers output * Add comment for handle_json_ctrl * Save captured output in CI	2021-09-28 19:25:05 +03:00
Arthur Petukhovsky	d4e037f1e7	Support for `--sync-safekeepers` in tests (#647 ) New command has been added to append specially crafted records in safekeeper WAL. This command takes json for append, encodes LogicalMessage based on json fields, and processes new AppendRequest to append and commit WAL in safekeeper. Python test starts up walkeepers and creates config for walproposer, then appends WAL and checks --sync-safekeepers works without errors. This test is simplest one, more useful test cases (like in #545) for different setups will be added soon.	2021-09-24 13:19:59 +03:00
Arthur Petukhovsky	8ebf2fe550	Add test for acceptor restarts under load (#591 ) In this test safekeepers are restarted one by one, while bank transactions are executed and validated in the background. Bank transactions consist of balance transfers and log writes. In the end balance sum should remain the same and there should be progress from every client, when 2 of 3 safekeeper nodes are up.	2021-09-22 11:59:20 +03:00
Dmitry Rodionov	b7aac87ec1	fix port distribution so services do not use ephemeral ports	2021-09-20 18:44:42 +03:00
Heikki Linnakangas	c2af6d98db	Don't print 'pg_controldata' output after every startup in tests. It's not interesting for most tests, and clutters the output. If there are individual tests where it is worthwhole, let's add pg_controldata calls to those tests, but I don't think it's needed for now.	2021-09-17 20:04:29 +03:00
Dmitry Ivanov	7b3fb760fa	[test_runner] psql should be oblivious to user's preferences This makes psql ignore $HOME/.psqlrc	2021-09-17 14:16:23 +03:00
Dmitry Rodionov	01ef2baef0	show more context for zenith cli run errors	2021-09-15 14:02:15 +03:00
Dmitry Rodionov	9563336d9a	Bring back check for interferring processes, add more comments and descriptive errors	2021-09-15 14:02:15 +03:00
Dmitry Rodionov	4ebe643d0c	Support parallel test running for python tests Support is done via pytest-xdist plugin. To use the feature add -n<concurrency> to pytest invocation e.g. pytest -n8 to run 8 tests in parallel. Changes in code are mostly about ports assigning. Previously port for pageserver was hardcoded without the ability to override through zenith cli and ports for started compute nodes were calculated twice, in zenith cli and in test code. Now zenith cli supports port arguments for pageserver and compute nodes to be passed explicitly. Tests are modified in such a way that each worker gets a non overlapping port range which can be configured and now contains 100 ports. These ports are distributed to test services (pageserver, wal acceptors, compute nodes) so they can work independently.	2021-09-15 14:02:15 +03:00
Dmitry Rodionov	b4ecae33e4	add incremental tracking of logical timeline size In order to exclude problems with synchronizing disk and memory logical size is not stored in metadata on disk. It is calculated on timeline "start" by scanning the contents of layered repo and then size is maintained via an atomic variable. This patch also adds new endpoint to pageserver http api: branch detail. It allows retrieval of a particular branch info by its name. Size info is also added to the response of the endpoint and used in tests.	2021-09-07 18:25:15 +03:00
anastasia	6f0c065743	preserve filediff artifacts in CI	2021-09-07 16:58:21 +03:00
anastasia	94c50e3e90	Fix check_restored_datadir_content(). Call 'basebackup' command directly, instead of relying on CLI	2021-09-07 16:58:21 +03:00
anastasia	eb3fd7a8da	print diff for mismatching files in check_restored_datadir_content()	2021-09-06 18:21:23 +03:00
anastasia	1e172230ce	Add test funciton to compare files in compute nodes to catch bugs in SLRU replay. Compare files in existing compute node's pgdata with fresh basebackup at the same lsn. We expect that content is identical, except tmp files Use it after some tests.	2021-09-06 18:21:23 +03:00
Stas Kelvich	ed4eed0a19	Make use of `postgres --sync-safekeepers` in tests and CLI. Change control plane code to call `postgres --sync-safekeepers` before compute node start when safekeepers are enabled. Now `pg create` will create an empty data directory with the proper config file. Subsequent `pg start` will run `sync-safekeepers` and will call basebackup with the resulting LSN. Also change few tests to accommodate this new behavior.	2021-09-06 13:06:20 +03:00
Heikki Linnakangas	c6678c5dea	Include # of bytes written in pgbench benchmark result Now that the page server collects this metric (since commit `212920e47e`), let's include it in the performance test results The new metric looks like this: performance/test_perf_pgbench.py . [100%] --------------- Benchmark results ---------------- test_pgbench.init: 6.784 s test_pgbench.pageserver_writes: 466 MB <---- THIS IS NEW test_pgbench.5000_xacts: 8.196 s test_pgbench.size: 163 MB =============== 1 passed in 21.00s ===============	2021-09-03 09:00:26 +03:00
Kirill Bulatov	0e4cbe0165	Fix some typos	2021-09-02 17:27:18 +03:00
Stas Kelvich	ddd2c83c64	Change test_restart_compute to expose safekeeper problems. Make this test look like 'test_compute_restart.sh' by @ololobus, which was surprisingly good for checking safekeepers behavior. This test adds an intermediate compute node start with bulk select that causes a lot of FPI's and select itself wouldn't wait for all that WAL to be replicated. So if we kill compute node right after that we end up with lagging safekeepers with VCL != flush_lsn. And starting new node from that state takes special care. Also, run and print `pg_controldata` output after each compute node start to eyeball lsn/checkpoint info of basebackup. This commit only adds test without fixing the problem.	2021-09-02 12:06:12 +03:00
anastasia	27442c3daa	Add test for DROP DATABASE command	2021-08-30 17:29:29 +03:00
Heikki Linnakangas	074bd3bb12	Add basic performance test framework. This provides a pytest fixture to record metrics from pytest tests. The The recorded metrics are printed out at the end of the tests. As a starter, this includes on small test, using pgbench. It prints out three metrics: the initialization time, runtime of 5000 xacts, and the repository size after the tests.	2021-08-27 21:00:45 +03:00
Dmitry Rodionov	23b5249512	translate pageserver api to http	2021-08-24 19:05:00 +03:00
anastasia	20e6cd7724	Update test_twophase - check that we correctly restore files at compute node start.	2021-08-19 12:15:09 +03:00
anastasia	cbeb67067c	Issue #367 . Change CLI so that we always create node from scratch at 'pg start'. This operation preserve previously existing config Add new flag '--config-only' to 'pg create'. If this flag is passed, don't perform basebackup, just fill initial postgresql.conf for the node.	2021-08-17 18:12:31 +03:00
Dmitry Rodionov	0c4ab80eac	try to be more intelligent in WalAcceptor.start, added a bunch of typing sugar to wal acceptor fixtures	2021-08-16 14:27:44 +03:00
Dmitry Rodionov	ce5333656f	Introduce authentication v0.1. Current state with authentication. Page server validates JWT token passed as a password during connection phase and later when performing an action such as create branch tenant parameter of an operation is validated to match one submitted in token. To allow access from console there is dedicated scope: PageServerApi, this scope allows access to all tenants. See code for access validation in: PageServerHandler::check_permission. Because we are in progress of refactoring of communication layer involving wal proposer protocol, and safekeeper<->pageserver. Safekeeper now doesn’t check token passed from compute, and uses “hardcoded” token passed via environment variable to communicate with pageserver. Compute postgres now takes token from environment variable and passes it as a password field in pageserver connection. It is not passed through settings because then user will be able to retrieve it using pg_settings or SHOW .. I’ve added basic test in test_auth.py. Probably after we add authentication to remaining network paths we should enable it by default and switch all existing tests to use it.	2021-08-11 20:05:54 +03:00
anastasia	949ac54401	Add test of clog (pg_xact) truncation	2021-08-11 05:49:24 +03:00
Dmitry Rodionov	767590bbd5	support tenants this patch adds support for tenants. This touches mostly pageserver. Directory layout on disk is changed to contain new layer of indirection. Now path to particular repository has the following structure: <pageserver workdir>/tenants/<tenant id>. Tenant id has the same format as timeline id. Tenant id is included in pageserver commands when needed. Also new commands are available in pageserver: tenant_list, tenant_create. This is also reflected CLI. During init default tenant is created and it's id is saved in CLI config, so following commands can use it without extra options. Tenant id is also included in compute postgres configuration, so it can be passed via ServerInfo to safekeeper and in connection string to pageserver. For more info see docs/multitenancy.md.	2021-07-22 20:54:20 +03:00
Stas Kelvich	791312824d	set superuser name in python tests too	2021-07-21 17:22:22 +03:00
Dmitry Rodionov	75e717fe86	allow both domains and ip addresses in connection options for pageserver and wal keeper. Also updated PageServerNode definition in control plane to account for that. resolves #303	2021-07-09 16:46:21 +03:00
Dmitry Ivanov	257ade0688	Extract PostgreSQL connection logic into PgProtocol This patch aims to: * Unify connection & querying logic of ZenithPagerserver and Postgres. * Mitigate changes to transaction machinery introduced in `psycopg2 >= 2.9`. Now it's possible to acquire db connection using the corresponding method: ```python pg = postgres.create_start('main') conn = pg.connect() ... conn.close() ``` This pattern can be further improved with the help of `closing`: ```python from contextlib import closing pg = postgres.create_start('main') with closing(pg.connect()) as conn: ... ``` All connections produced by this method will have autocommit enabled by default.	2021-06-17 20:19:04 +03:00
Dmitry Ivanov	43ece6e2a2	Fix test_runner's fixtures for python 3.6 Apparently, Literal type is only available since 3.8.	2021-06-17 20:19:04 +03:00
Arseny Sher	37b0236e9a	Move wal acceptor tests to python. Includes fixtures for wal acceptors and associated setup. Nothing really new here, but surprisingly this caught some issues in walproposer. ref #182	2021-06-15 15:14:27 +03:00
Dmitry Ivanov	96c7594d29	Enable some kind of gradual typing in test_runner (#222 ) It's not realistic to enable full-blown type checks within test_runner's codebase, since the amount of warnings revealed by mypy is overwhelming. Tests are supposed to be easy to use, so we can't cripple everybody's workflow for the sake of imaginary benefit. Ultimately, the purpose of this attempt is three-fold: * Facilitate code navigation when paired with python-language-server. * Make method signatures apparent to a fellow programmer. * Occasionally catch some obvious type errors.	2021-06-10 22:53:15 +03:00
anastasia	05a681be2c	add createuser test to test shared catalog restore	2021-06-09 00:31:09 +03:00
Dmitry Ivanov	244fcffc50	Fix typos found by codespell	2021-06-01 21:43:26 +03:00
Dmitry Ivanov	00ce635da9	Reformat tests using yapf	2021-06-01 21:09:09 +03:00
Dmitry Ivanov	7d5f7462c1	Tidy up pytest-based tests	2021-06-01 21:09:09 +03:00
Heikki Linnakangas	1af6607fc3	Add a test for restarting and recreating compute node. This is working; let's keep it that way. This also adds test coverage for the 'zenith pg stop --destroy' option that was added in commit `6ad6e5bd`.	2021-05-27 12:59:45 +03:00
Heikki Linnakangas	22b7e74c83	Add test for following relmapper files at CREATE DATABASE	2021-05-21 12:13:47 +03:00

1 2

61 Commits