rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-10 06:52:55 +00:00

Author	SHA1	Message	Date
Alexander Bayandin	9fdd228dee	GitHub Actions: Add branch related actions (#2877 ) Add `neon-branch-create` / `neon-branch-delete` to allow using branches in tests. I have a couple of use cases in mind: - For destructive tests with a big DB, we can create the DB once in advance and then use branches without the need to recreate the DB itself after tests change it. - We can run tests in parallel (if there're compute-bound). Also migrate API v2 for `neon-project-create` / `neon-project-delete`	2022-11-25 18:18:08 +00:00
Heikki Linnakangas	15db566420	Allow setting gc/compaction_period to 0, to disable automatic GC/compaction Many python tests were setting the GC/compaction period to large values, to effectively disable GC / compaction. Reserve value 0 to mean "explicitly disabled". We also set them to 0 in unit tests now, although currently, unit tests don't launch the background jobs at all, so it won't have any effect. Fixes https://github.com/neondatabase/neon/issues/2917	2022-11-25 20:14:06 +02:00
Alexander Bayandin	1a316a264d	Disable statement timeout for performance tests (#2891 ) Fix `test_seqscans` by disabling statement timeout. Also, replace increasing statement timeout with disabling it for performance tests. This should make tests more stable and allow us to observe performance degradation instead of test failures.	2022-11-25 16:05:45 +00:00
Alexander Bayandin	aeeb782342	Make test runner compatible with Python 3.11 (#2915 ) NB: this PR doesn't update Python to 3.11; it makes tests compatible with it and fixes a couple of warnings by updating dependencies. - `poetry add asyncpg@latest` to fix `./scripts/pysync` - `poetry add boto3@latest "boto3-stubs[s3]@latest"` to fix ``` DeprecationWarning: 'cgi' is deprecated and slated for removal in Python 3.13 ``` - `poetry update certifi` to fix ``` DeprecationWarning: path is deprecated. Use files() instead. Refer to https://importlib-resources.readthedocs.io/en/latest/using.html#migrating-from-legacy for migration advice. ``` - Move `types-toml` from `dep-dependencies` to `dependencies` to keep it aligned with other `types-*` deps	2022-11-25 15:59:15 +00:00
Egor Suvorov	ae53dc3326	Add authentication between Safekeeper and Pageserver/Compute * Fix https://github.com/neondatabase/neon/issues/1854 * Never log Safekeeper::conninfo in walproposer as it now contains a secret token * control_panel, test_runner: generate and pass JWT tokens for Safekeeper to compute and pageserver * Compute: load JWT token for Safekepeer from the environment variable. Do not reuse the token from pageserver_connstring because it's embedded in there weirdly. * Pageserver: load JWT token for Safekeeper from the environment variable. * Rewrite docs/authentication.md	2022-11-25 04:17:42 +03:00
Egor Suvorov	1ca76776d0	pageserver: require management permissions on HTTP /status	2022-11-25 04:17:42 +03:00
Egor Suvorov	10d554fcbb	walproposer: refactor safekeeper::conninfo initialization It is used both in WalProposerInit and ResetConnection. In the future the logic will become more complicated due to authentication with Safekeeper.	2022-11-25 04:17:42 +03:00
Egor Suvorov	2ce5d8137d	Separate permission checks for Pageserver and Safekeeper There will be different scopes for those two, so authorization code should be different. The `check_permission` function is now not in the shared library. Its implementation is very similar to the one which will be added for Safekeeper. In fact, we may reuse the same existing root-like 'PageServerApi' scope, but I would prefer to have separate root-like scopes for services. Also, generate_management_token in tests is generate_pageserver_token now.	2022-11-25 04:17:42 +03:00
Egor Suvorov	a406783098	neon_fixtures: refactor AuthKeys to support more scopes	2022-11-25 04:17:42 +03:00
Alexey Kondratov	e6db4b63eb	[safekeeper] Serialize LSN in the `term_history` according to the spec (#2896 ) Use string format in the timeline status HTTP API reponse.	2022-11-24 17:19:01 +01:00
Arseny Sher	0b0cb77da4	Fix deploy after `2d42f84389`.	2022-11-24 20:07:41 +04:00
Dmitry Ivanov	47734fdb0a	[proxy] Move some tests to a dedicated module This unclutters the pivotal `proxy.rs` module.	2022-11-24 18:43:34 +03:00
Sergey Melnikov	9c886ac0a0	Use per-cluster DNS name for link proxy (#2911 )	2022-11-24 12:41:38 +01:00
Egor Suvorov	b6989e8928	pageserver: make `wal_source_connstring: String` a 'wal_source_connconf: PgConnectionConfig`	2022-11-24 14:02:23 +03:00
Egor Suvorov	46ea2a8e96	Continue #2724 : replace `Url`-based `PgConnectionConfig` with a hand-crafted struct Downsides are: * We store all components of the config separately. `Url` stores them inside a single `String` and a bunch of ints which point to different parts of the URL, which is probably more efficient. * It is now impossible to pass arbitrary connection strings to the configuration file, one has to support all components explicitly. However, we never supported anything except for `host:port` anyway. Upsides are: * This significantly restricts the space of possible connection strings, some of which may be either invalid or unsupported. E.g. Postgres' connection strings may include a bunch of parameters as query (e.g. `connect_timeout=`, `options=`). These are nether validated by the current implementation, nor passed to the postgres client library, Hence, storing separate fields expresses the intention better. * The same connection configuration may be represented as a URL in multiple ways (e.g. either `password=` in the query part or a standard URL password). Now we have a single canonical way. * Escaping is provided for `options=`. Other possibilities considered: * `newtype` with a `String` inside and some validation on creation. This is more efficient, but harder to log for two reasons: * Passwords should never end up in logs, so we have to somehow * Escaped `options=` are harder to read, especially if URL-encoded, and we use `options=` a lot.	2022-11-24 14:02:23 +03:00
Heikki Linnakangas	5bca7713c1	Improve comments on TenantStates	2022-11-24 12:26:15 +02:00
Heikki Linnakangas	99d9c23df5	Gather path-related consts and functions to one place. Feels more organized this way.	2022-11-24 12:26:15 +02:00
Dmitry Ivanov	05db6458df	[proxy] Fix project (endpoint) -related error messages	2022-11-23 23:03:29 +03:00
Arseny Sher	2d42f84389	Add storage_broker binary. Which ought to replace etcd. This patch only adds the binary and adjusts Dockerfile to include it; subsequent ones will add deploy of helm chart and the actual replacement. It is a simple and fast pub-sub message bus. In this patch only safekeeper message is supported, but others can be easily added. Compilation now requires protoc to be installed. Installing protobuf-compiler package is fine for Debian/Ubuntu. ref https://github.com/neondatabase/neon/pull/2733 https://github.com/neondatabase/neon/issues/2394	2022-11-23 22:05:59 +04:00
Sergey Melnikov	aee3eb6d19	Deploy link proxy to us-east-2 (#2905 )	2022-11-23 18:11:44 +01:00
Konstantin Knizhnik	a6e4a3c3ef	Implement corrent truncation of FSM/VM forks on arbitrary position (#2609 ) refer #2601 Co-authored-by: Anastasia Lubennikova <anastasia@neon.tech>	2022-11-23 18:46:07 +02:00
Konstantin Knizhnik	21ec28d9bc	Add bulk update test (#2902 )	2022-11-23 17:51:35 +02:00
Heikki Linnakangas	de8f24583f	Remove obsolete 'zenith_ctl' alias from compute images	2022-11-23 16:58:31 +02:00
Sergey Melnikov	85f0975c5a	Setup eu-west-1 as region for PR testing (#2757 )	2022-11-23 10:54:39 +01:00
Konstantin Knizhnik	1af087449a	Reduce max_replication_write_lag to 10Mb (#1793 )	2022-11-23 08:41:22 +02:00
Heikki Linnakangas	37625c4433	Remove obsolete design doc. I considered archiving this under docs/rfcs, but looking at the contents, I don't think it's relevant at all anymore. So let's just remove it.	2022-11-23 00:40:17 +02:00
Heikki Linnakangas	e9f4ca5972	Remove references to obsolete files in .gitignore	2022-11-23 00:40:17 +02:00
Alexey Kondratov	4bf3087aed	[pageserver] list `latest_gc_cutoff_lsn` in the OpenAPI spec (#2894 ) It seems that it's present in the API response for quite a while. It's just not listed in the spec, fix it.	2022-11-22 21:10:49 +01:00
Dmitry Ivanov	9470bc9fe0	[proxy] Implement per-tenant traffic metrics	2022-11-22 18:50:57 +03:00
Heikki Linnakangas	86e483f87b	Fix tenant size modeling code to include WAL at end of branch Imagine that you have a tenant with a single branch like this: ---------------==========> ^ gc horizon where: ---- is the portion of the branch that is older than retention period ==== is the portion of the branch that is newer than retention period. Before this commit, the sizing model included the logical size at the GC horizon, but not the WAL after that. In particular, that meant that on a newly created tenant with just one timeline, where the retention period covered the whole history of the timeline, i.e. gc_cutoff was 0, the calculated tenant size was always zero. We now include the WAL after the GC horizon in the size. So in the above example, the calculated tenant size would be the logical size of the database the GC horizon, plus all the WAL after it (marked with ===). This adds a new `insert_point` function to the sizing model, alongside `modify_branch`, and changes the code in size.rs to use the new function. The new function takes an absolute lsn and logical size as argument, so we no longer need to calculate the difference to the previous point. Also, the end-size is now optional, because we now need to add a point to represent the end of each branch to the model, but we don't want to or need to calculate the logical size at that point.	2022-11-22 17:11:27 +02:00
Christian Schwarz	f50d0ec0c9	test_runner: ignore 'sender is dropped while join handle is still alive' warnings The need for a proper solution to this is tracked in https://github.com/neondatabase/neon/issues/2885	2022-11-22 11:30:34 +01:00
Sergey Melnikov	74ec36a1bf	Add pageserver-1.us-east-2.aws.neon.build (#2881 )	2022-11-22 10:55:02 +01:00
Anastasia Lubennikova	a63ebb6446	Update vendor postgres to 14.6 and 15.1	2022-11-22 10:46:21 +02:00
Alexander Stanovoy	a5b898a31c	Fix the order of checks in LSN (#2882 ) We should check if LSN is in the lower range because it's constant and only after wait for LSN to arrive if needed.	2022-11-22 02:28:41 +02:00
bojanserafimov	c6f095a821	Fix remote seqscan test (#2878 )	2022-11-21 17:21:47 -05:00
Alexander Bayandin	6b2bc7f775	Nightly Benchmarks: Add RDS Postgres (#2859 ) Add RDS Postgres `db.m5.large` instance to Nightly Benchmarks	2022-11-21 15:25:09 +00:00
Heikki Linnakangas	6c97fc941a	Enable passing FAILPOINTS at startup. - Pass through FAILPOINTS environment variable to the pageserver in "neon_local pageserver start" command - On startup, list any failpoints that were set with FAILPOINTS to the log - Add optional "extra_env_vars" argument to the NeonPageserver.start() function in the python fixture, so that you can pass FAILPOINTS None of the tests use this functionality yet; that comes in a separate commit. closes https://github.com/neondatabase/neon/pull/2865	2022-11-21 16:24:19 +01:00
Alexander Bayandin	cb9b26776e	Fix test_seqscans on remote cluster (#2869 ) A remote project is reused between tests, so we need to ensure that we don't have a table with the same name already created.	2022-11-19 23:39:42 +00:00
Heikki Linnakangas	684329d4d2	Another attempt at silencing test_gc_cutoff failures. Increse the pgbench runtimes even further. The theory is that when there are many other tests running at the same time, one pgbench run could take a long time until it generates enough layers for GC to kick in.	2022-11-19 19:28:56 +02:00
Heikki Linnakangas	ed40a045c0	Add more logging to track down test_gc_cutoff failure. see https://github.com/neondatabase/neon/issues/2856	2022-11-19 14:12:21 +02:00
Heikki Linnakangas	3f39327622	Silence a few compiler warnings I saw these from the build of the compute docker image in the CI (compute-node-image-v15): pagestore_smgr.c: In function 'neon_prefetch': pagestore_smgr.c:1654:2: warning: ISO C90 forbids mixed declarations and code [-Wdeclaration-after-statement] 1654 \| BufferTag tag = (BufferTag) { \| ^~~~~~~~~ walproposer.c:197:1: warning: no previous prototype for 'WalProposerSync' [-Wmissing-prototypes] 197 \| WalProposerSync(int argc, char *argv[]) \| ^~~~~~~~~~~~~~~ libpagestore.c: In function 'pageserver_connect': libpagestore.c💯9: warning: variable 'wc' set but not used [-Wunused-but-set-variable] 100 \| int wc; \| ^~ libpagestore.c: In function 'call_PQgetCopyData': libpagestore.c:144:9: warning: variable 'wc' set but not used [-Wunused-but-set-variable] 144 \| int wc; \| ^~ Harmless warnings, but let's be tidy. In the passing, I added some "extern" to a few function declarations that were missing them, and marked WalProposerSync as "static". Those changes are also purely cosmetic.	2022-11-19 14:11:04 +02:00
Heikki Linnakangas	a50a7e8ac0	Try to silence test_gc_cutoff flakiness. Commit `d013a2b227` changed the test, so that it fails if pgbench runs to completion without triggering the failpoint. That has now happened several times in the CI. That's not expected, so this needs some investigation, but as a quick fix just make the pgbench runs longer so that we're closer to the situation before commit `d013a2b227`. See https://github.com/neondatabase/neon/issues/2856	2022-11-19 01:19:09 +02:00
Egor Suvorov	e28eda7939	sourcetree/docs: mention hakari generate (#2864 )	2022-11-18 22:30:41 +00:00
Christian Schwarz	f564dff0e3	make test_tenant_detach_smoke fail reproducibly Add failpoint that triggers the race condition. Skip test until we'll land the fix from https://github.com/neondatabase/neon/pull/2851 with https://github.com/neondatabase/neon/pull/2785	2022-11-18 17:15:34 +01:00
Christian Schwarz	d783889a1f	timeline: explicit tracking of flush loop state: NotStarted, Running, Exited This allows us to error out in the case where we request flush but the flush loop is not running. Before, we would only track whether it was started, but not when it exited. Better to use an enum with 3 states than a 2-state bool because then the error message can answer the question whether we ever started the flush loop or not.	2022-11-18 17:15:34 +01:00
bojanserafimov	2655bdbb2e	Add remote seqscans test (#2840 )	2022-11-18 09:05:13 -05:00
Konstantin Knizhnik	b9152f1ef4	Correctly terminate prefetch in case of pageserver restart (#2850 ) refer #2819 This patch requires deep knowledge of prefetch internals. So @MMeent please review it or suggest better solution.	2022-11-18 15:04:58 +02:00
Heikki Linnakangas	328ec1ce24	Print a more full error message, with stack trace, on GC failure. In a CI run, I got a test failure because of this error in the log, from the test_get_tenant_size_with_multiple_branches test: ERROR gc_loop{tenant_id=f1630516d4b526139836ced93be0c878}: Gc failed, retrying in 2s: No such file or directory (os error 2) There are known race conditions between GC and timeline deletion, which surely caused that error. But if we didn't know the cause, it would be pretty hard to debug without a stack trace.	2022-11-18 11:44:00 +02:00
Heikki Linnakangas	dcb79ef08f	Silence yet another test failure from race condition between GC and delete. Another similar case to commit `9ae4da4f31`.	2022-11-18 10:18:15 +02:00
Konstantin Knizhnik	fd99e0fbc4	Build pg_prewrm extension (#2794 )	2022-11-18 09:10:32 +02:00

1 2 3 4 5 ...

2385 Commits