rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-15 09:22:55 +00:00

Author	SHA1	Message	Date
Anastasia Lubennikova	38b0db734e	Fix handling of contribs that have non-default directory	2023-06-29 12:50:43 +03:00
Alek Westover	316082d770	make error set default value rather than panicing	2023-06-28 17:35:17 -04:00
Alek Westover	8b586ea748	delete obsolete code	2023-06-28 17:17:03 -04:00
Alek Westover	b4abbfe6fb	fix clippy; use oncelock instead of mutex because it is more appropriate	2023-06-28 16:49:04 -04:00
Alek Westover	06763949c0	fix clippy problems	2023-06-28 15:06:55 -04:00
Alek Westover	a6c9a4abe7	cache extensions	2023-06-28 14:51:55 -04:00
Anastasia Lubennikova	0875c2284c	Fix shared_preload_libraries parsing. Don't try to download already existing shared_preload_libraries files	2023-06-28 19:20:36 +03:00
Alek Westover	2089d02f94	finish merge	2023-06-28 12:01:50 -04:00
Alek Westover	a6f142c4f6	Merge branch 'extension_server' of github.com:neondatabase/neon into extension_server	2023-06-28 11:59:48 -04:00
Alek Westover	ce0da2889e	cache available libraries: increases efficiency, and reduces code duplication	2023-06-28 11:48:55 -04:00
Anastasia Lubennikova	ceebc8e870	Parse private_ext_prefixes from spec. TODO: ensure that this spec change is backward/forward compatible	2023-06-28 15:46:01 +03:00
Anastasia Lubennikova	1d407f937f	Handle contribs that have non-default directory. Handle extension SQL files in private path. Bump vendor/postgres. Refactoting: unify some shared code	2023-06-28 15:29:02 +03:00
Alek Westover	b402dfd9c7	added support for private libraries. refactored library downloading function to be more efficient by using a hashmap and to reduce code duplication	2023-06-27 16:56:11 -04:00
Alek Westover	5e882d8359	use test bucket for test extensions	2023-06-27 14:50:47 -04:00
Alek Westover	b999b27655	python black style fix	2023-06-27 10:25:05 -04:00
Anastasia Lubennikova	ebe503e6f8	Test cleanup	2023-06-27 16:46:03 +03:00
Anastasia Lubennikova	604d2643d4	Clenup pytest code	2023-06-27 16:17:59 +03:00
Anastasia Lubennikova	7357b7cad5	Code cleanup	2023-06-27 15:29:33 +03:00
Alek Westover	a2e154f07b	real s3 and tenant specific files	2023-06-26 15:25:20 -04:00
Anastasia Lubennikova	7667fdc5c8	use pgVersion parameter in test_download_extesion	2023-06-26 20:54:11 +03:00
Anastasia Lubennikova	85d02acdb4	Merge branch 'main' into extension_server	2023-06-26 20:46:53 +03:00
Anastasia Lubennikova	5f986875bf	WIP debug shared_libs	2023-06-26 20:15:38 +03:00
Anastasia Lubennikova	1104de0b9b	refactoring - enable CREATE EXTENSION and LOAD test - change test_file_download to use mock_s3 - some code cleanup - add caching of extensions_list - WIP downloading of shared_preload_libraries (not tested yet)	2023-06-26 18:54:38 +03:00
Christian Schwarz	1faf69a698	run `Layer::get_value_reconstruct_data` in `spawn_blocking` (#4498 ) This PR concludes the "async `Layer::get_value_reconstruct_data`" project. The problem we're solving is that, before this patch, we'd execute `Layer::get_value_reconstruct_data` on the tokio executor threads. This function is IO- and/or CPU-intensive. The IO is using VirtualFile / std::fs; hence it's blocking. This results in unfairness towards other tokio tasks, especially under (disk) load. Some context can be found at https://github.com/neondatabase/neon/issues/4154 where I suspect (but can't prove) load spikes of logical size calculation to cause heavy eviction skew. Sadly we don't have tokio runtime/scheduler metrics to quantify the unfairness. But generally, we know blocking the executor threads on std::fs IO is bad. So, let's have this change and watch out for severe perf regressions in staging & during rollout. ## Changes * rename `Layer::get_value_reconstruct_data` to `Layer::get_value_reconstruct_data_blocking` * add a new blanket impl'd `Layer::get_value_reconstruct_data` `async_trait` method that runs `get_value_reconstruct_data_blocking` inside `spawn_blocking`. * The `spawn_blocking` requires `'static` lifetime of the captured variables; hence I had to change the data flow to _move_ the `ValueReconstructState` into and back out of get_value_reconstruct_data instead of passing a reference. It's a small struct, so I don't expect a big performance penalty. ## Performance Fundamentally, the code changes cause the following performance-relevant changes: * Latency & allocations: each `get_value_reconstruct_data` call now makes a short-lived allocation because `async_trait` is just sugar for boxed futures under the hood * Latency: `spawn_blocking` adds some latency because it needs to move the work to a thread pool * using `spawn_blocking` plus the existing synchronous code inside is probably more efficient better than switching all the synchronous code to tokio::fs because _each_ tokio::fs call does `spawn_blocking` under the hood. * Throughput: the `spawn_blocking` thread pool is much larger than the async executor thread pool. Hence, as long as the disks can keep up, which they should according to AWS specs, we will be able to deliver higher `get_value_reconstruct_data` throughput. * Disk IOPS utilization: we will see higher disk utilization if we get more throughput. Not a problem because the disks in prod are currently under-utilized, according to node_exporter metrics & the AWS specs. * CPU utilization: at higher throughput, CPU utilization will be higher. Slightly higher latency under regular load is acceptable given the throughput gains and expected better fairness during disk load peaks, such as logical size calculation peaks uncovered in #4154. ## Full Stack Of Preliminary PRs This PR builds on top of the following preliminary PRs 1. Clean-ups * https://github.com/neondatabase/neon/pull/4316 * https://github.com/neondatabase/neon/pull/4317 * https://github.com/neondatabase/neon/pull/4318 * https://github.com/neondatabase/neon/pull/4319 * https://github.com/neondatabase/neon/pull/4321 * Note: these were mostly to find an alternative to #4291, which I thought we'd need in my original plan where we would need to convert `Tenant::timelines` into an async locking primitive (#4333). In reviews, we walked away from that, but these cleanups were still quite useful. 2. https://github.com/neondatabase/neon/pull/4364 3. https://github.com/neondatabase/neon/pull/4472 4. https://github.com/neondatabase/neon/pull/4476 5. https://github.com/neondatabase/neon/pull/4477 6. https://github.com/neondatabase/neon/pull/4485 7. https://github.com/neondatabase/neon/pull/4441	2023-06-26 11:43:11 +02:00
Christian Schwarz	44a441080d	bring back spawn_blocking for `compact_level0_phase1` (#4537 ) The stats for `compact_level0_phase` that I added in #4527 show the following breakdown (24h data from prod, only looking at compactions with > 1 L1 produced): * 10%ish of wall-clock time spent between the two read locks * I learned that the `DeltaLayer::iter()` and `DeltaLayer::key_iter()` calls actually do IO, even before we call `.next()`. I suspect that is why they take so much time between the locks. * 80+% of wall-clock time spent writing layer files * Lock acquisition time is irrelevant (low double-digit microseconds at most) * The generation of the holes holds the read lock for a relatively long time and it's proportional to the amount of keys / IO required to iterate over them (max: 110ms in prod; staging (nightly benchmarks): multiple seconds). Find below screenshots from my ad-hoc spreadsheet + some graphs. <img width="1182" alt="image" src="https://github.com/neondatabase/neon/assets/956573/81398b3f-6fa1-40dd-9887-46a4715d9194"> <img width="901" alt="image" src="https://github.com/neondatabase/neon/assets/956573/e4ac0393-f2c1-4187-a5e5-39a8b0c394c9"> <img width="210" alt="image" src="https://github.com/neondatabase/neon/assets/956573/7977ade7-6aa5-4773-a0a2-f9729aecee0d"> ## Changes In This PR This PR makes the following changes: * rearrange the `compact_level0_phase1` code such that we build the `all_keys_iter` and `all_values_iter` later than before * only grab the `Timeline::layers` lock once, and hold it until we've computed the holes * run compact_level0_phase1 in spawn_blocking, pre-grabbing the `Timeline::layers` lock in the async code and passing it in as an `OwnedRwLockReadGuard`. * the code inside spawn_blocking drops this guard after computing the holds * the `OwnedRwLockReadGuard` requires the `Timeline::layers` to be wrapped in an `Arc`. I think that's Ok, the locking for the RwLock is more heavy-weight than an additional pointer indirection. ## Alternatives Considered The naive alternative is to throw the entire function into `spawn_blocking`, and use `blocking_read` for `Timeline::layers` access. What I've done in this PR is better because, with this alternative, 1. while we `blocking_read()`, we'd waste one slot in the spawn_blocking pool 2. there's deadlock risk because the spawn_blocking pool is a finite resource ![image](https://github.com/neondatabase/neon/assets/956573/46c419f1-6695-467e-b315-9d1fc0949058) ## Metadata Fixes https://github.com/neondatabase/neon/issues/4492	2023-06-26 11:42:17 +02:00
Sasha Krassovsky	c215389f1c	quote_ident identifiers when creating neon_superuser (#4562 ) ## Problem	2023-06-24 10:34:15 +03:00
Sasha Krassovsky	b1477b4448	Create neon_superuser role, grant it to roles created from control plane (#4425 ) ## Problem Currently, if a user creates a role, it won't by default have any grants applied to it. If the compute restarts, the grants get applied. This gives a very strange UX of being able to drop roles/not have any access to anything at first, and then once something triggers a config application, suddenly grants are applied. This removes these grants.	2023-06-24 01:38:27 +03:00
Christian Schwarz	a500bb06fb	use preinitialize_metrics to initialize page cache metrics (#4557 ) This is follow-up to ``` commit `2252c5c282` Author: Alex Chi Z <iskyzh@gmail.com> Date: Wed Jun 14 17:12:34 2023 -0400 metrics: convert some metrics to pageserver-level (#4490) ```	2023-06-23 16:40:50 -04:00
Alek Westover	a8f848b5de	test reals3	2023-06-23 15:51:49 -04:00
Christian Schwarz	15456625c2	don't use MGMT_REQUEST_RUNTIME for consumption metrics synthetic size worker (#4560 ) The consumption metrics synthetic size worker does logical size calculation. Logical size calculation currently does synchronous disk IO. This blocks the MGMT_REQUEST_RUNTIME's executor threads, starving other futures. While there's work on the way to move the synchronous disk IO into spawn_blocking, the quickfix here is to use the BACKGROUND_RUNTIME instead of MGMT_REQUEST_RUNTIME. Actually it's not just a quickfix. We simply shouldn't be blocking MGMT_REQUEST_RUNTIME executor threads on CPU or sync disk IO. That work isn't done yet, as many of the mgmt tasks still _do_ disk IO. But it's not as intensive as the logical size calculations that we're fixing here. While we're at it, fix disk-usage-based eviction in a similar way. It wasn't the culprit here, according to prod logs, but it can theoretically be a little CPU-intensive. More context, including graphs from Prod: https://neondb.slack.com/archives/C03F5SM1N02/p1687541681336949	2023-06-23 15:40:36 -04:00
Alek Westover	ca59330df8	modify file names	2023-06-23 15:28:30 -04:00
Alek Westover	36bb5ad527	refactor	2023-06-23 14:47:48 -04:00
Alek Westover	6532daf528	add todos	2023-06-23 14:07:10 -04:00
Alek Westover	8bc128e474	real s3 tests	2023-06-23 13:41:08 -04:00
Alek Westover	c3994541eb	add real s3 tests	2023-06-23 13:26:28 -04:00
Anastasia Lubennikova	7cdcc8a500	Fix downloading of sql files for extension and libraries. Rust code refactoring and C code fixes. Add test for CREATE EXTENSION and LOAD 'library'	2023-06-23 20:25:14 +03:00
Vadim Kharitonov	a3f0dd2d30	Compile `pg_uuidv7` (#4558 ) Doc says that it should be added into `shared_preload_libraries`, but, practically, it's not required. ``` postgres=# create extension pg_uuidv7; CREATE EXTENSION postgres=# SELECT uuid_generate_v7(); uuid_generate_v7 -------------------------------------- 0188e823-3f8f-796c-a92c-833b0b2d1746 (1 row) ```	2023-06-23 15:56:49 +01:00
Christian Schwarz	76718472be	add pageserver-global histogram for basebackup latency (#4559 ) The histogram distinguishes by ok/err. I took the liberty to create a small abstraction for such use cases. It helps keep the label values inside `metrics.rs`, right next to the place where the metric and its labels are declared.	2023-06-23 16:42:12 +02:00
Alek Westover	4201f4695f	fix typo	2023-06-23 10:03:41 -04:00
Alek Westover	5776df15da	try to add library extensions	2023-06-23 09:33:59 -04:00
Alek Westover	31aa0283b0	More Extension Features (#4555 ) Added tenant specific extensions and more tests	2023-06-23 09:30:49 -04:00
Alek Westover	fd3dfe9d52	fix typo	2023-06-23 08:04:37 -04:00
Alexander Bayandin	c07b6ffbdc	Fix git tag name for release (#4545 ) ## Problem A git tag for a release has an extra `release-` prefix (it looks like `release-release-3439`). ## Summary of changes - Do not add `release-` prefix when create git tag	2023-06-23 12:52:17 +01:00
Alexander Bayandin	6c3605fc24	Nightly Benchmarks: Increase timeout for pgbench-compare job (#4551 ) ## Problem In the test environment vacuum duration fluctuates from ~1h to ~5h, along with another two 1h benchmarks (`select-only` and `simple-update`) it could be up to 7h which is longer than 6h timeout. ## Summary of changes - Increase timeout for pgbench-compare job to 8h - Remove 6h timeouts from Nightly Benchmarks (this is a default value)	2023-06-23 12:47:37 +01:00
Vadim Kharitonov	d96d51a3b7	Update rust to 1.70.0 (#4550 )	2023-06-23 13:09:04 +02:00
Alex Chi Z	a010b2108a	pgserver: better template config file (#4554 ) * `compaction_threshold` should be an integer, not a string. * uncomment `[section]` so that if a user needs to modify the config, they can simply uncomment the corresponding line. Otherwise it's easy for us to forget uncommenting the `[section]` when uncommenting the config item we want to configure. Signed-off-by: Alex Chi <iskyzh@gmail.com>	2023-06-23 10:18:06 +03:00
Alek Westover	384e3ab1a8	fix code style	2023-06-22 15:10:08 -04:00
Alek Westover	4259464f72	fix typo	2023-06-22 15:07:28 -04:00
Alek Westover	152206211b	turn remote extensions off by default	2023-06-22 15:02:51 -04:00
Alek Westover	9c35c06c58	small refactor	2023-06-22 14:24:59 -04:00

1 2 3 4 5 ...

3426 Commits