rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-08 14:02:55 +00:00

Author	SHA1	Message	Date
Kirill Bulatov	d4e155aaa3	Librarify common etcd timeline logic	2022-05-06 22:32:57 +03:00
bojanserafimov	ef40e404cf	Rename zenith crate to neon_local (#1625 )	2022-05-05 19:06:53 -04:00
Thang Pham	c4bc604e5f	Fix pg list table alignment #1633 Fixes #1628 - add [`comfy_table`](https://github.com/Nukesor/comfy-table/tree/main) and use it to construct table for `pg list` CLI command Comparison - Old: ``` NODE ADDRESS TIMELINE BRANCH NAME LSN STATUS main 127.0.0.1:55432 3823dd05e35d71f6ccf33049de366d70 main 0/16FB140 running migration_check 127.0.0.1:55433 3823dd05e35d71f6ccf33049de366d70 main 0/16FB140 running ``` - New: ``` NODE ADDRESS TIMELINE BRANCH NAME LSN STATUS main 127.0.0.1:55432 3823dd05e35d71f6ccf33049de366d70 main 0/16FB140 running migration_check 127.0.0.1:55433 3823dd05e35d71f6ccf33049de366d70 main 0/16FB140 running ```	2022-05-04 12:12:26 -04:00
Stas Kelvich	0323bb5870	[proxy] Refactor cplane API and add new console SCRAM auth API Now proxy binary accepts `--auth-backend` CLI option, which determines auth scheme and cluster routing method. Following backends are currently implemented: * legacy old method, when username ends with `@zenith` it uses md5 auth dbname as the cluster name; otherwise, it sends a login link and waits for the console to call back * console new SCRAM-based console API; uses SNI info to select the destination cluster * postgres uses postgres to select auth secrets of existing roles. Useful for local testing * link sends login link for all usernames	2022-05-02 18:32:18 +03:00
Anastasia Lubennikova	5c5c3c64f3	Fix tenant config parsing. Add a test	2022-04-28 11:49:19 +03:00
Dmitry Rodionov	695b5f9d88	Remove obsolete failpoint in proxy When failpoint feature is disabled it throws away passed code so code inside is not guaranteed to compile when feature is disabled. In this particular case code is obsolete so removing it.	2022-04-27 14:34:33 +03:00
Kirill Bulatov	778744d35c	Limit concurrent S3 and IAM interactions	2022-04-26 13:49:37 +03:00
Dmitry Ivanov	d3f356e7a8	Update `rust-postgres` project-wide (#1525 ) * Update `rust-postgres` project-wide This commit points to https://github.com/neondatabase/rust-postgres/commits/neon in order to test our patches on top of the latest version of this crate. * [proxy] Update `hmac` and `sha2`	2022-04-22 17:31:58 +03:00
Heikki Linnakangas	a4700c9bbe	Use pprof to get flamegraph of get_page and get_relsize requests. This depends on a hacked version of the 'pprof-rs' crate. Because of that, it's under an optional 'profiling' feature. It is disabled by default, but enabled for release builds in CircleCI config. It doesn't currently work on macOS. The flamegraph is written to 'flamegraph.svg' in the pageserver workdir when the 'pageserver' process exits. Add a performance test that runs the perf_pgbench test, with profiling enabled.	2022-04-21 20:32:48 +03:00
Kirill Bulatov	81cad6277a	Move and library crates into a dedicated directory and rename them	2022-04-21 13:30:33 +03:00
Heikki Linnakangas	cbdfd8c719	Update 'routerify' dependency in proxy. routerify version 3 is used in zenith_utils, use the same version in proxy to avoid having to build two versions.	2022-04-20 14:42:05 +03:00
Heikki Linnakangas	86bf4301b7	Remove unnecessary dependency on 'webpki'	2022-04-20 14:36:54 +03:00
Heikki Linnakangas	9eaa21317c	Update jsonwebtoken crate. With this, we no longer need to build two versions of 'pem' and 'base64' crates. Introduces a duplicate version of 'time' crate, though, but it's still progress.	2022-04-20 14:27:49 +03:00
Heikki Linnakangas	e660e12f79	Update rustls-split and rustls versions. All dependencies now use rustls 0.20.2, so we no longer need to build two versions of it.	2022-04-20 14:07:55 +03:00
Kirill Bulatov	3e6087a12f	Remove S3 archiving	2022-04-19 23:13:52 +03:00
Kirill Bulatov	81417788c8	walkeeper -> safekeeper	2022-04-18 12:52:31 +03:00
Kirill Bulatov	0ca2bd929b	Remove log crate from pageserver	2022-04-18 00:00:36 +03:00
Dmitry Ivanov	ab20f2c491	Use the same version of `rust-postgres` everywhere. (#1516 ) Turns out we still had a stale dep in `compute_tools`.	2022-04-15 18:36:11 +03:00
Dmitry Ivanov	c9d897f9b6	[proxy] Update rustls (#1510 )	2022-04-15 12:06:25 +03:00
Dmitry Rodionov	2cb39a1624	add missing files, update workspace hack	2022-04-14 20:41:21 +03:00
Heikki Linnakangas	93e0ac2b7a	Remove a couple of unused dependencies. Found by "cargo-udeps"	2022-04-14 17:38:26 +03:00
Daniil	58d5136a61	compute_tools: check writability handler (#941 )	2022-04-13 17:16:25 +03:00
Dmitry Ivanov	4af87f3d60	[proxy] Add SCRAM auth mechanism implementation (#1050 ) * [proxy] Add SCRAM auth * [proxy] Implement some tests for SCRAM * Refactoring + test fixes * Hide SCRAM mechanism behind `#[cfg(test)]` Currently we only use it in tests, so we hide all relevant module behind `#[cfg(test)]` to prevent "unused item" warnings.	2022-04-13 03:00:32 +03:00
Kirill Bulatov	0e9ee772af	Use rusoto in safekeeper	2022-04-11 21:34:04 +03:00
Kirill Bulatov	db63fa64ae	Use rusoto lib for S3 relish_storage impl	2022-04-11 21:34:04 +03:00
Heikki Linnakangas	214567bf8f	Use B-tree for the index in image and delta layers. We now use a page cache for those, instead of slurping the whole index into memory. Fixes https://github.com/zenithdb/zenith/issues/1356 This is a backwards-incompatible change to the storage format, so bump STORAGE_FORMAT_VERSION.	2022-04-07 20:58:55 +03:00
Heikki Linnakangas	5d9851f5d1	Refactor the I/O functions. This introduces two new abstraction layers for I/O: - Block I/O, and - Blob I/O. The BlockReader trait abstracts a file or something else that can be read in 8kB pages. It is implemented by EphemeralFiles, and by a new FileBlockReader struct that allows reading arbitrary VirtualFiles in that manner, utilizing the page cache. There is also a new BlockCursor struct that works as a cursor over a BlockReader. When you create a BlockCursor and read the first page using it, it keeps the reference to the page. If you access the same page again, it avoids going to page cache and quickly returns the same page again. That can save a lot of lookups in the page cache if you perform multiple reads. The Blob-oriented API allows reading and writing "blobs" of arbitrary length. It is a layer on top of the block-oriented API. When you write a blob with the write_blob() function, it writes a length field followed by the actual data to the underlying block storage, and returns the offset where the blob was stored. The blob can be retrieved later using the offset. Finally, this replaces the I/O code in image-, delta-, and in-memory layers to use the new abstractions. These replace the 'bookfile' crate. This is a backwards-incompatible change to the storage format.	2022-04-07 20:58:54 +03:00
Dmitry Ivanov	f5da652388	[proxy] Enable keepalives for all tcp connections (#1448 )	2022-03-31 20:44:57 +03:00
Arseny Sher	ec3bc74165	Add safekeeper information exchange through etcd. Safekeers now publish to and pull from etcd per-timeline data. Immediate goal is WAL truncation, for which every safekeeper must know remote_consistent_lsn; the next would be callmemaybe replacement. Adds corresponding '--broker' argument to safekeeper and ability to run etcd in tests. Adds test checking remote_consistent_lsn is indeed communicated.	2022-03-29 18:16:49 +04:00
Dmitry Rodionov	eee0f51e0c	use cargo-hakari to manage workspace_hack crate workspace_hack is needed to avoid recompilation when different crates inside the workspace depend on the same packages but with different features being enabled. Problem occurs when you build crates separately one by one. So this is irrelevant to our CI setup because there we build all binaries at once, but it may be relevant for local development. this also changes cargo's resolver version to 2	2022-03-29 10:42:04 +03:00
Heikki Linnakangas	07342f7519	Major storage format rewrite. This is a backwards-incompatible change. The new pageserver cannot read repositories created with an old pageserver binary, or vice versa. Simplify Repository to a value-store ------------------------------------ Move the responsibility of tracking relation metadata, like which relations exist and what are their sizes, from Repository to a new module, pgdatadir_mapping.rs. The interface to Repository is now a simple key-value PUT/GET operations. It's still not any old key-value store though. A Repository is still responsible from handling branching, and every GET operation comes with an LSN. Mapping from Postgres data directory to keys/values --------------------------------------------------- All the data is now stored in the key-value store. The 'pgdatadir_mapping.rs' module handles mapping from PostgreSQL objects like relation pages and SLRUs, to key-value pairs. The key to the Repository key-value store is a Key struct, which consists of a few integer fields. It's wide enough to store a full RelFileNode, fork and block number, and to distinguish those from metadata keys. 'pgdatadir_mapping.rs' is also responsible for maintaining a "partitioning" of the keyspace. Partitioning means splitting the keyspace so that each partition holds a roughly equal number of keys. The partitioning is used when new image layer files are created, so that each image layer file is roughly the same size. The partitioning is also responsible for reclaiming space used by deleted keys. The Repository implementation doesn't have any explicit support for deleting keys. Instead, the deleted keys are simply omitted from the partitioning, and when a new image layer is created, the omitted keys are not copied over to the new image layer. We might want to implement tombstone keys in the future, to reclaim space faster, but this will work for now. Changes to low-level layer file code ------------------------------------ The concept of a "segment" is gone. Each layer file can now store an arbitrary range of Keys. Checkpointing, compaction ------------------------- The background tasks are somewhat different now. Whenever checkpoint_distance is reached, the WAL receiver thread "freezes" the current in-memory layer, and creates a new one. This is a quick operation and doesn't perform any I/O yet. It then launches a background "layer flushing thread" to write the frozen layer to disk, as a new L0 delta layer. This mechanism takes care of durability. It replaces the checkpointing thread. Compaction is a new background operation that takes a bunch of L0 delta layers, and reshuffles the data in them. It runs in a separate compaction thread. Deployment ---------- This also contains changes to the ansible scripts that enable having multiple different pageservers running at the same time in the staging environment. We will use that to keep an old version of the pageserver running, for clusters created with the old version, at the same time with a new pageserver with the new binary. Author: Heikki Linnakangas Author: Konstantin Knizhnik <knizhnik@zenith.tech> Author: Andrey Taranik <andrey@zenith.tech> Reviewed-by: Matthias Van De Meent <matthias@zenith.tech> Reviewed-by: Bojan Serafimov <bojan@zenith.tech> Reviewed-by: Konstantin Knizhnik <knizhnik@zenith.tech> Reviewed-by: Anton Shyrabokau <antons@zenith.tech> Reviewed-by: Dhammika Pathirana <dham@zenith.tech> Reviewed-by: Kirill Bulatov <kirill@zenith.tech> Reviewed-by: Anastasia Lubennikova <anastasia@zenith.tech> Reviewed-by: Alexey Kondratov <alexey@zenith.tech>	2022-03-28 05:41:15 -05:00
Dmitry Rodionov	8b8d78a3a0	use main branch of our bookfile crate	2022-03-23 22:05:43 +04:00
Kirill Bulatov	063f9ba81d	Use serde_with to (de)serialize ZId and Lsn to hex	2022-03-21 12:46:07 +02:00
Dmitry Ivanov	705f51db27	[proxy] Propagate some errors to user (#1329 ) * [proxy] Propagate most errors to user This change enables propagation of most errors to the user (e.g. auth and connectivity errors). Some of them will be stripped of sensitive information. As a side effect, most occurrences of `anyhow::Error` were replaced with concrete error types. * [proxy] Box weighty errors	2022-03-16 21:20:04 +03:00
Heikki Linnakangas	9c1a9a1d9f	Update Cargo.lock for new dependencies (#1354 ) Commit `b2ad8342d2` added dependency on 'criterion', which pulled along some other crates.	2022-03-14 21:06:25 +03:00
Arseny Sher	f86cf93435	Refactor timeline creation on safekeepers, allowing storing peer ids. Have separate routine and http endpoint to create timeline on safekeepers. It is not used yet, i.e. timeline is still created implicitly, but we'll change that once infrastructure for learning which tlis are assigned to which safekeepers will be ready, preventing accidental creation by compute. Changes format of safekeeper control file, allowing to store set of peers. Knowing peers provides a part of foundation for peer recovery (calculating min horizons like truncate_lsn for WAL truncation and commit_lsn for sync-safekeepers replacement) and proper membership change; similarly, we don't yet use it for now. Employing cf file version bump, extracts tenant_id and timeline_id to top level where it is more suitable. Also adds a bunch of LSNs there and rename truncate_lsn to more specific peer_horizon_lsn.	2022-03-06 08:06:38 +03:00
anastasia	1a4682a04a	Add 'walreceiver-after-ingest' failpoint. Use sleep at this point to imitate slow walreceiver.	2022-02-22 13:56:21 +03:00
Dmitry Ivanov	a47dade622	[proxy] Migrate to async This change makes most parts of the code asynchronous, except for the `mgmt` subsystem (we're going to drop it anyway). Co-authored-by: bojanserafimov <bojan.serafimov7@gmail.com>	2022-02-17 11:54:27 +03:00
Kirill Bulatov	6eef401602	Move routerify behind zenith_utils	2022-02-10 08:33:22 -05:00
Kirill Bulatov	c5b5905ed3	Remove parking_lot dependency from workspace	2022-02-10 08:33:22 -05:00
Kirill Bulatov	76b74349cb	Bump pageserver dependencies	2022-02-10 08:33:22 -05:00
Dmitry Ivanov	c2927353a5	Enable async deserialization of FeMessage Now it's possible to call Fe{Startup,}Message in both sync and async contexts, which is good for proxy. Co-authored-by: bojanserafimov <bojan.serafimov7@gmail.com>	2022-01-28 19:40:37 +03:00
anastasia	5abe2129c6	Extend replication protocol with ZentihFeedback message to pass current_timeline_size to compute node Put standby_status_update fields into ZenithFeedback and send them as one message. Pass values sizes together with keys in ZenithFeedback message.	2022-01-27 11:20:45 +03:00
Alexey Kondratov	06c28174c2	Integrate compute_tools into zenith workspace and improve logging (zenithdb/console#487 )	2022-01-18 18:47:31 +03:00
Heikki Linnakangas	adb0b3dada	Include backtrace in error messages in the log. 'anyhow' crate can include a backtrace in all errors, when the 'backtrace' feature is enabled. Enable it, and change the places that used '{:#}' or '{}' to '{:?}', so that the backtrace is printed.	2022-01-14 10:10:17 +02:00
bojanserafimov	5e0f39cc9e	Add proxy metrics (#1093 )	2022-01-13 20:34:30 -05:00
bojanserafimov	5b9391b51d	Support "query cancel" in proxy (#1052 )	2022-01-05 17:27:12 -05:00
Patrick Insinger	24c8dab86f	pageserver - parallelize checkpoint fsyncs	2022-01-04 20:40:57 -08:00
Dmitry Rodionov	c910132d4b	Fix wal receiver shutdown This patch allows to shutdown wal receiver when there are no messages and wal receiver is blocked inside tokio-postgres. In this case it cannot check the shutdown flag. This patch switches to use async interface of tokio-postgres directly without sync wrappers. It opens the possibility to use tokio::select! between the phsycal_stream.next() and a shutdown channel readiness to interrupt replication process. Also this allows to shutdown only particular wal receiver without using global shutdown_requested flag.	2021-12-29 14:42:29 +03:00
bojanserafimov	b807570f46	Use parking_lot::Mutex instead of std::Mutex in walreceiver (#1045 )	2021-12-23 14:25:44 -05:00

1 2 3 4

174 Commits