rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-17 05:00:38 +00:00

Author	SHA1	Message	Date
Heikki Linnakangas	aaa60c92ca	Use u64/i64 for logical size, comment on why to use signed i64. usize/isize type corresponds to the CPU architecture's pointer width, i.e. 64 bits on a 64-bit platform and 32 bits on a 32-bit platform. The logical size of a database has nothing to do with the that, so u64/i64 is more appropriate. It doesn't make any difference in practice as long as you're on a 64-bit platform, and it's hard to imagine anyone wanting to run the pageserver on a 32-bit platform, but let's be tidy. Also add a comment on why we use signed i64 for the logical size variable, even though size should never be negative. I'm not sure the reasons are very good, but at least this documents them, and hints at some possible better solutions.	2022-08-19 16:44:16 +03:00
Kirill Bulatov	c19b4a65f9	Remove Repository trait, rename LayeredRepository struct into Repository	2022-08-19 16:40:37 +03:00
Kirill Bulatov	8043612334	Remove Timeline trait, rename LayeredTimeline struct into Timeline	2022-08-19 16:40:37 +03:00
Kirill Bulatov	3b819ee159	Remove extra type aliases (#2280 )	2022-08-17 17:51:53 +03:00
Kirill Bulatov	648e8bbefe	Fix 1.63 clippy lints (#2282 )	2022-08-16 18:49:22 +03:00
Arseny Sher	e593cbaaba	Add pageserver checkpoint_timeout option. To flush inmemory layer eventually when no new data arrives, which helps safekeepers to suspend activity (stop pushing to the broker). Default 10m should be ok.	2022-08-11 22:54:09 +03:00
Heikki Linnakangas	d0494c391a	Remove wal_receiver mgmt API endpoint Move all the fields that were returned by the wal_receiver endpoint into timeline_detail. Internally, move those fields from the separate global WAL_RECEIVERS hash into the LayeredTimeline struct. That way, all the information about a timeline is kept in one place. In the passing, I noted that the 'thread_id' field was removed from WalReceiverEntry in commit `e5cb727572`, but it forgot to update openapi_spec.yml. This commit removes that too.	2022-07-29 20:51:37 +03:00
Heikki Linnakangas	02afa2762c	Move Tenant- and TimelineInfo structs to models.rs. They are part of the management API response structs. Let's try to concentrate everything that's part of the API in models.rs.	2022-07-29 15:02:15 +03:00
Heikki Linnakangas	d903dd61bd	Rename 'wal_producer_connstr' to 'wal_source_connstr'. What the WAL receiver really connects to is the safekeeper. The "producer" term is a bit misleading, as the safekeeper doesn't produce the WAL, the compute node does. This change also applies to the name of the field used in the mgmt API in in the response of the '/v1/tenant/:tenant_id/timeline/:timeline_id/wal_receiver' endpoint. AFAICS that's not used anywhere else than one python test, so it should be OK to change it.	2022-07-29 09:09:22 +03:00
Thang Pham	417d9e9db2	Add current physical size to tenant status endpoint (#2173 ) Ref #1902	2022-07-28 13:59:20 -04:00
Thang Pham	6a664629fa	Add timeline physical size tracking (#2126 ) Ref #1902. - Track the layered timeline's `physical_size` using `pageserver_current_physical_size` metric when updating the layer map. - Report the local timeline's `physical_size` in timeline GET APIs. - Add `include-non-incremental-physical-size` URL flag to also report the local timeline's `physical_size_non_incremental` (similar to `logical_size_non_incremental`) - Add a `UIntGaugeVec` and `UIntGauge` to represent `u64` prometheus metrics Co-authored-by: Dmitry Rodionov <dmitry@neon.tech>	2022-07-27 12:36:46 -04:00
Dmitry Rodionov	520ffb341b	fix pageserver openapi spec	2022-07-07 21:20:04 +03:00
Dmitry Rodionov	9f2b40645d	review cleanup, point timeline/detach to timeline/delete	2022-07-07 21:20:04 +03:00
Dmitry Rodionov	168214e0b6	use tenant status endpoint to check whether timelines were downloaded or not	2022-07-07 21:20:04 +03:00
Dmitry Rodionov	d9d4ef12c3	review cleanup	2022-07-07 21:20:04 +03:00
Dmitry Rodionov	e1e24336b7	review adjustments, bring back timeline_detach and rename it to timeline_delete	2022-07-07 21:20:04 +03:00
Dmitry Rodionov	4c54e4b37d	switch to per-tenant attach/detach download operations of all timelines for one tenant are now grouped together so when attach is invoked pageserver downloads all of them and registers them in a single apply_sync_status_update call so branches can be used safely with attach/detach	2022-07-07 21:20:04 +03:00
Kirill Bulatov	e5cb727572	Replace callmemaybe with etcd subscriptions on safekeeper timeline info	2022-06-01 16:07:04 +03:00
chaitanya sharma	c584d90bb9	initial commit, renamed znodeid to nodeid.	2022-05-25 20:11:26 +03:00
Thang Pham	e4a70faa08	Add more information to timeline-related APIs (#1673 ) Resolves #1488. - implemented `GET tenant/:tenant_id/timeline/:timeline_id/wal_receiver` endpoint - returned `thread_id` in `thread_mgr::spawn` - added `latest_gc_cutoff_lsn` field to `LocalTimelineInfo` struct	2022-05-16 11:05:43 -04:00
Kirill Bulatov	10e4da3997	Rework timeline batching	2022-05-07 01:33:33 +03:00
Kirill Bulatov	de37f982db	Share the remote storage as a crate	2022-05-07 00:30:36 +03:00
Kirill Bulatov	52a7e3155e	Add local path to the Layer trait and historic layers	2022-05-05 18:26:39 +03:00
Dhammika Pathirana	f3f12db2cb	Add gc churn threshold knob (#1594 ) Signed-off-by: Dhammika Pathirana <dhammika@gmail.com>	2022-05-01 13:13:17 -07:00
Dmitry Rodionov	05f8e6a050	Use fsync+rename for atomic downloads from remote storage Use failpoint in test_remote_storage to check the behavior	2022-04-29 15:53:56 +03:00
Kirill Bulatov	2911eb084a	Remove timeline files on detach	2022-04-29 09:19:18 +03:00
Kirill Bulatov	6cca57f95a	Properly remove from the local timeline map	2022-04-29 09:19:18 +03:00
Kirill Bulatov	4a46b01caf	Properly populate local timeline map	2022-04-29 09:19:18 +03:00
Kirill Bulatov	fec050ce97	Fix macos clippy issues	2022-04-25 16:23:34 +03:00
Konstantin Knizhnik	5f83c9290b	Make it possible to specify per-tenant configuration parameters Add tenant config API and 'zenith tenant config' CLI command. Add 'show' query to pageserver protocol for tenantspecific config parameters Refactoring: move tenant_config code to a separate module. Save tenant conf file to tenant's directory, when tenant is created to recover it on pageserver restart. Ignore error during tenant config loading, while it is not supported by console Define PiTR interval for GC. refer #1320	2022-04-22 11:24:29 +03:00
Kirill Bulatov	81cad6277a	Move and library crates into a dedicated directory and rename them	2022-04-21 13:30:33 +03:00
Kirill Bulatov	3e6087a12f	Remove S3 archiving	2022-04-19 23:13:52 +03:00
Kirill Bulatov	dc7e3ff05a	Fix rustc 1.60 clippy warnings	2022-04-11 21:34:04 +03:00
Alexey Kondratov	d0c246ac3c	Update pageserver OpenAPI spec with missing attach/detach methods (#1463 ) We have these methods for some time in the API, so mentioning them in the spec could be useful for console (see zenithdb/console#867), as we generate pageserver HTTP API golang client there.	2022-04-05 20:01:57 +03:00
Heikki Linnakangas	07342f7519	Major storage format rewrite. This is a backwards-incompatible change. The new pageserver cannot read repositories created with an old pageserver binary, or vice versa. Simplify Repository to a value-store ------------------------------------ Move the responsibility of tracking relation metadata, like which relations exist and what are their sizes, from Repository to a new module, pgdatadir_mapping.rs. The interface to Repository is now a simple key-value PUT/GET operations. It's still not any old key-value store though. A Repository is still responsible from handling branching, and every GET operation comes with an LSN. Mapping from Postgres data directory to keys/values --------------------------------------------------- All the data is now stored in the key-value store. The 'pgdatadir_mapping.rs' module handles mapping from PostgreSQL objects like relation pages and SLRUs, to key-value pairs. The key to the Repository key-value store is a Key struct, which consists of a few integer fields. It's wide enough to store a full RelFileNode, fork and block number, and to distinguish those from metadata keys. 'pgdatadir_mapping.rs' is also responsible for maintaining a "partitioning" of the keyspace. Partitioning means splitting the keyspace so that each partition holds a roughly equal number of keys. The partitioning is used when new image layer files are created, so that each image layer file is roughly the same size. The partitioning is also responsible for reclaiming space used by deleted keys. The Repository implementation doesn't have any explicit support for deleting keys. Instead, the deleted keys are simply omitted from the partitioning, and when a new image layer is created, the omitted keys are not copied over to the new image layer. We might want to implement tombstone keys in the future, to reclaim space faster, but this will work for now. Changes to low-level layer file code ------------------------------------ The concept of a "segment" is gone. Each layer file can now store an arbitrary range of Keys. Checkpointing, compaction ------------------------- The background tasks are somewhat different now. Whenever checkpoint_distance is reached, the WAL receiver thread "freezes" the current in-memory layer, and creates a new one. This is a quick operation and doesn't perform any I/O yet. It then launches a background "layer flushing thread" to write the frozen layer to disk, as a new L0 delta layer. This mechanism takes care of durability. It replaces the checkpointing thread. Compaction is a new background operation that takes a bunch of L0 delta layers, and reshuffles the data in them. It runs in a separate compaction thread. Deployment ---------- This also contains changes to the ansible scripts that enable having multiple different pageservers running at the same time in the staging environment. We will use that to keep an old version of the pageserver running, for clusters created with the old version, at the same time with a new pageserver with the new binary. Author: Heikki Linnakangas Author: Konstantin Knizhnik <knizhnik@zenith.tech> Author: Andrey Taranik <andrey@zenith.tech> Reviewed-by: Matthias Van De Meent <matthias@zenith.tech> Reviewed-by: Bojan Serafimov <bojan@zenith.tech> Reviewed-by: Konstantin Knizhnik <knizhnik@zenith.tech> Reviewed-by: Anton Shyrabokau <antons@zenith.tech> Reviewed-by: Dhammika Pathirana <dham@zenith.tech> Reviewed-by: Kirill Bulatov <kirill@zenith.tech> Reviewed-by: Anastasia Lubennikova <anastasia@zenith.tech> Reviewed-by: Alexey Kondratov <alexey@zenith.tech>	2022-03-28 05:41:15 -05:00
Kirill Bulatov	55de0b88f5	Hide remote timeline index access details	2022-03-28 12:36:01 +03:00
Kirill Bulatov	bd6bef468c	Provide single list timelines HTTP API handle	2022-03-21 13:42:21 +02:00
Kirill Bulatov	063f9ba81d	Use serde_with to (de)serialize ZId and Lsn to hex	2022-03-21 12:46:07 +02:00
Dmitry Rodionov	7738254f83	refactor timeline memory state management	2022-03-18 18:14:57 +03:00
Kirill Bulatov	093ad8ab59	Send 409 HTTP responses on timeline and tenant creation for existing entity	2022-03-10 19:38:58 +02:00
Kirill Bulatov	c51d545fd9	Serialize Lsn as strings in http api	2022-03-10 19:38:58 +02:00
Kirill Bulatov	fe6fccfdae	Allow already existing repo when creating a tenant	2022-03-10 19:38:58 +02:00
Kirill Bulatov	dd74c66ef0	Do not create timeline along with tenant	2022-03-10 19:38:58 +02:00
Kirill Bulatov	a5e10c4f64	Tidy up pageserver's endpoints	2022-03-10 19:38:58 +02:00
Kirill Bulatov	c7569dce47	Allow passing initial timeline id into zenith CLI commands	2022-03-10 19:38:58 +02:00
Kirill Bulatov	f49990ed43	Allow creating timelines by branching off ancestors	2022-03-10 19:38:58 +02:00
Kirill Bulatov	0c91091c63	Avoid point in time concept on pageserver level	2022-03-10 19:38:58 +02:00
Kirill Bulatov	10f811e886	Use `timeline` instead of `branch` in pageserver's API	2022-03-10 19:38:58 +02:00
Kirill Bulatov	9424bfae22	Use a separate newtype for ZId that (de)serialize as hex strings	2022-03-04 10:58:40 +02:00
Dmitry Rodionov	1d90b1b205	add node id to pageserver (#1310 ) * Add --id argument to safekeeper setting its unique u64 id. In preparation for storage node messaging. IDs are supposed to be monotonically assigned by the console. In tests it is issued by ZenithEnv; at the zenith cli level and fixtures, string name is completely replaced by integer id. Example TOML configs are adjusted accordingly. Sequential ids are chosen over Zid mainly because they are compact and easy to type/remember. * add node id to pageserver This adds node id parameter to pageserver configuration. Also I use a simple builder to construct pageserver config struct to avoid setting node id to some temporary invalid value. Some of the changes in test fixtures are needed to split init and start operations for envrionment. Co-authored-by: Arseny Sher <sher-ars@yandex.ru>	2022-03-04 01:10:42 +03:00

1 2

70 Commits