rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-07-06 21:50:37 +00:00

Author	SHA1	Message	Date
Anastasia Lubennikova	1468c65ffb	Enable billing metric_collection_endpoint on staging	2022-12-23 18:04:45 +02:00
Kirill Bulatov	b77c33ee06	Move tenant-related modules below `tenant` module (#3190 ) No real code changes besides moving code around and adjusting the imports.	2022-12-23 15:40:37 +02:00
Kirill Bulatov	0bafb2a6c7	Do more on-demand downloads where needed (#3194 ) The PR aims to fix two missing redownloads in a flacky test_remote_storage_upload_queue_retries[local_fs] ([example](https://neon-github-public-dev.s3.amazonaws.com/reports/pr-3190/release/3759194738/index.html#categories/80f1dcdd7c08252126be7e9f44fe84e6/8a70800f7ab13620/)) 1. missing redownload during walreceiver work ``` 2022-12-22T16:09:51.509891Z ERROR wal_connection_manager{tenant=fb62b97553e40f949de8bdeab7f93563 timeline=4f153bf6a58fd63832f6ee175638d049}: wal receiver task finished with an error: walreceiver connection handling failure Caused by: Layer needs downloading Stack backtrace: 0: pageserver::tenant::timeline::PageReconstructResult<T>::no_ondemand_download at /__w/neon/neon/pageserver/src/tenant/timeline.rs:467:59 1: pageserver::walingest::WalIngest::new at /__w/neon/neon/pageserver/src/walingest.rs:61:32 2: pageserver::walreceiver::walreceiver_connection::handle_walreceiver_connection::{{closure}} at /__w/neon/neon/pageserver/src/walreceiver/walreceiver_connection.rs:178:25 .... ``` That looks sad, but inevitable during the current approach: seems that we need to wait for old layers to arrive in order to accept new data. For that, `WalIngest::new` now started to return the `PageReconstructResult`. Sync methods from `import_datadir.rs` use `WalIngest::new` too, but both of them import WAL during timeline creation, so no layers to download are needed there, ergo the `PageReconstructResult` is converted to `anyhow::Result` with `no_ondemand_download`. 2. missing redownload during compaction work ``` 2022-12-22T16:09:51.090296Z ERROR compaction_loop{tenant_id=fb62b97553e40f949de8bdeab7f93563}:compact_timeline{timeline=4f153bf6a58fd63832f6ee175638d049}: could not compact, repartitioning keyspace failed: Layer needs downloading Stack backtrace: 0: pageserver::tenant::timeline::PageReconstructResult<T>::no_ondemand_download at /__w/neon/neon/pageserver/src/tenant/timeline.rs:467:59 1: pageserver::pgdatadir_mapping::<impl pageserver::tenant::timeline::Timeline>::collect_keyspace::{{closure}} at /__w/neon/neon/pageserver/src/pgdatadir_mapping.rs:506:41 <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll at /rustc/e092d0b6b43f2de967af0887873151bb1c0b18d3/library/core/src/future/mod.rs:91:19 pageserver::tenant::timeline::Timeline::repartition::{{closure}} at /__w/neon/neon/pageserver/src/tenant/timeline.rs:2161:50 <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll at /rustc/e092d0b6b43f2de967af0887873151bb1c0b18d3/library/core/src/future/mod.rs:91:19 2: pageserver::tenant::timeline::Timeline::compact::{{closure}} at /__w/neon/neon/pageserver/src/tenant/timeline.rs:700:14 <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll at /rustc/e092d0b6b43f2de967af0887873151bb1c0b18d3/library/core/src/future/mod.rs:91:19 3: <tracing::instrument::Instrumented<T> as core::future::future::Future>::poll at /github/home/.cargo/registry/src/github.com-1ecc6299db9ec823/tracing-0.1.37/src/instrument.rs:272:9 4: pageserver::tenant::Tenant::compaction_iteration::{{closure}} at /__w/neon/neon/pageserver/src/tenant.rs:1232:85 <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll at /rustc/e092d0b6b43f2de967af0887873151bb1c0b18d3/library/core/src/future/mod.rs:91:19 pageserver::tenant_tasks::compaction_loop::{{closure}}::{{closure}} at /__w/neon/neon/pageserver/src/tenant_tasks.rs:76:62 <core::future::from_generator::GenFuture<T> as core::future::future::Future>::poll at /rustc/e092d0b6b43f2de967af0887873151bb1c0b18d3/library/core/src/future/mod.rs:91:19 pageserver::tenant_tasks::compaction_loop::{{closure}} at /__w/neon/neon/pageserver/src/tenant_tasks.rs:91:6 ```	2022-12-23 15:39:59 +02:00
Sergey Melnikov	c01f92c081	Fully remove old staging deploy (#3191 )	2022-12-22 20:09:45 +01:00
Sergey Melnikov	7bc17b373e	Fix calculate-deploy-targets (#3189 ) Was broken in https://github.com/neondatabase/neon/pull/3180	2022-12-22 16:28:36 +01:00
Arthur Petukhovsky	72ab104733	Move zenith-1-sk-3 to zenith-1-sk-4 (#3164 )	2022-12-22 15:21:53 +00:00
Sergey Melnikov	5a496d82b0	Do not deploy storage and proxies to old staging (#3180 ) We fully migrated out, this nodes will be soon decommissioned	2022-12-22 15:37:17 +01:00
Anastasia Lubennikova	8544c59329	Fix flaky test_metrics_collection.py Only check that all metrics are present on the first request, because pageserver doesn't send unchanged metrics.	2022-12-22 16:28:11 +02:00
Anastasia Lubennikova	63eb87bde3	Set default metric_collection_interval to 10 min, which is more reasonable for real usage	2022-12-22 16:24:09 +02:00
Vadim Kharitonov	9b71215906	Simplify some functions in compute_tools and fix typo errors in func name	2022-12-22 15:05:43 +01:00
Stas Kelvich	5a762744c7	Collect core dump backtraces in compute_ctl. Scan core dumps directory on exit. In case of existing core dumps call gdb/lldb to get a backtrace and log it. By default look for core dumps in postgres data directory as core.<pid>. That is how core collection is configured in our k8s nodes (and a reasonable convention in general).	2022-12-22 16:01:49 +02:00
Alexander Bayandin	201fedd65c	tpch-compare: use rust image instead of rustlegacy (#3182 )	2022-12-22 12:40:39 +00:00
Sergey Melnikov	707d1c1c94	Fix vm-compute-image upload to dockerhub (#3181 )	2022-12-22 13:34:16 +01:00
Kirill Bulatov	fca25edae8	Fix 1.66 Clippy warnings (#3178 ) 1.66 release speeds up compile times for over 10% according to tests. Also its Clippy finds plenty of old nits in our code: * useless conversion, `foo as u8` where `foo: u8` and similar, removed `as u8` and similar * useless references and dereferenced (that were automatically adjusted by the compiler), removed various `&` and `` bool -> u8 conversion via `if/else`, changed to `u8::from` * Map `.iter()` calls where only values were used, changed to `.values()` instead Standing out lints: * `Eq` is missing in our protoc generated structs. Silenced, does not seem crucial for us. * `fn default` looks like the one from `Default` trait, so I've implemented that instead and replaced the `dummy_` method in tests with `::default()` invocation Clippy detected that ``` if retry_attempt < u32::MAX { retry_attempt += 1; } ``` is a saturating add and proposed to replace it.	2022-12-22 14:27:48 +02:00
Sergey Melnikov	f5f1197e15	Build vm-compute-node images (#3174 )	2022-12-22 11:25:56 +01:00
Heikki Linnakangas	7ff591ffbf	On-Demand Download The code in this change was extracted from #2595 (Heikki’s on-demand download draft PR). High-Level Changes - New RemoteLayer Type - On-Demand Download As An Effect Of Page Reconstruction - Breaking Semantics For Physical Size Metrics There are several follow-up work items planned. Refer to the Epic issue on GitHub: https://github.com/neondatabase/neon/issues/2029 closes https://github.com/neondatabase/neon/pull/3013 Co-authored-by: Kirill Bulatov <kirill@neon.tech> Co-authored-by: Christian Schwarz <christian@neon.tech> New RemoteLayer Type ==================== Instead of downloading all layers during tenant attach, we create RemoteLayer instances for each of them and add them to the layer map. On-Demand Download As An Effect Of Page Reconstruction ====================================================== At the heart of pageserver is Timeline::get_reconstruct_data(). It traverses the layer map until it has collected all the data it needs to produce the page image. Most code in the code base uses it, though many layers of indirection. Before this patch, the function would use synchronous filesystem IO to load data from disk-resident layer files if the data was not cached. That is not possible with RemoteLayer, because the layer file has not been downloaded yet. So, we do the download when get_reconstruct_data gets there, i.e., “on demand”. The mechanics of how the download is done are rather involved, because of the infamous async-sync-async sandwich problem that plagues the async Rust world. We use the new PageReconstructResult type to work around this. Its introduction is the cause for a good amount of code churn in this patch. Refer to the block comment on `with_ondemand_download()` for details. Breaking Semantics For Physical Size Metrics ============================================ We rename prometheus metric pageserver_{current,resident}_physical_size to reflect what this metric actually represents with on-demand download. This intentionally BREAKS existing grafana dashboard and the cost model data pipeline. Breaking is desirable because the meaning of this metrics has changed with on-demand download. See https://docs.google.com/document/d/12AFpvKY-7FZdR5a4CaD6Ir_rI3QokdCLSPJ6upHxJBo/edit# for how we will handle this breakage. Likewise, we rename the new billing_metrics’s PhysicalSize => ResidentSize. This is not yet used anywhere, so, this is not a breaking change. There is still a field called TimelineInfo::current_physical_size. It is now the sum of the layer sizes in layer map, regardless of whether local or remote. To compute that sum, we added a new trait method PersistentLayer::file_size(). When updating the Python tests, we got rid of current_physical_size_non_incremental. An earlier commit removed it from the OpenAPI spec already, so this is not a breaking change. test_timeline_size.py has grown additional assertions on the resident_physical_size metric.	2022-12-21 19:16:39 +01:00
Christian Schwarz	31543c4acc	refactor: make update_gc_info and transitive callers async This is so that in the next commit, we can add a retry_get to find_lsn_for_timestamp.	2022-12-21 19:16:39 +01:00
Christian Schwarz	1da03141a7	refactor: make Layer::local_path return Option<PathBuf> instead of PathBuf This is in preparation for RemoteLayer, which by definition doesn't have a local path.	2022-12-21 19:16:39 +01:00
Christian Schwarz	2460987328	no-op: pgdatadir_mapping: qualified use of anyhow::Result	2022-12-21 19:16:39 +01:00
Christian Schwarz	749a2f00d7	no-op: distinguished error types for Timeline::get_reconstruct_data	2022-12-21 19:16:39 +01:00
Christian Schwarz	e94b451430	no-op: storage_layer::Iter::{iter, key_iter}: make them fallible	2022-12-21 19:16:39 +01:00
Christian Schwarz	f5b424b96c	no-op: type aliases for Layer::iter and Layer::key_iter return types Not needed by anything right now, but the next commit adds a `Result<>` around iter() and key_iter()'s return types, and that makes clippy complain.	2022-12-21 19:16:39 +01:00
Christian Schwarz	91e8937112	no-op: add Timeline::myself member	2022-12-21 19:16:39 +01:00
Christian Schwarz	f637f6e77e	stop exposing non-incremental sizes in API spec Console doesn't use them, so, don't expose them. refs https://github.com/neondatabase/cloud/pull/3358 refs https://github.com/neondatabase/cloud/pull/3366	2022-12-21 15:37:29 +01:00
Christian Schwarz	a3f0111726	LayerMap::search is actually infallible Found this while investigating failure modes of on-demand download. I think it's a nice cleanup.	2022-12-21 12:14:55 +01:00
Alexander Bayandin	486a985629	mypy: enable check_untyped_defs (#3142 ) Enable `check_untyped_defs` and fix warnings.	2022-12-21 09:38:42 +00:00
Kirill Bulatov	5d4774491f	Exclude macOs fork files from tar processing (#3165 ) When running tenant relocation tests, we use https://github.com/neondatabase/neon/blob/main/scripts/export_import_between_pageservers.py script to export and import basebackup between pageservers. When pageserver runs on macOs and reuses the `tar` library for creating the basebackup archive, it gets the fork files https://superuser.com/questions/61185/why-do-i-get-files-like-foo-in-my-tarball-on-os-x We might be able to fix our code to fix the issue, but if we get such (valid) archive as an input, we [fail](https://github.com/neondatabase/neon/pull/3013#issuecomment-1360093900). This does not seem optimal, given that we can ignore such files.	2022-12-21 00:52:07 +02:00
Anastasia Lubennikova	4235f97c6a	Implement consumption metrics collection. Add new background job to collect billing metrics for each tenant and send them to the HTTP endpoint. Metrics are cached, so we don't send non-changed metrics. Add metric collection config parameters: metric_collection_endpoint (default None, i.e. disabled) metric_collection_interval (default 60s) Add test_metric_collection.py to test metric collection and sending to the mocked HTTP endpoint. Use port distributor in metric_collection test review fixes: only update cache after metrics were send successfully, simplify code disable metric collection if metric_collection_endpoint is not provided in config	2022-12-20 22:59:52 +02:00
Heikki Linnakangas	43fd89eaa7	Improve comments, formatting around layer_removal_cs lock.	2022-12-20 20:50:55 +02:00
Heikki Linnakangas	9a049aa846	Move code from tenant_mgr::delete_timeline to Tenant::delete_timeline. It's better to request the tasks to shut down only after setting the timeline state to Stopping. Otherwise, it's possible that a new task spawns after we have waited for the existing tasks to shut down, but before we have changed the state. We would fail to wait for them. Feels nicer from a readability point of view too.	2022-12-20 20:50:55 +02:00
Kirill Bulatov	0c71dc627b	Tidy up walreceiver logs (#3147 ) Closes https://github.com/neondatabase/neon/issues/3114 Improves walrecevier logs and remove `clone()` calls.	2022-12-20 15:54:02 +02:00
Heikki Linnakangas	8e2edfcf39	Retry remote downloads. Remote operations fail sometimes due to network failures or other external reasons. Add retry logic to all the remote downloads, so that a transient failure at pageserver startup or tenant attach doesn't cause the whole tenant to be marked as Broken. Like in the uploads retry logic, we print the failure to the log as a WARNing after three retries, but keep retrying. We will retry up to 10 times now, before returning the error to the caller. To test the retries, I created a new RemoteStorage wrapper that simulates failures, by returning an error for the first N times that a remote operation is performed. It can be enabled by setting a new "test_remote_failures" option in the pageserver config file. Fixes #3112	2022-12-20 14:27:24 +02:00
Heikki Linnakangas	4cda9919bf	Use Self to emphasize this is a constructor	2022-12-20 14:27:24 +02:00
Heikki Linnakangas	eefb1d46f4	Replace Timeline::checkpoint with Timeline::freeze_and_flush The new Timeline::freeze_and_flush function is equivalent to calling Timeline::checkpoint(CheckpointConfig::Flush). There were only one non-test caller that used CheckpointConfig::Forced, so replace that with a call to the new Timeline::freeze_and_flush, followed by an explicit call to Timeline::compact. That only caller was to handle the mgmt API's 'checkpoint' endpoint. Perhaps we should split that into separate 'flush' and 'compact' endpoints too, but I didn't go that far yet.	2022-12-20 13:45:47 +02:00
Kirill Bulatov	2c11f1fa95	Use separate broker per Python test (#3158 ) And add its logs to Allure reports per test	2022-12-20 11:06:21 +00:00
Sergey Melnikov	cd7fdf2587	Remove neon-stress configs (#3121 )	2022-12-20 12:03:42 +01:00
Heikki Linnakangas	7b0d28bbdc	Update outdated comment on Tenant::gc_iteration. Commit `6dec85b19d` remove the `checkpoint_before_gc` argument, but failed to update the comment. Remove its description, and while we're at it, try to explain better how the `horizon` and `pitr` arguments are used.	2022-12-20 12:52:26 +02:00
Heikki Linnakangas	6ac9ecb074	Remove a few unnecessary checkpoint calls from unit tests. The `make_some_layers' function performs a checkpoint already.	2022-12-20 12:52:26 +02:00
Kirill Bulatov	56d8c25dc8	Revert "Use local brokers" This reverts commit `f9f57e211a`.	2022-12-20 01:57:36 +02:00
Kirill Bulatov	f9f57e211a	Use local brokers	2022-12-20 01:55:59 +02:00
Heikki Linnakangas	39f58038d1	Don't upload index file in compaction, if there was nothing to do. (#3149 ) This splits the storage_sync2::schedule_index_file into two (public) functions: 1. `schedule_index_upload_for_metadata_update`, for when the metadata (e.g. disk_consistent_lsn or last_gc_cutoff) has changed, and 2. `schedule_index_upload_for_file_changes`, for when layer file uploads or deletions have been scheduled. We now keep track of whether there have been any uploads or deletions since the last index-file upload, and skip the upload in `schedule_index_upload_for_file_changes` if there haven't been any changes. That allows us to call the function liberally in timeline.rs, whenever layer file uploads or deletions might've been scheduled, without starting a lot of unnecessary index file uploads. GC was covered earlier by commit `c262390214`, but that missed that we have the same problem with compaction.	2022-12-19 23:58:24 +02:00
Kirill Bulatov	3735aece56	Safekeeper: Always use workdir as a full path	2022-12-19 21:43:36 +02:00
Kirill Bulatov	9ddd1d7522	Stop all storage nodes on startup failure	2022-12-19 21:43:36 +02:00
Kirill Bulatov	49a211c98a	Add neon_local test	2022-12-19 21:43:36 +02:00
Christian Schwarz	7db018e147	[4/4] the fix: do not leak spawn_blocking() tasks from logical size calculation code - Refactor logical_size_calculation_task, moving the pieces that are specific to try_spawn_size_init_task into that function. This allows us to spawn additional size calculation tasks that are not init size calculation tasks. - As part of this refactoring, stop logging cancellations as errors. They are part of regular operations. Logging them as errors was inadvertently introduced in earlier commit 427c1b2e9661161439e65aabc173d695cfc03ab4 initial logical size calculation: if it fails, retry on next call - Change tenant size model request code to spawn task_mgr tasks using the refactored logical_size_calculation_task function. Using a task_mgr task ensures that the calculation cannot outlive the timeline. - There are presumably still some subtle race conditions if a size requests comes in at exactly the same time as a detach / delete request. - But that's the concern of diferent area of the code (e.g., tenant_mgr) and requires holistic solutions, such as the proposed TenantGuard. - Make size calculation cancellable using CancellationToken. This is more of a cherry on top. NB: the test code doesn't use this because we _must_ return from the failpoint, because the failpoint lib doesn't allow to just continue execution in combination with executing the closure. This commit fixes the tests introduced earlier in this patch series.	2022-12-19 16:14:58 +01:00
Christian Schwarz	38ebd6e7a0	[3/4] make initial size estimation task sensitive to task_mgr shutdown requests This exacerbates the problem pointed out in the previous commit. Why? Because with this patch, deleting a timeline also exposes the issue. Extend the test to expose the problem.	2022-12-19 16:14:58 +01:00
Christian Schwarz	40a3d50883	[2/4] add test to show that tenant detach makes us leak running size calculation task	2022-12-19 16:14:58 +01:00
Christian Schwarz	ee2b5dc9ac	[1/4] initial logical size calculation: if it fails, retry on next call Before this patch, if the task fails, we would not reset self.initial_size_computation_started. So, if it fails, we will return the approximate value forever. In practice, it probably never failed because the local filesystem is quite reliable. But with on-demand download, the logical size calculation may need to download layers, which is more likely to fail at times. There will be internal retires with a timeout, but eventually, the downloads will give up. We want to retry in those cases. While we're at it, also change the handling of the timeline state watch so that we treat it as an error. Most likely, we'll not be called again, but if we are, retrying is the right thing.	2022-12-19 16:14:58 +01:00
Christian Schwarz	c785a516aa	remove TimelineInfo.{Remote,Local} along with their types follow-up of https://github.com/neondatabase/neon/pull/2615 which is neon.git: `538876650a` must be deployed after cloud.git change https://github.com/neondatabase/cloud/issues/3232 fixes https://github.com/neondatabase/neon/issues/3041	2022-12-19 14:37:40 +01:00
Heikki Linnakangas	e23d5da51c	Tidy up and add comments to the pageserver startup code. To make it more readable.	2022-12-19 14:03:22 +02:00

1 2 3 4 5 ...

2553 Commits