rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-05-16 20:50:37 +00:00

Author	SHA1	Message	Date
Bojan Serafimov	5bcc5e4891	Fix errors from new clippy version	2022-07-12 16:26:53 -04:00
Bojan Serafimov	05e3d16a18	More conflict resolution	2022-07-12 15:23:52 -04:00
Bojan Serafimov	4724e4b2d1	Resolve conflicts	2022-07-12 15:09:44 -04:00
Heikki Linnakangas	971c03873f	Optimize importing a physical backup Before this patch, importing a physical backup followed the same path as ingesting any WAL records: 1. All the data pages from the backup are first collected in the DatadirModification object. 2. Then, they are "committed" to the Repository. They are written to the in-memory layer 3. Finally, the in-memory layer is frozen, and flushed to disk as a L0 delta layer file. This was pretty inefficient. In step 1, the whole physical backup was held in memory. If the backup is large, you simply run out of memory. And in step 3, the resulting L0 delta layer file is large, holding all the data again. That's a problem if the backup is larger than 5 GB: Amazon S3 doesn't allow uploading files larger than 5 GB (without using multi-part upload, see github issue #1910). So we want to avoid that. To alleviate those problems, optimize the codepath for importing a physical backup. The basic flow is the same as before, but step 1 is optimized so that it doesn't accumulate all the data in memory, and step 3 writes the data in image layers instead of one large delta layer.	2022-07-12 14:54:34 -04:00
Heikki Linnakangas	ffd778a4a2	If an error happens during import of base backup or WAL, log it. We only sent the error to the client, with no trace in the pageserver log. Log it, similar to how we log errors in GetPage@LSN requests.	2022-07-12 14:53:09 -04:00
bojanserafimov	a25ccce3c8	Fix signal file parsing (#2042 )	2022-07-12 14:52:54 -04:00
Bojan Serafimov	1389d6b6a5	Fix gc after import	2022-07-12 14:52:43 -04:00
bojanserafimov	8ca3faa61e	Import basebackup into pageserver (#1925 ) Allow importing basebackup taken from vanilla postgres or another pageserver via psql copy in protocol.	2022-07-12 14:52:30 -04:00
KlimentSerafimov	8346aa3a29	Potential fix to #1626 . Fixed typo is Makefile. (#1781 ) * Potential fix to #1626. Fixed typo is Makefile. * Completed fix to #1626. Summary: changed 'error' to 'bail' in start_pageserver and start_safekeeper.	2022-05-24 04:55:38 -04:00
Heikki Linnakangas	2aceb6a309	Fix garbage collection to not remove image layers that are still needed. The logic would incorrectly remove an image layer, if a new image layer existed, even though the older image layer was still needed by some delta layers after it. See example given in the comment this adds. Without this fix, I was getting a lot of "could not find data for key 010000000000000000000000000000000000" errors from GC, with the new test case being added in PR #1735. Fixes #707	2022-05-23 20:58:27 +03:00
Heikki Linnakangas	ee3bcf108d	Fix compact_level0 for delta layers with overlap or gaps We saw a case in staging, where there was a gap in the LSN ranges of level 0 files, like this: 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__0000000001696070-00000000016960E9 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__00000000016960E9-00000000016E4DB9 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__00000000016E4DB9-000000000BFCE3E1 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__000000000BFCE3E1-000000000BFD0FE9 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__0000000060045901-000000007005EAC1 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__000000007005EAC1-0000000080062E99 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__0000000080062E99-000000009007F481 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__000000009007F481-00000000A009F7C9 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__00000000A009F7C9-00000000AA284EB9 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__00000000AA286471-00000000AA2886B9 Note that gap between 000000000BFD0FE9 and 0000000060045901. I don't know how that happened, but in general the pageserver should be robust if there are gaps like that, or overlapping files etc. In theory they could happen as result of crashes, partial downloads from S3 etc., although it is mystery what caused it in this case. Looking at the compaction code, it was not safe in the face of gaps like that. The compaction routine collected all the level 0 files, and took their min(start)..max(end) as the range of the new files it builds. That's wrong, if the level 0 files don't cover the whole LSN range; the newly created files will miss any records in the gap. Fix that, by only collecting contiguous sequences of level 0 files, so that the end LSN of previous delta file is equal to the start of the next one. Fixes issue #1730	2022-05-19 10:19:38 +03:00
Heikki Linnakangas	0da4046704	Include traversal path in error message. Previously, the path was printed to the log with separate error!() calls. It's better to include the whole path in the error object and have it printed to the log as one message. Also print the path in the ValueReconstructResult::Missing case. This is what it looks like now: 2022-05-17T21:53:53.611801Z ERROR pagestream{timeline=5adcb4af3e95f00a31550d266aab7a37 tenant=74d9f9ad3293c030c6a6e196dd91c60f}: error reading relation or page version: could not find data for key 000000067F000032BE000000000000000001 at LSN 0/1698C48, for request at LSN 0/1698CF8 Caused by: 0: layer traversal: result Complete, cont_lsn 0/1698C48, layer: 000000000000000000000000000000000000-FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF__0000000001698C48-0000000001698CC1 1: layer traversal: result Continue, cont_lsn 0/1698CC1, layer: inmem-0000000001698CC1-FFFFFFFFFFFFFFFF Stack backtrace:	2022-05-19 10:19:38 +03:00
Anastasia Lubennikova	cbd00d7ed9	Remove temp layer files during timeline initialization on pageserver start	2022-05-19 10:11:12 +03:00
Anastasia Lubennikova	4c30ae8ba3	Add random string as a part of tempfile name	2022-05-19 10:11:12 +03:00
Anastasia Lubennikova	3da4b3165e	Fsync layer files before rename	2022-05-19 10:11:12 +03:00
Anastasia Lubennikova	c1b365fdf7	Use temp filename while writing ImageLayer file	2022-05-19 10:11:12 +03:00
Dmitry Rodionov	5914aab78a	add comments, use expect instead of unwrap	2022-05-19 00:54:14 +03:00
Heikki Linnakangas	4a36d89247	Avoid spawning a layer-flush thread when there's no work to do. The check_checkpoint_distance() always spawned a new thread, even if there is no frozen layer to flush. That was a thinko, as @knizhnik pointed out.	2022-05-19 00:51:48 +03:00
Arthur Petukhovsky	134eeeb096	Add more common storage metrics (#1722 ) - Enabled process exporter for storage services - Changed zenith_proxy prefix to just proxy - Removed old `monitoring` directory - Removed common prefix for metrics, now our common metrics have `libmetrics_` prefix, for example `libmetrics_serve_metrics_count` - Added `test_metrics_normal_work`	2022-05-17 19:29:01 +03:00
Heikki Linnakangas	55ea3f262e	Fix race condition leading to panic in remote storage sync thread. The SyncQueue consisted of a tokio mpsc channel, and an atomic counter to keep track of how many items there are in the channel. Updating the atomic counter was racy, and sometimes the consumer would decrement the counter before the producer had incremented it, leading to integer wraparound to usize::MAX. Calling Vec::with_capacity(usize::MAX) leads to a panic. To fix, replace the channel with a VecDeque protected by a Mutex, and a condition variable for signaling. Now that the queue is now protected by standard blocking Mutex and Condvar, refactor the functions touching it to be sync, not async. A theoretical downside of this is that the calls to push items to the queue and the storage sync thread that drains the queue might now need to wait, if another thread is busy manipulating the queue. I believe that's OK; the lock isn't held for very long, and these operations are made in background threads, not in the hot GetPage@LSN path, so they're not very latency-sensitive. Fixes #1719. Also add a test case.	2022-05-17 18:14:57 +03:00
Kirill Bulatov	a884f4cf6b	Add etcd to neon_local	2022-05-17 01:17:44 +03:00
Kirill Bulatov	9a0fed0880	Enable at least 1 safekeeper in every test	2022-05-17 01:17:44 +03:00
chaitanya sharma	85b5c0e989	List profiling as a feature with 'pageserver --enabled-features' Fixes https://github.com/neondatabase/neon/issues/1627	2022-05-16 21:10:57 +03:00
Thang Pham	e4a70faa08	Add more information to timeline-related APIs (#1673 ) Resolves #1488. - implemented `GET tenant/:tenant_id/timeline/:timeline_id/wal_receiver` endpoint - returned `thread_id` in `thread_mgr::spawn` - added `latest_gc_cutoff_lsn` field to `LocalTimelineInfo` struct	2022-05-16 11:05:43 -04:00
Heikki Linnakangas	51ea9c3053	Don't swallow panics when the pageserver is build with failpoints. It's very confusing, and because you don't get a stack trace and error message in the logs, makes debugging very hard. However, the 'test_pageserver_recovery' test relied on that behavior. To support that, add a new "exit" action to the pageserver 'failpoints' command, so that you can explicitly request to exit the process when a failpoint is hit.	2022-05-16 09:58:58 +03:00
Heikki Linnakangas	a10cac980f	Continue with pageserver startup, if loading some tenants fail. Fixes https://github.com/neondatabase/neon/issues/1664	2022-05-15 00:25:38 +03:00
Anastasia Lubennikova	a2561f0a78	Use tenant's pitr_interval instead of hardroded 0 in the command. Adjust python tests that use the	2022-05-13 18:32:14 +03:00
Anastasia Lubennikova	aa7c601eca	Fix pitr_interval check in GC: Use timestamp->LSN mapping instead of file modification time. Fix 'latest_gc_cutoff_lsn' - set it to the minimum of pitr_cutoff and gc_cutoff. Add new test: test_pitr_gc	2022-05-13 18:32:14 +03:00
Kirill Bulatov	b683308791	Return GIT_VERSION back to storage binaries	2022-05-13 16:34:32 +03:00
Kirill Bulatov	51c0f9ab2b	Force git version to be up to date via decl macro	2022-05-13 16:34:32 +03:00
Arthur Petukhovsky	ec8861b8cc	Fix pageserver metrics names (#1682 ) Try to follow Prometheus style-guide https://prometheus.io/docs/practices/naming/ for metrics names. More specifically: - Use `pageserver_` prefix for all pagserver metrics - Specify `_seconds` unit in time metrics - Use unit as a suffix in other cases, such as `_hits`, `_bytes`, `_records` - Use `_total` suffix for accumulating counters (note that Histograms append that suffix internally)	2022-05-12 19:53:07 +03:00
Heikki Linnakangas	5da4f3a4df	Refactor DeltaLayer::dump() function Put most of the code in a closure that returns Result, so that we can use the ?-operator for error handling. That's simpler.	2022-05-12 10:31:04 +03:00
Konstantin Knizhnik	2bde77fced	Do not apply records with LSN smaller than LSN of cached image in del… (#1672 ) * Do not apply records with LSN smaller than LSN of cached image in delta layer * Do not apply records with LSN smaller than LSN of cached image in delta layer	2022-05-12 07:56:02 +03:00
Dhammika Pathirana	c864091035	Fix err msg typo Signed-off-by: Dhammika Pathirana <dham@neon.tech>	2022-05-11 16:13:26 -07:00
Konstantin Knizhnik	e6e883eb12	Do not set LSN for new FPI page (#1657 ) * Do not set LSN for new FPI page refer #1656 * Add page_is_new, page_get_lsn, page_set_lsn functions * Fix page_is_new implementation * Add comment from XLogReadBufferForRedoExtended	2022-05-11 15:23:17 +03:00
Thang Pham	87dfa99734	Update layered_repository REAMDE (#1659 )	2022-05-10 09:55:14 -04:00
Kirill Bulatov	0a7735a656	Rework remote storage sync queue, general refactoring	2022-05-07 01:33:33 +03:00
Kirill Bulatov	64a602b8f3	Delete timeline layers	2022-05-07 01:33:33 +03:00
Kirill Bulatov	10e4da3997	Rework timeline batching	2022-05-07 01:33:33 +03:00
Kirill Bulatov	de37f982db	Share the remote storage as a crate	2022-05-07 00:30:36 +03:00
Kirill Bulatov	2ef0e5c6ed	Do not require metadata in every upload sync task	2022-05-05 18:26:39 +03:00
Kirill Bulatov	52a7e3155e	Add local path to the Layer trait and historic layers	2022-05-05 18:26:39 +03:00
Dmitry Rodionov	0f3ec83172	avoid detach with alive branches	2022-05-05 12:54:42 +03:00
bojanserafimov	bc569dde51	Remove some unwraps from waldecoder (#1539 )	2022-05-04 17:41:05 -04:00
Anastasia Lubennikova	e2cf77441d	Implement pg_database_size(). In this implementation dbsize equals sum of all relation sizes, excluding shared ones.	2022-05-04 18:14:45 +03:00
Stas Kelvich	5642d0b2b8	Change shutdown_process_on_error thread spawn settings. Now princeple is following: acceptor threads (libpq and http) error will bring the pageserver down, but all per-tenant thread failures will be treated as an error.	2022-05-04 00:42:57 +03:00
Dmitry Rodionov	2f83f793bc	print more details when thread fails	2022-05-03 18:31:23 +03:00
Anastasia Lubennikova	2f9b17b9e5	Add simple test of pageserver recovery after crash. To cause a crash, use failpoints in checkpointer	2022-05-03 17:13:09 +03:00
Dmitry Rodionov	e7cba0b607	use thiserror instead of anyhow in disk_btree	2022-05-03 15:34:23 +03:00
Dmitry Rodionov	ff7e9a86c6	turn panic into an error with more details	2022-05-03 12:44:42 +03:00

1 2 3 4 5 ...

753 Commits