neon/ctl at 66fc465484326f5a87760797715b0bb4959da38d - neon

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-01-04 20:12:54 +00:00


Vlad Lazar 2b11466b59 pageserver: optimise disk io for vectored get (#6780)

## Problem
The vectored read path proposed in
https://github.com/neondatabase/neon/pull/6576 seems
to be functionally correct, but in my testing (see below) it is about 10-20% slower than the naive
sequential vectored implementation.

## Summary of changes
There are three parts to this PR:
1. Supporting vectored blob reads. This is trickier than it
sounds because on-disk blobs are prefixed with a variable-length size header.
Since the blobs are not necessarily fixed size, we need to track the offsets
so that callers can retrieve the blobs from the resulting buffer.

2. Merging disk read requests issued by the vectored read path up to a
maximum size. Again, the merging is complicated by the fact that blobs
are not fixed size. We keep track of the begin and end offsets of each blob
and pass them into the vectored blob reader. In turn, the reader returns
a buffer and the offsets at which the blobs begin and end.

3. Adding a benchmark for basebackup requests against tenants with large SLRU
block counts. This required a small change to pagebench and a new config
variable for the pageserver which toggles the vectored get validation.
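
The merging described in part 2 can be sketched roughly as follows. This is an illustrative sketch, not the code from this PR: the `BlobRange` and `MergedRead` types and the `merge_reads` function are hypothetical names, and the sketch assumes the blob ranges are sorted, non-overlapping, and only merged when they are directly adjacent on disk.

```rust
/// Byte range of one blob on disk, size header included (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct BlobRange {
    start: u64,
    end: u64, // exclusive
}

/// One coalesced disk read, plus where each blob sits inside its buffer.
#[derive(Debug, PartialEq, Eq)]
struct MergedRead {
    start: u64,
    end: u64, // exclusive
    /// (begin, end) offsets of each blob, relative to `start`.
    blobs: Vec<(usize, usize)>,
}

/// Coalesces adjacent blob ranges into reads of at most `max_size` bytes.
/// A single blob larger than `max_size` still gets a read of its own.
/// Assumes `blobs` is sorted by `start` and the ranges do not overlap.
fn merge_reads(blobs: &[BlobRange], max_size: u64) -> Vec<MergedRead> {
    let mut reads: Vec<MergedRead> = Vec::new();
    for blob in blobs {
        // Extend the current read only if the blob is contiguous with it
        // and the merged read would stay within the size limit.
        let extend = match reads.last() {
            Some(last) => last.end == blob.start && blob.end - last.start <= max_size,
            None => false,
        };
        if !extend {
            reads.push(MergedRead {
                start: blob.start,
                end: blob.start,
                blobs: Vec::new(),
            });
        }
        let read = reads.last_mut().expect("a read exists at this point");
        // Record the blob's position relative to the start of the buffer,
        // so the caller can slice it back out after the read completes.
        let begin = (blob.start - read.start) as usize;
        let end = (blob.end - read.start) as usize;
        read.blobs.push((begin, end));
        read.end = blob.end;
    }
    reads
}
```

Calling `merge_reads` with four blobs at `0..10`, `10..25`, `25..40` and `100..120` and a limit of 30 bytes yields three reads: the first two blobs share one buffer, the third starts a new read because merging it would exceed the limit, and the fourth is not contiguous with the third.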

We can probably optimise things further by adding a little bit of
concurrency for our IO. In principle, it's as simple as spawning a task which
issues the IO, while the parent task receives the results via a channel and
does the serialisation and handling.
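
As a rough illustration of that idea, the sketch below uses a standard-library thread and `std::sync::mpsc` channel rather than the async runtime the pageserver actually uses. The `read_range` and `read_and_handle` functions are hypothetical, the "disk" is an in-memory byte vector, and the "handling" is just summing the bytes of each buffer.

```rust
use std::sync::mpsc;
use std::thread;

/// Hypothetical stand-in for a disk read: copies a byte range out of an
/// in-memory "file".
fn read_range(file: &[u8], range: (usize, usize)) -> Vec<u8> {
    file[range.0..range.1].to_vec()
}

/// Issues the reads on a spawned thread and handles each buffer on the
/// calling thread as soon as it arrives over the channel.
fn read_and_handle(file: Vec<u8>, ranges: Vec<(usize, usize)>) -> Vec<usize> {
    let (tx, rx) = mpsc::channel::<Vec<u8>>();

    // IO side: only issues reads and forwards the buffers.
    let io = thread::spawn(move || {
        for range in ranges {
            if tx.send(read_range(&file, range)).is_err() {
                break; // the receiver went away, stop reading
            }
        }
        // `tx` is dropped here, which ends the receiving loop below.
    });

    // Parent side: handles buffers in the order they were read.
    let handled: Vec<usize> = rx
        .iter()
        .map(|buf| buf.iter().map(|&b| b as usize).sum::<usize>())
        .collect();

    io.join().expect("IO thread panicked");
    handled
}
```

With this split, the next read can be in flight while the parent is still processing the previous buffer.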

2024-02-28 12:06:00 +00:00

src

pageserver: optimise disk io for vectored get (#6780)

2024-02-28 12:06:00 +00:00

Cargo.toml

fix: cleanup of layers from the future can race with their re-creation (#5890)

2023-11-23 13:33:41 +00:00