neon/pageserver at 42ab34dc362b1b54dc96c43202b43d5ece558aa7 - neon

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2025-12-22 21:59:59 +00:00

Files

Erik Grinaker 42ab34dc36 pageserver/client_grpc: don't pipeline GetPage requests (#12584 )

## Problem

The communicator gRPC client currently attempts to pipeline GetPage
requests from multiple callers onto the same gRPC stream. This has a
number of issues:

* Head-of-line blocking: the request may block on e.g. layer download or
LSN wait, delaying the next request.
* Cancellation: we can't easily cancel in-progress requests (e.g. due to
timeout or backend termination), so it may keep blocking the next
request (even its own retry).
* Complex stream scheduling: picking a stream becomes harder/slower, and
additional Tokio tasks and synchronization is needed for stream
management.

Touches #11735.
Requires #12579.

## Summary of changes

This patch removes pipelining of gRPC stream requests, and instead
prefers to scale out the number of streams to achieve the same
throughput. Stream scheduling has been rewritten, and mostly follows the
same pattern as the client pool with exclusive acquisition by a single
caller.

[Benchmarks](https://github.com/neondatabase/neon/pull/12583) show that
the cost of an idle server-side GetPage worker task is about 26 KB (2.5
GB for 100,000), so we can afford to scale out.

This has a number of advantages:

* It (mostly) eliminates head-of-line blocking (except at the TCP
level).
* Cancellation becomes trivial, by closing the stream.
* Stream scheduling becomes significantly simpler and cheaper.
* Individual callers can still use client-side batching for pipelining.

2025-07-14 12:11:33 +00:00

benches

Use enum-typed PG versions (#12317 )

2025-06-24 17:25:31 +00:00

client

A few PS changes (#12540 )

2025-07-10 14:39:38 +00:00

client_grpc

pageserver/client_grpc: don't pipeline GetPage requests (#12584 )

2025-07-14 12:11:33 +00:00

compaction

apply clippy fixes for 1.88.0 beta (#12331 )