neon/docs at 44121cc175e4493c69c13448686a178bb136b6cd - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2025-12-22 21:59:59 +00:00

Files

History

Alexey Kondratov 44121cc175 docs(compute): RFC for compute rolling restart with prewarm (#11294 )

## Problem

Neon currently implements several features that guarantee high uptime of
compute nodes:

1. Storage high-availability (HA), i.e. each tenant shard has a
secondary pageserver location, so we can quickly switch over compute to
it in case of primary pageserver failure.
2. Fast compute provisioning, i.e. we have a fleet of pre-created empty
computes, that are ready to serve workload, so restarting unresponsive
compute is very fast.
3. Preemptive NeonVM compute provisioning in case of k8s node
unavailability.

This helps us to be well-within the uptime SLO of 99.95% most of the
time. Problems begin when we go up to multi-TB workloads and 32-64 CU
computes. During restart, compute looses all caches: LFC, shared
buffers, file system cache. Depending on the workload, it can take a lot
of time to warm up the caches, so that performance could be degraded and
might be even unacceptable for certain workloads. The latter means that
although current approach works well for small to
medium workloads, we still have to do some additional work to avoid
performance degradation after restart of large instances.

[Rendered
version](https://github.com/neondatabase/neon/blob/alexk/pg-prewarm-rfc/docs/rfcs/2025-03-17-compute-prewarm.md)

Part of https://github.com/neondatabase/cloud/issues/19011

2025-07-02 17:16:00 +00:00

..

docs(compute): RFC for compute rolling restart with prewarm (#11294 )

2025-07-02 17:16:00 +00:00

.gitignore

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

authentication.md

tests/neon_local: rename "attachment service" -> "storage controller" (#7087 )

2024-03-12 11:36:27 +00:00

book.toml

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

consumption_metrics.md

pageserver: remove resident size from billing metrics (#11699 )

2025-04-29 18:34:56 +00:00

core_changes.md

Replace a few references to Zenith with neon

2024-06-18 20:01:32 +03:00

docker.md

Stop building 'compute-tools' image (#10333 )

2025-01-11 13:09:55 +00:00

error-handling.md

docs: error handling: document preferred anyhow context & logging style (#5178 )

2023-10-17 15:41:47 +01:00

glossary.md

Add a section in glossary to explain what "logical size" means. (#2306 )

2022-08-19 21:57:00 +03:00

multitenancy.md

Rename old project name references

2022-09-14 08:14:05 +03:00

pageserver-compaction.md

docs: add compaction notes (#11415 )

2025-04-02 19:55:08 +00:00

pageserver-page-service.md

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

pageserver-pagecache.md

remove materialized page cache (#8105 )

2024-06-20 11:56:14 +02:00

pageserver-processing-getpage.md

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

pageserver-processing-wal.md

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

pageserver-services.md

Add support to specifying storage account in AzureConfig (#8090 )

2024-06-18 16:03:23 +02:00

pageserver-storage.md

chore: update wording in docs to improve readability (#6607 )

2024-02-04 19:33:38 +00:00

pageserver-tenant-migration.md

Rename old project name references

2022-09-14 08:14:05 +03:00

pageserver-thread-mgmt.md

chore: update wording in docs to improve readability (#6607 )

2024-02-04 19:33:38 +00:00

pageserver-walredo.md

chore: update wording in docs to improve readability (#6607 )

2024-02-04 19:33:38 +00:00

pageserver.md

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

safekeeper-protocol.md

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

separation-compute-storage.md

Reorganize, expand, improve internal documentation

2022-07-18 17:39:12 +03:00

settings.md

remove materialized page cache (#8105 )

2024-06-20 11:56:14 +02:00

sourcetree.md

Python 3.11 (#9515 )

2024-11-21 16:25:31 +00:00

storage_broker.md

Fix deploy after 2d42f84389.

2022-11-24 20:07:41 +04:00

storage_controller.md

storcon: change default stripe size to 16 MB (#11168 )

2025-04-09 08:41:38 +00:00

SUMMARY.md

docs: add compaction notes (#11415 )

2025-04-02 19:55:08 +00:00

synthetic-size.md

Update links in synthetic-size.md (#8501 )

2024-07-26 01:14:12 +01:00

tools.md

Document recommended ccls setup (#4723 )

2023-07-17 09:21:42 -04:00

updating-postgres.md

Document how to use "git merge" for PostgreSQL minor version upgrades. (#8692 )

2024-08-23 09:15:55 +03:00

walservice.md

Fix links to safekeeper protocol docs. (#2188 )

2022-08-09 10:19:18 +07:00