Previously, a SnapshotLayer and its corresponding file on disk contained the
base image of every page in the segment at the start LSN, plus all the
changes (= WAL records) in the range between the start and end LSNs. That
was a bit awkward, because we had to keep the base image of every page in
memory until enough WAL had accumulated after it to write out the layer.
Also, when it is time to write out a layer, we would really rather replay
the WAL and store the most recent version of each page, to save that effort
later. This rests on the assumption that the client will usually request
the most recent page version, not an older one.
Split SnapshotLayer into two structs: ImageLayer and DeltaLayer. An image
layer contains a "snapshot" of the segment at one specific LSN, and no WAL
records, whereas a delta layer contains the WAL records in a range of LSNs.
To reconstruct a page version from a delta layer by performing WAL redo,
you also need the previous image layer, so the delta layers are
"incremental" against the previous layer.
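As a rough illustration of the split, here is a minimal Rust sketch; the type and field names (Lsn, BlockNum, the BTreeMap representations, apply_wal_record) are assumptions for this example, not the actual pageserver definitions:

```rust
use std::collections::BTreeMap;

// Illustrative types; the real on-disk and in-memory representations differ.
type Lsn = u64;
type BlockNum = u32;
type PageImage = Vec<u8>;
type WalRecord = Vec<u8>;

/// A full "snapshot" of the segment at exactly one LSN; no WAL records.
struct ImageLayer {
    lsn: Lsn,
    pages: BTreeMap<BlockNum, PageImage>,
}

/// Only the WAL records in the LSN range [start_lsn, end_lsn);
/// incremental against the image layer at `start_lsn`.
struct DeltaLayer {
    start_lsn: Lsn,
    end_lsn: Lsn,
    records: BTreeMap<(BlockNum, Lsn), WalRecord>,
}

/// Reconstructing a page at `lsn` starts from the base image and
/// replays the delta's records for that block up to `lsn`.
fn reconstruct(img: &ImageLayer, delta: &DeltaLayer, blk: BlockNum, lsn: Lsn) -> PageImage {
    let mut page = img.pages.get(&blk).cloned().unwrap_or_default();
    // Apply the records for this block with img.lsn < record LSN <= lsn.
    for ((_, rec_lsn), rec) in delta.records.range((blk, 0)..=(blk, lsn)) {
        if *rec_lsn > img.lsn {
            apply_wal_record(&mut page, rec); // stand-in for the WAL redo service
        }
    }
    page
}

fn apply_wal_record(_page: &mut PageImage, _rec: &WalRecord) {
    // Placeholder: real redo is performed by the WAL redo service.
}
```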
So where previously we would create snapshot files like this:
rel_100_200
rel_200_300
rel_300_400
We now create image and delta files like this:
rel_100 # image
rel_100_200 # delta
rel_200
rel_200_300
rel_300
rel_300_400
rel_400
That's more files, but as discussed above, this allows storing more
up-to-date page versions on disk, which should reduce the latency of
responding to a GetPage request. It also allows more fine-grained garbage
collection. In the above example, once the old page versions are no longer
needed and the relation is not modified anymore, we only need to keep the
latest image file, 'rel_400'; everything else can be removed.
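Under that scheme, garbage collection reduces to a scan over the layers of each relation. A hedged sketch, assuming a hypothetical Layer enum and a GC cutoff LSN (the real pageserver logic also has to respect branch points and retention settings):

```rust
/// Hypothetical layer descriptor for the file listing above, e.g.
/// "rel_400" -> Image { lsn: 400 }, "rel_300_400" -> Delta { .. }.
enum Layer {
    Image { lsn: u64 },
    Delta { start_lsn: u64, end_lsn: u64 },
}

/// Returns the layers that must be kept: the newest image layer at or
/// below `cutoff_lsn`, plus every layer that extends beyond it.
fn layers_to_keep(layers: &[Layer], cutoff_lsn: u64) -> Vec<&Layer> {
    // The newest image at or below the cutoff is the base needed to
    // reconstruct any page version that is still visible.
    let base = layers
        .iter()
        .filter_map(|l| match l {
            Layer::Image { lsn } if *lsn <= cutoff_lsn => Some(*lsn),
            _ => None,
        })
        .max();
    layers
        .iter()
        .filter(|l| match (l, base) {
            (Layer::Image { lsn }, Some(b)) => *lsn >= b,
            (Layer::Delta { end_lsn, .. }, Some(b)) => *end_lsn > b,
            _ => true, // no image below the cutoff yet: keep everything
        })
        .collect()
}
```

With the listing above and an unmodified relation, any cutoff at or past LSN 400 keeps only 'rel_400', matching the description.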
Implements https://github.com/zenithdb/zenith/issues/339
Zenith
Zenith replaces the PostgreSQL storage layer and redistributes data across a cluster of nodes.
Architecture overview
A Zenith installation consists of compute nodes and a storage engine.
Compute nodes are stateless PostgreSQL nodes backed by Zenith storage.
Zenith storage engine consists of two major components:
- Pageserver. Scalable storage backend for compute nodes.
- WAL service. The service that receives WAL from the compute node and ensures that it is stored durably.
Pageserver consists of:
- Page cache repository - Zenith storage implementation.
- WAL receiver - service that receives WAL from the WAL service and stores it in the page cache repository.
- Page service - service that communicates with compute nodes and responds with pages from the repository.
- WAL redo - service that builds pages from base images and WAL records at the Page service's request.
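To make the flow between these components concrete, here is a speculative Rust sketch; the trait and method names are illustrative assumptions, not the actual pageserver API:

```rust
// Hypothetical interface sketch; names are illustrative only.
type Lsn = u64;
type PageImage = Vec<u8>;
type WalRecord = Vec<u8>;

trait Repository {
    /// Called by the WAL receiver for each record streamed from the
    /// WAL service.
    fn put_wal_record(&mut self, blknum: u32, lsn: Lsn, rec: WalRecord);

    /// Called by the page service on a GetPage request: find the latest
    /// base image at or before `lsn`, then have the WAL redo service
    /// apply the records between that image and `lsn`.
    fn get_page_at_lsn(&self, blknum: u32, lsn: Lsn) -> PageImage;
}
```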
Running local installation
- Install build dependencies and other useful packages
On Ubuntu or Debian this set of packages should be sufficient to build the code:
apt install build-essential libtool libreadline-dev zlib1g-dev flex bison libseccomp-dev \
libssl-dev clang
Rust 1.52 or later is also required.
To run the psql client, install the postgresql-client package or modify PATH and LD_LIBRARY_PATH to include tmp_install/bin and tmp_install/lib, respectively.
To run the integration tests (not required to use the code), install
Python (3.6 or higher) and the needed python3 packages by running pipenv install in the project directory.
- Build zenith and patched postgres
git clone --recursive https://github.com/zenithdb/zenith.git
cd zenith
make -j5
- Start the pageserver and postgres on top of it (run from the repo root):
# Create repository in .zenith with proper paths to binaries and data
# Later this will be the responsibility of a package install script
> ./target/debug/zenith init
pageserver init succeeded
# start pageserver
> ./target/debug/zenith start
Starting pageserver at '127.0.0.1:64000' in .zenith
Pageserver started
# start postgres on top of the pageserver
> ./target/debug/zenith pg start main
Starting postgres node at 'host=127.0.0.1 port=55432 user=stas'
waiting for server to start.... done
# check list of running postgres instances
> ./target/debug/zenith pg list
BRANCH ADDRESS LSN STATUS
main 127.0.0.1:55432 0/1609610 running
- Now it is possible to connect to postgres and run some queries:
> psql -p55432 -h 127.0.0.1 postgres
postgres=# CREATE TABLE t(key int primary key, value text);
CREATE TABLE
postgres=# insert into t values(1,1);
INSERT 0 1
postgres=# select * from t;
key | value
-----+-------
1 | 1
(1 row)
- And create branches and run postgres on them:
# create branch named migration_check
> ./target/debug/zenith branch migration_check main
Created branch 'migration_check' at 0/1609610
# check branches tree
> ./target/debug/zenith branch
main
┗━ @0/1609610: migration_check
# start postgres on that branch
> ./target/debug/zenith pg start migration_check
Starting postgres node at 'host=127.0.0.1 port=55433 user=stas'
waiting for server to start.... done
# this new postgres instance will have all the data from the 'main' postgres,
# but modifications will not affect the data in the original postgres
> psql -p55433 -h 127.0.0.1 postgres
postgres=# select * from t;
key | value
-----+-------
1 | 1
(1 row)
postgres=# insert into t values(2,2);
INSERT 0 1
Running tests
git clone --recursive https://github.com/zenithdb/zenith.git
make # also builds postgres and installs it into ./tmp_install
cd test_runner
pytest
Documentation
Currently, we use README files to cover design ideas and the overall architecture of each module, plus rustdoc-style documentation comments. See also /docs/ for a top-level overview of all available markdown documentation.
- /docs/sourcetree.md contains an overview of the source tree layout.
To view your rustdoc documentation in a browser, try running cargo doc --no-deps --open
Join the development
- Read CONTRIBUTING.md to learn about project code style and practices.
- Use the glossary in /docs/glossary.md.
- To get familiar with the source tree layout, use /docs/sourcetree.md.
- To learn more about PostgreSQL internals, check http://www.interdb.jp/pg/index.html