## Zenith test runner This directory contains integration tests. Prerequisites: - Python 3.7 or later - Development headers may also be needed to build `psycopg2` from source. - Python 3.7 is recommended if you want to update tests. - Dependencies: install them via `pipenv install`. Note that Debian/Ubuntu packages are stale, as it commonly happens, so manual installation is not recommended. Exact version of `pipenv` is not important unless you change dependencies. Run `pipenv shell` to activate the venv or use `pipenv run` to run a single command in the venv, e.g. `pipenv run pytest`. - Zenith and Postgres binaries - See the root README.md for build directions - Tests can be run from the git tree; or see the environment variables below to run from other directories. - The zenith git repo, including the postgres submodule (for some tests, e.g. pg_regress) ### Test Organization The tests are divided into a few batches, such that each batch takes roughly the same amount of time. The batches can be run in parallel, to minimize total runtime. Currently, there are only two batches: - test_batch_pg_regress: Runs PostgreSQL regression tests - test_others: All other tests ### Running the tests Because pytest will search all subdirectories for tests, it's easiest to run the tests from within the `test_runner` directory. Test state (postgres data, pageserver state, and log files) will be stored under a directory `test_output`. You can run all the tests with: `pytest` If you want to run all the tests in a particular file: `pytest test_pgbench.py` If you want to run all tests that have the string "bench" in their names: `pytest -k bench` Useful environment variables: `ZENITH_BIN`: The directory where zenith binaries can be found. `POSTGRES_DISTRIB_DIR`: The directory where postgres distribution can be found. `TEST_OUTPUT`: Set the directory where test state and test output files should go. `TEST_SHARED_FIXTURES`: Try to re-use a single pageserver for all the tests. Let stdout, stderr and `INFO` log messages go to the terminal instead of capturing them: `pytest -s --log-cli-level=INFO ...` (Note many tests capture subprocess outputs separately, so this may not show much.) Exit after the first test failure: `pytest -x ...` (there are many more pytest options; run `pytest -h` to see them.) ### Writing a test Every test needs a Zenith Environment, or ZenithEnv to operate in. A Zenith Environment is like a little cloud-in-a-box, and consists of a Pageserver, 0-N Safekeepers, and compute Postgres nodes. The connections between them can be configured to use JWT authentication tokens, and some other configuration options can be tweaked too. The easiest way to get access to a Zenith Environment is by using the `zenith_simple_env` fixture. The 'simple' env may be shared across multiple tests, so don't shut down the nodes or make other destructive changes in that environment. Also don't assume that there are no tenants or branches or data in the cluster. For convenience, there is a branch called `empty`, though. The convention is to create a test-specific branch of that and load any test data there, instead of the 'main' branch. For more complicated cases, you can build a custom Zenith Environment, with the `zenith_env` fixture: ```python def test_foobar(zenith_env_builder: ZenithEnvBuilder): # Prescribe the environment. # We want to have 3 safekeeper nodes, and use JWT authentication in the # connections to the page server zenith_env_builder.num_safekeepers = 3 zenith_env_builder.set_pageserver_auth(True) # Now create the environment. This initializes the repository, and starts # up the page server and the safekeepers env = zenith_env_builder.init() # Run the test ... ``` For more information about pytest fixtures, see https://docs.pytest.org/en/stable/fixture.html At the end of a test, all the nodes in the environment are automatically stopped, so you don't need to worry about cleaning up. Logs and test data are preserved for the analysis, in a directory under `../test_output/` ### Before submitting a patch #### Obligatory checks Install dev dependencies via `pipenv --python 3.7 install --dev` (better) or `pipenv install --dev` (if you don't have Python 3.7 and don't need to change dependencies). We force code formatting via yapf and type hints via mypy. Run the following commands in the `test_runner/` directory: ```bash pipenv run yapf -ri . # All code is reformatted pipenv run mypy . # Ensure there are no typing errors ``` #### Advisable actions * Writing a couple of docstrings to clarify the reasoning behind a new test. * Running `flake8` (or a linter of your choice, e.g. `pycodestyle`) and fixing possible defects, if any. * Adding more type hints to your code to avoid `Any`, especially: * For fixture parameters, they are not automatically deduced. * For function arguments and return values. #### Changing dependencies You have to update `Pipfile.lock` if you have changed `Pipfile`: ```bash pipenv --python 3.7 install --dev # Re-create venv for Python 3.7 and install recent pipenv inside pipenv run pipenv --version # Should be at least 2021.5.29 pipenv run pipenv lock # Regenerate Pipfile.lock ``` As the minimal supported version is Python 3.7 and we use it in CI, you have to use a Python 3.7 environment when updating `Pipfile.lock`. Otherwise some back-compatibility packages will be missing. It is also important to run recent `pipenv`. Older versions remove markers from `Pipfile.lock`. If you don't have Python 3.7, you should install it and its headers (for `psycopg2`) separately, e.g.: ```bash # In Ubuntu sudo add-apt-repository ppa:deadsnakes/ppa sudo apt update sudo apt install python3.7 python3.7-dev ```