rust/neon - neon - Gitea: Git with a cup of tea

rust/neon

mirror of https://github.com/neondatabase/neon.git synced 2026-06-01 04:20:39 +00:00

Author	SHA1	Message	Date
Alexander Bayandin	3837fca7a2	compute-node-image: fix postgis download (#4280 ) ## Problem `osgeo.org` is experiencing some problems with DNS resolving which breaks `compute-node-image` (because it can't download postgis) ## Summary of changes - Add `140.211.15.30 download.osgeo.org` to /etc/hosts by passing it via the container option	2023-05-19 15:34:22 +01:00
Alexander Bayandin	1b2ece3715	Re-enable compatibility tests on Postgres 15 (#4274 ) - Enable compatibility tests for Postgres 15 - Also add `PgVersion::v_prefixed` property to return the version number with, _guess what,_ v-prefix!	2023-05-18 19:56:09 +01:00
Alexander Bayandin	30fe310602	Code Coverage: upload reports to S3 (#4256 ) ## Problem `neondatabase/zenith-coverage-data` is too big: - It takes ~6 minutes to clone and push the repo - GitHub fails to publish an HTML report to github.io Part of https://github.com/neondatabase/neon/issues/3543 ## Summary of changes Replace pushing code coverage report to `neondatabase/zenith-coverage-data` with uploading it to S3	2023-05-17 11:30:07 +01:00
Alexander Bayandin	131343ed45	Fix regress-tests job for Postgres 15 on release branch (#4253 ) ## Problem Compatibility tests don't support Postgres 15 yet, but we're still trying to upload compatibility snapshot (which we do not collect). Ref https://github.com/neondatabase/neon/actions/runs/4991394158/jobs/8940369368#step:4:38129 ## Summary of changes Add `pg_version` parameter to `run-python-test-set` actions and do not upload compatibility snapshot for Postgres 15	2023-05-16 17:18:56 +01:00
Alexander Bayandin	a65e0774a5	Increase shared memory size for regression test run (#4232 ) Should fix flakiness caused by the error ``` FATAL: could not resize shared memory segment "/PostgreSQL.3944613150" to 1048576 bytes: No space left on device ```	2023-05-16 14:06:47 +01:00
Alexander Bayandin	bb06d281ea	Run regressions tests on both Postgres 14 and 15 (#4192 ) This PR adds tests runs on Postgres 15 and created unified Allure report with results for all tests. - Split `.github/actions/allure-report` into `.github/actions/allure-report-store` and `.github/actions/allure-report-generate` - Add debug or release pytest parameter for all tests (depending on `BUILD_TYPE` env variable) - Add Postgres version as a pytest parameter for all tests (depending on `DEFAULT_PG_VERSION` env variable) - Fix `test_wal_restore` and `restore_from_wal.sh` to support path with `[`/`]` in it (fixed by applying spellcheck to the script and fixing all warnings), `restore_from_wal_archive.sh` is deleted as unused. - All known failures on Postgres 15 marked with xfail	2023-05-12 15:28:51 +01:00
Sergey Melnikov	0d3d022eb1	Remove deploy workflows (#4157 ) ## Describe your changes Removing deploy workflows (moving to aws repo)	2023-05-08 17:30:16 +02:00
Gleb Novikov	9860d59aa2	Public docker image repository by default	2023-05-08 15:51:54 +04:00
Alexander Bayandin	b114ef26c2	GitHub Autocomment: add a note if no tests were run (#4109 ) - Always (if not cancelled) add a comment to a PR - Mention in the comment if no tests were run / reports were not generated.	2023-05-03 15:38:49 +01:00
Christian Schwarz	5b911e1f9f	build: run clippy for powerset of features (#4077 ) This will catch compiler & clippy warnings in all feature combinations. We should probably use cargo hack for build and test as well, but, that's quite expensive and would add to overall CI wait times. obsoletes https://github.com/neondatabase/neon/pull/4073 refs https://github.com/neondatabase/neon/pull/4070	2023-04-27 15:01:27 +03:00
Alexander Bayandin	2d6fd72177	GitHub Workflows: Fix crane for several registries (#4076 ) Follow-up fix after https://github.com/neondatabase/neon/pull/4067 ``` + crane tag neondatabase/vm-compute-node-v14:3064 latest Error: fetching "neondatabase/vm-compute-node-v14:3064": GET https://index.docker.io/v2/neondatabase/vm-compute-node-v14/manifests/3064: MANIFEST_UNKNOWN: manifest unknown; unknown tag=3064 ``` I reverted back the previous approach for promoting images (login to one registry, save images to local fs, logout and login to another registry, and push images from local fs). It turns out what works for one Google project (kaniko), doesn't work for another (crane) [sigh]	2023-04-25 23:58:59 +01:00
Alexander Bayandin	05ac0e2493	Login to ECR and Docker Hub at once (#4067 ) - Update kaniko to 1.9.2 (from 1.7.0), problem with reproducible build is fixed - Login to ECR and Docker Hub at once, so we can push to several registries, it makes job `push-docker-hub` unneeded - `push-docker-hub` replaced with `promote-images` in `needs:` clause, Pushing images to production ECR moved to `promote-images` job	2023-04-25 17:54:10 +01:00
Alexander Bayandin	c94b8998be	GitHub Workflows: print error messages to stderr	2023-04-12 15:22:18 +01:00
Alexander Bayandin	218062ceba	GitHub Workflows: use ref_name instead of ref	2023-04-12 15:22:18 +01:00
Dmitry Rodionov	b45c92e533	tests: exclude compatibility tests by default (#3975 ) This allows to skip compatibility tests based on `CHECK_ONDISK_DATA_COMPATIBILITY` environment variable. When the variable is missing (default) compatibility tests wont be run.	2023-04-06 21:21:39 +03:00
Alexander Bayandin	9310949b44	GitHub Autocomment: Retry on server errors (#3958 ) Retry posting/updating a comment in case of 5XX errors from GitHub API	2023-04-05 22:08:06 +03:00
Alexander Bayandin	1d23b5d1de	Comment PR with test results (#3907 ) This PR adds posting a comment with test results. Each workflow run updates the comment with new results. The layout and the information that we post can be changed to our needs, right now, it contains failed tests and test which changes status after rerun (i.e. flaky tests)	2023-04-04 12:22:47 +01:00
Alexander Bayandin	105b8bb9d3	test_runner: automatically rerun flaky tests (#3880 ) This PR adds a plugin that automatically reruns (up to 3 times) flaky tests. Internally, it uses data from `TEST_RESULT_CONNSTR` database and `pytest-rerunfailures` plugin. As the first approximation we consider the test flaky if it has failed on the main branch in the last 10 days. Flaky tests are fetched by `scripts/flaky_tests.py` script (it's possible to use it in a standalone mode to learn which tests are flaky), stored to a JSON file, and then the file is passed to the pytest plugin.	2023-04-04 12:21:54 +01:00
Joonas Koivunen	d0711d0896	build: fix git perms for deploy job (#3921 ) copy pasted from `build-neon` job. it is interesting that this is only needed by `build-neon` and `deploy`. Fixes: https://github.com/neondatabase/neon/actions/runs/4568077915/jobs/8070960178 which seems to have been going for a while.	2023-03-31 16:05:15 +03:00
Kirill Bulatov	9d714a8413	Split $CARGO_FLAGS and $CARGO_FEATURES to make e2e tests work	2023-03-29 00:08:30 +03:00
Kirill Bulatov	6c84cbbb58	Run new Rust IT test in CI	2023-03-29 00:08:30 +03:00
Shany Pozin	809acb5fa9	Move neon-image-depot to a larger runner (#3860 ) ## Describe your changes https://neondb.slack.com/archives/C039YKBRZB4/p1679413279637059 ## Issue ticket number and link ## Checklist before requesting a review - [ ] I have performed a self-review of my code. - [ ] If it is a core feature, I have added thorough tests. - [ ] Do we need to implement analytics? if so did you add the relevant metrics to the dashboard? - [ ] If this PR requires public announcement, mark it with /release-notes label and add several sentences in this section.	2023-03-21 19:32:36 +02:00
Alexander Bayandin	3d869cbcde	Replace flake8 and isort with ruff (#3810 ) - Introduce ruff (https://beta.ruff.rs/) to replace flake8 and isort - Update mypy and black	2023-03-14 13:25:44 +00:00
Rory de Zoete	3c4f5af1b9	Try depot.dev for image building (#3768 ) To see if it is faster. Run side-by-side for a while so we can gather enough data.	2023-03-10 11:11:39 +01:00
sharnoff	2153d2e00a	Run compute_ctl in a cgroup in VMs (#3577 )	2023-02-17 14:14:41 -08:00
Alexander Bayandin	567b71c1d2	Require poetry 1.3; regenerate poetry.lock (#3508 ) Ref https://python-poetry.org/blog/announcing-poetry-1.3.0/#new-lock-file-format	2023-02-01 18:11:00 +00:00
Sergey Melnikov	f3dadfb3d0	Confirm that there is an emergency before manual execution of prod deploy workflow (#3507 ) ![image](https://user-images.githubusercontent.com/7127190/215840037-69eda3ee-920e-4b90-bf7d-aa58f0bdfb50.png)	2023-02-01 16:01:27 +01:00
Sergey Melnikov	5e08b35f53	Fix new deploy workflow (#3492 ) Add 'branch' input to specify commit for deploy scripts/configs. Commit can't be passed to workflow as ref, and we need to pin configs to specific commit for main/release deploys Update deploy input descriptions to match GH interface	2023-01-30 22:08:00 +01:00
Sergey Melnikov	82cbcb36ab	Extract neon deploy jobs into separate workflows (#3424 ) Extract deploy jobs from build_and_test.yml to deploy-dev and deploy-prod workflows. Add trigger to run this workflows after Neon is build and tested on main and release branches. This will allow us to redeploy/rollback/patch config without full rebuild.	2023-01-30 20:10:54 +01:00
Rory de Zoete	7bb13569b3	Switch more jobs to small runner (#3483 ) As these jobs don't benefit from additional cores Co-authored-by: Rory de Zoete <rdezoete@RorysMacStudio.fritz.box>	2023-01-30 14:00:44 +01:00
Rory de Zoete	4d291d0e90	Prevent assume error (#3476 ) To fix `Error: The requested DurationSeconds exceeds the MaxSessionDuration set for this role.` Co-authored-by: Rory de Zoete <rdezoete@Rorys-Mac-Studio.fritz.box>	2023-01-27 19:27:23 +01:00
Rory de Zoete	4718c67c17	Update deploy steps (#3470 ) First one isn't optimal, but as it was requested to run the runner as nonroot -> https://github.com/neondatabase/runner/pull/1#discussion_r1069909593 this job will need more significant refactoring. This should unblock the deployment process. --------- Co-authored-by: Rory de Zoete <rdezoete@Rorys-Mac-Studio.fritz.box>	2023-01-27 18:05:49 +01:00
Rory de Zoete	8342e9ea6f	Update helm job (#3467 ) As followup from https://github.com/neondatabase/build/pull/47 Co-authored-by: Rory de Zoete <rdezoete@Rorys-Mac-Studio.fritz.box>	2023-01-27 13:28:26 +01:00
Rory de Zoete	2388981311	Add cleanup tasks for ansible and helm (#3465 ) To fix: https://github.com/neondatabase/neon/actions/runs/4023027504/jobs/6913421070 https://github.com/neondatabase/neon/actions/runs/4023027504/jobs/6913421268 Co-authored-by: Rory de Zoete <rdezoete@RorysMacStudio.fritz.box>	2023-01-27 11:20:51 +01:00
Sergey Melnikov	fb721cdfa5	Setup legacy scram proxy in the new account (#3461 ) This setup proxies with *.cloud.neon.tech certificate in the us-west-2 region of the new account, we are not switching to them here yet	2023-01-27 11:05:05 +01:00
Sergey Melnikov	2ecd0e1f00	Decommission link proxy from old account (#3454 )	2023-01-26 16:18:57 +01:00
Rory de Zoete	b858d70f19	Update promote job (#3455 ) To fix errors such as: `An error occurred (ImageAlreadyExistsException) when calling the PutImage operation: Image with digest 'sha256:da6d8ad97d84e3aec4e6a240c3a35868b626692ee5d199cdd3fe45d29a8e54df' and tag 'latest' already exists in the repository with name 'compute-node-v14' in registry with id '369495373322'` Co-authored-by: Rory de Zoete <rdezoete@RorysMacStudio.fritz.box> Co-authored-by: Rory de Zoete <rdezoete@Rorys-Mac-Studio.fritz.box>	2023-01-26 14:26:23 +01:00
Rory de Zoete	4bcbb7793d	Revert docker hub job (#3453 ) Regression fix as permissions aren't configured properly on gen3 for this job. Co-authored-by: Rory de Zoete <rdezoete@RorysMacStudio.fritz.box>	2023-01-26 11:30:53 +01:00
Rory de Zoete	cd5732d9d8	Gen3 runners (#3220 ) https://github.com/neondatabase/cloud/issues/2738 Co-authored-by: Rory de Zoete <rdezoete@Rorys-Mac-Studio.fritz.box> Co-authored-by: Rory de Zoete <rdezoete@RorysMacStudio.fritz.box>	2023-01-26 10:46:06 +01:00
Sergey Melnikov	4b8dbea5c1	Add production link proxy to new account (#3444 ) This PR setup link proxy in us-east-2 region, but do not redirect pg.neon.tech DNS name to it Will keep old link proxy for the time of migration	2023-01-25 17:15:56 +01:00
Vadim Kharitonov	00f1f54b7a	Leave one Dockerfile	2023-01-25 15:10:45 +01:00
sharnoff	f8e887830a	build: Use `curl -f` on vm-informant download (#3363 ) Without this, we can silently fail	2023-01-17 10:38:33 +01:00
sharnoff	5c6a7a17cb	Add VM informant to vm-compute-node (#3324 ) The general idea is that the VM informant binary is added to the vm-compute-node images only. `compute_tools` then will run whatever's at `/bin/vm-informant`, if the path exists.	2023-01-16 07:05:29 -08:00
Sergey Melnikov	95bf19b85a	Add --atomic to all helm upgrade operations (#3299 ) When number of github actions workers is changed, some jobs get killed. When helm if killed during the upgrade, release stuck in pending-upgrade state. --atomic should initiate automatic rollback in this case.	2023-01-10 10:05:27 +00:00
Sergey Melnikov	14df37c108	Use GHA environments for gradual prod rollout (#3295 ) Each release will wait for manual approval for each region	2023-01-09 20:18:16 +04:00
Sergey Melnikov	93c77b0383	Use GHA environment for per-region deploy approvals on staging (#3293 ) Each main deploy will wait for manual approval for each region	2023-01-09 15:40:14 +04:00
Heikki Linnakangas	e9583db73b	Remove code and test to generate flamegraph on GetPage requests. (#3257 ) It was nice to have and useful at the time, but unfortunately the method used to gather the profiling data doesn't play nicely with 'async'. PR #3228 will turn 'get_page_at_lsn' function async, which will break the profiling support. Let's remove it, and re-introduce some kind of profiling later, using some different method, if we feel like we need it again.	2023-01-03 20:11:32 +02:00
Vadim Kharitonov	0b428f7c41	Enable licenses check for 3rd-parties	2023-01-03 15:11:50 +01:00
Sergey Melnikov	c01f92c081	Fully remove old staging deploy (#3191 )	2022-12-22 20:09:45 +01:00
Sergey Melnikov	7bc17b373e	Fix calculate-deploy-targets (#3189 ) Was broken in https://github.com/neondatabase/neon/pull/3180	2022-12-22 16:28:36 +01:00

1 2 3 4

162 Commits