greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-07-07 06:20:39 +00:00


Weny Xu 2ae20daa62 feat: add sync region instruction for repartition procedure (#7562)

* feat: add sync region instruction for repartition procedure

This commit introduces a new sync region instruction and integrates it
into the repartition procedure flow, specifically for metric engine tables.

Changes:
- Add SyncRegion instruction type and SyncRegionsReply in instruction.rs
- Implement SyncRegionHandler in datanode to handle sync region requests
- Add SyncRegion state in repartition procedure to sync newly allocated regions
- Integrate sync region step after enter_staging_region for metric engine tables
- Add sync_region flag and allocated_region_ids to PersistentContext
- Make SyncRegionFromRequest serializable for instruction transmission
- Add test utilities and mock support for sync region operations

The sync region step is conditionally executed based on the table engine type,
ensuring that newly allocated regions in metric engine tables are properly
synced from their source regions before proceeding with manifest remapping.

Signed-off-by: WenyXu <wenymedia@gmail.com>

* chore: add logs

Signed-off-by: WenyXu <wenymedia@gmail.com>

* feat(repartition): improve staging region handling and support metric engine repartition
- Reorder sync region flow: move SyncRegion from EnterStagingRegion to RepartitionStart to sync before applying staging
- Add ExitStaging metadata update state to properly clear staging leader info after repartition completes
- Update build_template_from_raw_table_info to optionally skip metric engine internal columns when creating region requests
- Fix region state transition: set_dropping now expects specific state (Staging or Writable) for proper validation
- Adjust region drop and copy handlers to handle staging regions correctly
- Add comprehensive test cases for metric engine SPLIT/MERGE partition operations on physical tables with logical tables
- Improve logging for table route updates, region drops, and repartition operations

Signed-off-by: WenyXu <wenymedia@gmail.com>

* refactor: removes code duplication

Signed-off-by: WenyXu <wenymedia@gmail.com>

* fix: update result

Signed-off-by: WenyXu <wenymedia@gmail.com>

* chore: refine comments

Signed-off-by: WenyXu <wenymedia@gmail.com>

* feat: add error strategy support for flush region and flush pending deallocate regions

- **Add `ErrorStrategy` enum** in `procedure/utils.rs`:
  - Supports `Ignore` and `Retry` strategies for error handling
  - Refactor `flush_region` to accept `error_strategy` parameter
  - Extract `handle_flush_region_reply` helper function for better code organization

- **Add pending deallocate region support**:
  - Add `pending_deallocate_region_ids` field to `PersistentContext`
  - Implement `flush_pending_deallocate_regions` in `EnterStagingRegion` state
  - Flush pending deallocate regions before entering staging regions to ensure data consistency

- **Update error handling**:
  - `flush_leader_region`: Use `ErrorStrategy::Ignore` to skip unreachable datanodes
  - `sync_region`: Use `ErrorStrategy::Retry` for critical operations
  - `enter_staging_region`: Use `ErrorStrategy::Retry` when flushing pending deallocate regions

This change improves the robustness of the repartition procedure by:
1. Providing flexible error handling strategies for flush operations
2. Ensuring pending deallocate regions are properly flushed before repartitioning
3. Preventing data inconsistency during region migration
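
As a rough illustration of the two strategies described above, here is a minimal sketch. Only the `ErrorStrategy` name and its `Ignore` and `Retry` variants come from this commit message; `FlushOutcome` and `handle_flush_result` are hypothetical names invented for the example, and the actual code in `procedure/utils.rs` may look quite different.

```rust
/// How the caller wants a failed flush to be treated.
/// Variant names follow the commit message; everything else is hypothetical.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ErrorStrategy {
    /// Log and move on, e.g. when the datanode is unreachable.
    Ignore,
    /// Surface the failure so the step can be attempted again.
    Retry,
}

/// Hypothetical outcome type, not taken from the GreptimeDB source.
#[derive(Debug, PartialEq, Eq)]
pub enum FlushOutcome {
    Flushed,
    Skipped(String),
    RetryLater(String),
}

/// Maps a flush result to an outcome according to the chosen strategy.
pub fn handle_flush_result(
    result: Result<(), String>,
    strategy: ErrorStrategy,
) -> FlushOutcome {
    match (result, strategy) {
        (Ok(()), _) => FlushOutcome::Flushed,
        (Err(reason), ErrorStrategy::Ignore) => FlushOutcome::Skipped(reason),
        (Err(reason), ErrorStrategy::Retry) => FlushOutcome::RetryLater(reason),
    }
}
```

In this sketch a best-effort step would pass `ErrorStrategy::Ignore`, while a step that must succeed before the procedure continues would pass `ErrorStrategy::Retry`.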

Signed-off-by: WenyXu <wenymedia@gmail.com>

* chore: apply suggestions from CR

Signed-off-by: WenyXu <wenymedia@gmail.com>

* fix: compile

Signed-off-by: WenyXu <wenymedia@gmail.com>

---------

Signed-off-by: WenyXu <wenymedia@gmail.com>

2026-01-15 04:52:57 +00:00

.cargo

feat: put sqlness into a separated dir (#6911)

2025-09-05 01:39:29 +00:00

.config

build: on windows (#2054)

2023-08-10 08:08:37 +00:00

.github

feat(meta-srv): fallback to non-TLS connection when etcd TLS prefer mode fail (#7507)

2025-12-31 10:03:34 +00:00

config

feat: allow auto schema creation for pg (#7459)

2025-12-23 08:55:24 +00:00

cyborg

ci: handle prerelease version (#7492)

2025-12-29 08:21:05 +00:00

docker

feat: add building option to build images base on distroless image (#7240)

2025-11-26 05:13:05 +00:00

docs

docs: rfc for vector index (#7353)

2025-12-24 02:54:25 +00:00

grafana

chore: add grafana dashboard about trigger (#7536)

2026-01-08 06:47:46 +00:00

scripts

feat: add TLS support for mysql backend (#6979)

2025-09-16 13:46:37 +00:00

src

feat: add sync region instruction for repartition procedure (#7562)

2026-01-15 04:52:57 +00:00

tests

feat: add sync region instruction for repartition procedure (#7562)

2026-01-15 04:52:57 +00:00

tests-fuzz

chore: upgrade DataFusion family (#7558)

2026-01-14 14:02:31 +00:00

tests-integration

feat: impl vector index scan in storage (#7528)

2026-01-12 08:30:51 +00:00

.dockerignore

fix: docker build (#1822)

2023-06-25 11:05:46 +08:00

.editorconfig

feat: to_timezone function (#3470)

2024-03-12 01:46:19 +00:00

.env.example

feat: add GcsConfig credential field (#4568)

2024-08-16 03:11:20 +00:00

.gitignore

feat: refine the MemoryGuard (#7466)

2025-12-25 04:09:32 +00:00

.pre-commit-config.yaml

chore: check for redundant pre-commit hooks (#7506)

2026-01-07 13:46:42 +00:00

AUTHOR.md

chore: members and committers update (#7341)

2025-12-04 04:08:43 +00:00

Cargo.lock

chore: upgrade DataFusion family (#7558)

2026-01-14 14:02:31 +00:00

Cargo.toml

chore: upgrade DataFusion family (#7558)

2026-01-14 14:02:31 +00:00

cliff.toml

ci: update breaking change title level (#7497)

2025-12-30 06:17:51 +00:00

codecov.yml

refactor: refactor TableRouteManager (#3392)

2024-02-28 06:18:09 +00:00

CONTRIBUTING.md

fix: typo in AI-assisted contributions policy (#7472)

2025-12-25 03:03:14 +00:00

Cross.toml

fix: cross compiling for aarch64 targets and allow customizing page size (#5487)

2025-02-07 11:21:16 +00:00

flake.lock

feat: grafana postgresql data source query builder support (#7379)

2025-12-11 03:18:35 +00:00

flake.nix

feat: grafana postgresql data source query builder support (#7379)

2025-12-11 03:18:35 +00:00

LICENSE

chore: multiple licenses fixes (#2714)

2023-11-09 10:38:12 +00:00

licenserc.toml

feat: trigger alter parse (#6553)

2025-07-29 11:07:31 +00:00

Makefile

chore: mount cargo git cache in docker builds (#7484)

2025-12-26 01:56:11 +00:00

README.md

docs: update project status and tweak readme (#7216)

2025-11-12 15:06:56 +00:00

rust-toolchain.toml

chore: update rust to nightly 2025-10-01 (#7069)

2025-10-11 07:30:52 +00:00

rustfmt.toml

chore: specify import style in rustfmt (#460)

2022-11-15 15:58:54 +08:00

SECURITY.md

feat: Create SECURITY.md (#1270)

2023-03-28 19:14:29 +08:00

taplo.toml

chore: skip reorder workspace tables in taplo (#3388)

2024-02-26 08:57:49 +00:00

typos.toml

refactor: explicitly define json struct to ingest jsonbench data (#7462)

2025-12-24 07:30:22 +00:00

README.md

Real-Time & Cloud-Native Observability Database
for metrics, logs, and traces

Delivers sub-second querying at PB scale and exceptional cost efficiency from edge to cloud.

User Guide | API Docs | Roadmap 2025

Introduction
⭐ Key Features
Quick Comparison
Architecture
Try GreptimeDB
Getting Started
Build From Source
Tools & Extensions
Project Status
Community
License
Commercial Support
Contributing
Acknowledgement

Introduction

GreptimeDB is an open-source, cloud-native database that unifies metrics, logs, and traces, enabling real-time observability at any scale — across edge, cloud, and hybrid environments.

Features

| Feature | Description |
| --- | --- |
| All-in-One Observability | OpenTelemetry-native platform unifying metrics, logs, and traces. Query via SQL, PromQL, and Flow. |
| High Performance | Written in Rust with rich indexing (inverted, fulltext, skipping, vector), delivering sub-second responses at PB scale. |
| Cost Efficiency | 50x lower operational and storage costs with compute-storage separation and native object storage (S3, Azure Blob, etc.). |
| Cloud-Native & Scalable | Purpose-built for Kubernetes with unlimited cross-cloud scaling, handling hundreds of thousands of concurrent requests. |
| Developer-Friendly | SQL/PromQL interfaces, built-in web dashboard, REST API, MySQL/PostgreSQL protocol compatibility, and native OpenTelemetry support. |
| Flexible Deployment | Deploy anywhere from ARM-based edge devices (including Android) to cloud, with unified APIs and efficient data sync. |

✅ Perfect for:

Unified observability stack replacing Prometheus + Loki + Tempo
Large-scale metrics with high cardinality (millions to billions of time series)
Large-scale observability platform requiring cost efficiency and scalability
IoT and edge computing with resource and bandwidth constraints

Learn more in "Why GreptimeDB" and "Observability 2.0 and the Database for It".

Quick Comparison

| Feature | GreptimeDB | Traditional TSDB | Log Stores |
| --- | --- | --- | --- |
| Data Types | Metrics, Logs, Traces | Metrics only | Logs only |
| Query Language | SQL, PromQL | Custom/PromQL | Custom/DSL |
| Deployment | Edge + Cloud | Cloud/On-prem | Mostly central |
| Indexing & Performance | PB-Scale, Sub-second | Varies | Varies |
| Integration | REST API, SQL, Common protocols | Varies | Varies |

Performance:

Architecture

GreptimeDB can run in two modes:

Standalone Mode - Single binary for development and small deployments
Distributed Mode - Separate components for production scale:
- Frontend: Query processing and protocol handling
- Datanode: Data storage and retrieval
- Metasrv: Metadata management and coordination

Read the architecture document. DeepWiki also provides an in-depth look at GreptimeDB.

Try GreptimeDB

docker pull greptime/greptimedb

docker run -p 127.0.0.1:4000-4003:4000-4003 \
  -v "$(pwd)/greptimedb_data:/greptimedb_data" \
  --name greptime --rm \
  greptime/greptimedb:latest standalone start \
  --http-addr 0.0.0.0:4000 \
  --rpc-bind-addr 0.0.0.0:4001 \
  --mysql-addr 0.0.0.0:4002 \
  --postgres-addr 0.0.0.0:4003

Dashboard: http://localhost:4000/dashboard
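
Once the container is running, it can be handy to confirm that the four published ports accept connections. The following is a minimal sketch using only the Rust standard library. The port numbers and the loopback address come from the `docker run` command above; the function names `endpoints`, `is_listening`, and `probe` are hypothetical helpers and not part of GreptimeDB.

```rust
use std::net::{SocketAddr, TcpStream};
use std::time::Duration;

/// Host-side endpoints published by the `docker run` command above.
/// The labels mirror the flags: --http-addr, --rpc-bind-addr,
/// --mysql-addr and --postgres-addr.
fn endpoints() -> Vec<(&'static str, SocketAddr)> {
    vec![
        ("http", SocketAddr::from(([127, 0, 0, 1], 4000))),
        ("rpc", SocketAddr::from(([127, 0, 0, 1], 4001))),
        ("mysql", SocketAddr::from(([127, 0, 0, 1], 4002))),
        ("postgres", SocketAddr::from(([127, 0, 0, 1], 4003))),
    ]
}

/// Returns true if a TCP connection to `addr` succeeds within `timeout`.
/// This only checks that something is listening, not which protocol it speaks.
fn is_listening(addr: &SocketAddr, timeout: Duration) -> bool {
    TcpStream::connect_timeout(addr, timeout).is_ok()
}

/// Builds one human-readable status line per endpoint.
fn probe(timeout: Duration) -> Vec<String> {
    endpoints()
        .into_iter()
        .map(|(name, addr)| {
            let state = if is_listening(&addr, timeout) { "up" } else { "down" };
            format!("{name} {addr} {state}")
        })
        .collect()
}
```

Calling `probe(Duration::from_millis(200))` from a `main` function and printing each line gives a quick readiness check before opening the dashboard.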

Getting Started

Build From Source

Prerequisites:

Rust toolchain (nightly)
Protobuf compiler (>= 3.15)
C/C++ build essentials, including gcc/g++/autoconf and the glibc library (e.g., libc6-dev on Ubuntu and glibc-devel on Fedora)
Python toolchain (optional): required only for some test scripts.

Build and Run:

make
cargo run -- standalone start

Tools & Extensions

Kubernetes: GreptimeDB Operator
Helm Charts: Greptime Helm Charts
Dashboard: Web UI
gRPC Ingester: Go, Java, C++, Erlang, Rust
Grafana Data Source: GreptimeDB Grafana data source plugin
Grafana Dashboard: Official Dashboard for monitoring

Project Status

Status: Beta, marching toward v1.0 GA!
GA (v1.0): January 10, 2026

Deployed in production by open-source projects and commercial users
Stable, actively maintained, with regular releases (version info)
Suitable for evaluation and pilot deployments

GreptimeDB v1.0 represents a major milestone toward maturity — marking stable APIs, production readiness, and proven performance.

Roadmap: Beta1 (Nov 10) → Beta2 (Nov 24) → RC1 (Dec 8) → GA (Jan 10, 2026). See the v1.0 highlights and release plan for details.

For production use, we recommend using the latest stable release.

If you find this project useful, a ⭐ would mean a lot to us!

Community

We invite you to engage and contribute!

License

GreptimeDB is licensed under the Apache License 2.0.

Commercial Support

Running GreptimeDB in your organization? We offer enterprise add-ons, services, training, and consulting. Contact us for details.

Contributing

Read our Contribution Guidelines.
Explore Internal Concepts and DeepWiki.
Pick up a good first issue and join the #contributors Slack channel.

Acknowledgement

Special thanks to all contributors! See AUTHOR.md.

GreptimeDB uses Apache Arrow™ (memory model), Apache Parquet™ (file storage), Apache DataFusion™ (query engine), and Apache OpenDAL™ (data access abstraction).

Description

Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.

analytics cloud-native database distributed greptimedb logs metrics monitoring observability observability-database observability-datalake promql rust rust-database sql time-series traces tsdb

