greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-07-04 04:50:37 +00:00

Author	SHA1	Message	Date
copilot-swe-agent[bot]	48c04c81ee	Allow flush to recreate missing flows	2026-06-10 10:48:52 +00:00
copilot-swe-agent[bot]	f1574d940d	Fail flow recovery on partial recreate errors	2026-06-10 10:17:35 +00:00
Weny Xu	05c4588f90	feat: support remote WAL logical pruning (#8259 ) * feat: support logical deletion for remote WAL pruning Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: trigger remote WAL flush for lagging prunable regions Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: use from_mins Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: fix unit tests Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-10 04:40:51 +00:00
jeremyhi	6ceae4498f	refactor(cli): stream export-v2 verify data listing (#8268 ) Signed-off-by: fengjiachun <fengjiachun@gmail.com>	2026-06-10 02:59:27 +00:00
Lei, HUANG	e74a73638d	feat: separate datanode query and ingestion runtimes (#8246 ) * feat: add datanode runtime options Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: add datanode runtime handles Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: wire datanode runtimes into region server Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: route datanode ingestion to ingestion runtime Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: add datanode query runtime stream bridge Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: route datanode reads to query runtime Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: add datanode global runtimes Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: use common datanode runtimes Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: run mito scan tasks on query runtime Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: split datanode runtime options Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: clippy Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: share global fallback for datanode runtimes Use the global runtime as the fallback for datanode query and ingestion runtimes when datanode-specific pools are not initialized. This avoids creating unused datanode worker pools in non-datanode services. Files: - `src/common/runtime/src/global.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: docs Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: forward query runtime stream metrics Forward inner stream metrics through the datanode query runtime bridge so `EXPLAIN ANALYZE` can report plan metrics after stream polling moves to the query runtime. Files: - `src/datanode/src/query_stream.rs` - `src/datanode/src/region_server.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: route metric batch puts to ingest runtime Run the optimized metric batch put path on the datanode ingest runtime so metric ingestion does not bypass runtime isolation. Files: - `src/datanode/src/region_server.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: abort query producer on stream drop Abort the datanode query runtime producer when the returned read stream is dropped so cancelled clients do not leave query work running in the background. Files: - `src/datanode/src/query_stream.rs` - `src/datanode/src/region_server.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: simplify query stream bridge setup Create the inner read stream before spawning the datanode query runtime producer so setup does not use an extra task and initialization channel. Files: - `src/datanode/src/region_server.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat/runtime-priority: ### Update Datanode Runtime Options and Region Server Logic - `global.rs`: Adjusted `datanode_ingest_rt_size` to utilize all available CPUs for improved performance. - `region_server.rs`: Simplified the collection of `put_requests` and optimized the `put_regions_batch` call for better efficiency. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat/runtime-priority: ### Remove Redundant Checks and Simplify Code - `global.rs`: Removed the assertion check for already initialized global runtimes to streamline the initialization process. - `region_server.rs`: Simplified the extraction of `Put` requests by removing unnecessary cloning and restructuring the iterator logic. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: remove redundant spawn_datanode_query in RegionServer::handle_read The outer `spawn_datanode_query` wrapped `handle_read_inner` on the same runtime, creating a nested spawn that consumed query runtime threads unnecessarily under concurrent read load. The gRPC handler already provides runtime isolation, so the inner call is sufficient. - `src/datanode/src/region_server.rs` — inline `handle_read_inner` directly instead of spawning onto the datanode query runtime Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: resolve test mismatch and redundant spawn in handle_remote_read - `src/common/runtime/src/global.rs` — update test assertion to match default `datanode_ingest_rt_size` of `cpus` instead of `1` - `src/datanode/src/region_server.rs` — inline `handle_remote_read_inner` directly instead of spawning onto the datanode query runtime Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: rename datanode runtimes Summary: - Rename datanode runtime APIs from `datanode_query` and `datanode_ingest` to `query` and `ingest`. - Rename runtime config keys from `datanode_query_rt_size` and `datanode_ingest_rt_size` to `query_rt_size` and `ingest_rt_size`. - Update config docs, example config, and config-loading coverage. Files: - `src/common/runtime/src/global.rs` - `src/common/runtime/src/lib.rs` - `src/cmd/tests/load_config_test.rs` - `src/datanode/src/region_server.rs` - `src/mito2/src/read/pruner.rs` - `src/mito2/src/read/range_cache.rs` - `src/mito2/src/read/scan_region.rs` - `src/mito2/src/read/series_scan.rs` - `config/datanode.example.toml` - `config/config.md` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: consolidate runtime options Summary: - Embed datanode runtime sizes in shared `RuntimeOptions` and remove the extra `GreptimeOptions` runtime type parameter. - Use the unified `RuntimeOptions` for datanode global and datanode-specific runtime initialization. - Update datanode runtime config coverage and ingest runtime default documentation. Files: - `src/common/runtime/src/global.rs` - `src/common/runtime/src/lib.rs` - `src/cmd/src/options.rs` - `src/cmd/src/datanode.rs` - `src/cmd/src/datanode/builder.rs` - `src/cmd/tests/load_config_test.rs` - `config/datanode.example.toml` - `config/config.md` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * feat: guard against double initialization of datanode runtimes Add an assertion in `init_datanode_runtimes` to panic when global runtimes are already initialized, preventing silent overwrites. - `src/common/runtime/src/global.rs` — assert guard in `init_datanode_runtimes` and test `test_set_datanode_runtimes_panics_after_global_runtimes_initialized` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-06-09 08:21:50 +00:00
Yingwen	1451a51ea0	feat: add tools for development (#8260 ) * feat(cmd): add dev-tools binary with parquetbench command Signed-off-by: evenyag <realevenyag@gmail.com> * chore: add feature gate dev-tools Signed-off-by: evenyag <realevenyag@gmail.com> * feat(dev-tools): print output column count in parquetbench Signed-off-by: evenyag <realevenyag@gmail.com> * feat(dev-tools): print output schema in parquetbench Signed-off-by: evenyag <realevenyag@gmail.com> * refactor(cmd): move parquetbench into datanode CLI, drop dev-tools binary Signed-off-by: evenyag <realevenyag@gmail.com> * fix(dev-tools): validate pprof-after-warmup requires >=2 iterations Signed-off-by: evenyag <realevenyag@gmail.com> * fix(dev-tools): use absolute path instead of super:: import in parquetbench Signed-off-by: evenyag <realevenyag@gmail.com> * fix(dev-tools): base parquetbench throughput on scanned row-group bytes Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-06-09 08:12:21 +00:00
shuiyisong	fc653c84c8	refactor: loki ingestion (#8266 ) * refactor: loki ingestion Signed-off-by: shuiyisong <xixing.sys@gmail.com> * fix: clippy Signed-off-by: shuiyisong <xixing.sys@gmail.com> --------- Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-09 07:53:45 +00:00
jeremyhi	1e53b1a157	fix(config): align scan memory limit default with code (#8228 ) * fix(config): align scan memory limit default with code Signed-off-by: jeremyhi <fengjiachun@gmail.com> * fix: by AI comments Signed-off-by: jeremyhi <fengjiachun@gmail.com> --------- Signed-off-by: jeremyhi <fengjiachun@gmail.com>	2026-06-08 14:01:48 +00:00
discord9	1c7b4c11fc	fix(flow): support complex SQL full-query flows (#8242 ) * fix(flow): skip TWE detection for complex plans Signed-off-by: discord9 <discord9@163.com> * chore: helper Signed-off-by: discord9 <discord9@163.com> * fix(flow): allow SQL CTE flow parsing Signed-off-by: discord9 <discord9@163.com> * fix(flow): execute unscoped SQL snapshots as SQL Signed-off-by: discord9 <discord9@163.com> * fix: column list Signed-off-by: discord9 <discord9@163.com> * refactor: per review Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-06-08 13:07:43 +00:00
dennis zhuang	e403133eb2	feat: add information_schema statistics table (#8253 ) * feat: add information_schema statistics table Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * fix: use index-local sequence in statistics Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * fix: ordinal_position for pk Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * fix: statistics.nullable uses empty string for non-nullable columns Signed-off-by: Dennis Zhuang <killme2008@gmail.com> --------- Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-06-08 12:52:02 +00:00
Weny Xu	9a814ca123	chore: add repartition debug logs (#8245 ) Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-08 08:48:23 +00:00
discord9	651cf04525	feat: fan out remote dynamic filter updates from FE (#8241 ) * feat: fe send update Signed-off-by: discord9 <discord9@163.com> * refactor: after rebase Signed-off-by: discord9 <discord9@163.com> * fix: encode at least send bounds Signed-off-by: discord9 <discord9@163.com> * refactor: per review Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-06-08 08:24:57 +00:00
shuiyisong	5fd5b91b29	chore: remove auth in flownode (#8244 ) * chore: remove auth in flownode Signed-off-by: shuiyisong <xixing.sys@gmail.com> * chore: update docs Signed-off-by: shuiyisong <xixing.sys@gmail.com> * chore: add flow startup check Signed-off-by: shuiyisong <xixing.sys@gmail.com> * fix: clippy Signed-off-by: shuiyisong <xixing.sys@gmail.com> --------- Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-08 08:15:19 +00:00
Ning Sun	0881b7ba32	feat: pgwire 0.40 (#8257 )	2026-06-08 07:31:08 +00:00
Ning Sun	fd64ced4da	feat: introduce plugin setup functions with richer context (#8256 ) feat: enrich plugin setup context	2026-06-08 06:53:08 +00:00
Lei, HUANG	e7ce3ac0c7	fix: accept JSONB OTLP logs against existing schema (#8250 ) * fix: accept JSONB OTLP logs against existing schema Handle existing OTLP log JSONB columns that round-trip as Json while direct log rows carry pre-encoded binary JSONB payloads. Add integration coverage for repeated OTLP log batches after table auto-creation. Files: src/servers/src/otlp/logs.rs, tests-integration/tests/http.rs Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf: cache OTLP log JSONB schema check Cache whether an existing OTLP log column is legacy JSONB once per column before coercing rows. Files: src/servers/src/otlp/logs.rs Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: narrow down schema coercion Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-06-08 01:57:23 +00:00
Weny Xu	7baa4f6854	fix: reset region detector after migration (#8234 ) Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-05 09:14:13 +00:00
Ruihang Xia	428cd8137d	feat: identify noneffective binary modifiers (#8230 ) * perf(promql): use tsid for full-label modifier joins * fix(promql): collect narrow range joins by tsid * verify on for both sides Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * check if tsid exists Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * update sqlness Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * correct behavior Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-06-05 08:44:50 +00:00
Yingwen	f580c42a1f	perf: read primary key as binary if it overflows the dictionary (#8187 ) * perf(mito2): read __primary_key as binary when chunk exceeds dict page limit Signed-off-by: evenyag <realevenyag@gmail.com> * fix(mito2): handle binary __primary_key column in prefilter Signed-off-by: evenyag <realevenyag@gmail.com> * fix(mito2): align prefilter binary PK test with eval_pk_group_mask Signed-off-by: evenyag <realevenyag@gmail.com> * chore: address review comments Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-06-05 08:19:35 +00:00
dennis zhuang	d6c37778ae	feat: table semantic layer information_schema view (Phase 3) (#8240 ) * feat: table semantic layer information_schema view (Phase 3) Add `information_schema.table_semantics`, a queryable view over the table semantic layer. One row per table that carries at least one `greptime.semantic.` option: the signal-agnostic keys (signal_type/source/pipeline/metadata_quality) are promoted to columns and the remaining signal-specific keys are folded into a `semantic_options` JSON string. Tables with no semantic key are excluded. Stacked on Phase 2. Signed-off-by: Dennis Zhuang <killme2008@gmail.com> chore: address PR review on table_semantics - fold JSON serialization failure into None instead of unwrap/panic - drop per-row Vec allocation in predicate eval; use a fixed array - align RFC view name with the shipped `table_semantics` Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * chore: update results Signed-off-by: Dennis Zhuang <killme2008@gmail.com> --------- Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-06-05 08:16:41 +00:00
shuiyisong	d26a80855b	chore: align OTLP logs with existing table schemas (#8229 ) * chore: otlp logs with schema align Signed-off-by: shuiyisong <xixing.sys@gmail.com> * chore: add tests Signed-off-by: shuiyisong <xixing.sys@gmail.com> * fix: CR issue Signed-off-by: shuiyisong <xixing.sys@gmail.com> --------- Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-04 12:53:09 +00:00
dennis zhuang	31c2c1f6db	feat: table semantic layer per-table enrichment (Phase 2) (#8218 ) * feat: table semantic layer per-table enrichment (Phase 2) Phase 2 of the table semantic layer, plus a vocabulary trim so the layer only records what a machine consumer cannot cheaply recover on its own. Per-table metric enrichment (OTLP), via an internal per-table channel: - A `SemanticIndex` accumulator records, per emitted table, the declared metric keys: type / unit / temporality / metadata_quality=declared / original_name. Conflicting single-valued keys collapse to `mixed`/`unknown`. - Recording happens at the `encode_metrics` level where the base name, metric type, and proto fields are all in scope, so histogram/summary fan-out gets the correct per-subtable type (`_bucket`=histogram, `_sum`/`_count`=counter) without threading state through every encoder. - The index is serialized onto the `greptime.internal.semantic.per_table_index` context extension; `apply_per_table_semantic_options` folds each table's keys into its options at auto-create time. - `trace.conventions` is refined from the request's resource/scope `schema_url`s (concrete when uniform, else `mixed`/`unknown`). Vocabulary trimmed to only meaningful keys. Kept: signal_type, source, pipeline, trace.conventions, metric.{type,unit,temporality,metadata_quality,original_name}. Dropped: metric.monotonic (a function of type), trace.has_events/has_links (constant + derivable from columns), log.severity_scheme/body_format (constant / derivable, and body_format cost an O(rows) scan), resource/scope lineage (restates columns / collector-config concern), source_version (no cheap non-constant value today). Prometheus carries type/unit in the metric name by convention, so it gets identity only — no inferred enrichment. Identity (signal_type + source) extended to the remaining ingest protocols so the discovery view is complete: InfluxDB and OpenTSDB (metric), Loki and Elasticsearch (log). These protocols carry no type/unit metadata, so identity is all that applies. Tests: unit coverage for the accumulator, per-metric-type fan-out, and trace conventions; integration goldens updated for the OTLP metric/trace SHOW CREATE output and the new Loki identity. Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * chore: validate the option value Signed-off-by: Dennis Zhuang <killme2008@gmail.com> --------- Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-06-04 12:52:00 +00:00
discord9	6b7772e457	feat: add remote dynamic filter frontend registration (#8148 ) * feat: filter id Signed-off-by: discord9 <discord9@163.com> * feat: dyn filter registry Signed-off-by: discord9 <discord9@163.com> * feat: filter id&refactor to type Signed-off-by: discord9 <discord9@163.com> * feat: merge scan register dyn filter(not send yet) Signed-off-by: discord9 <discord9@163.com> * feat: init reg dyn filter Signed-off-by: discord9 <discord9@163.com> * wip: remote dyn filter task 03 Signed-off-by: discord9 <discord9@163.com> * fix: resolve remote dyn filter rebase fallout Signed-off-by: discord9 <discord9@163.com> * chore: keep remote dyn filter docs local Signed-off-by: discord9 <discord9@163.com> * chore: remove stale filter id allow Signed-off-by: discord9 <discord9@163.com> * chore: clippy Signed-off-by: discord9 <discord9@163.com> * chore: fix remote dyn filter import style Signed-off-by: discord9 <discord9@163.com> * chore: fix query metrics test fallout Signed-off-by: discord9 <discord9@163.com> * fix: exclude region from remote dyn filter id Signed-off-by: discord9 <discord9@163.com> * chore: import Signed-off-by: discord9 <discord9@163.com> * refactor: rm some to latter Signed-off-by: discord9 <discord9@163.com> * feat: add initial dyn filter snapshot Signed-off-by: discord9 <discord9@163.com> * refactor: per review Signed-off-by: discord9 <discord9@163.com> * docs: better comment, rm some slop Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-06-04 12:31:42 +00:00
Weny Xu	42eeeaa514	fix: avoid stale metadata cache after invalidation (#8235 ) Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-04 12:05:17 +00:00
QuakeWang	6ed67167bb	feat: support headerless CSV copy from (#8233 ) * feat: support headerless CSV copy from Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> * fix: update csv copy sqlness result Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> * test: cover headerless CSV copy from Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> * test: cover headerless CSV column count mismatch Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> --------- Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com>	2026-06-04 12:04:15 +00:00
shuiyisong	52b51c31fc	refactor: streamline pipeline ingestion row building (#8236 ) * refactor: minor pipeline refactor Signed-off-by: shuiyisong <xixing.sys@gmail.com> * refactor: use suffix as group key Signed-off-by: shuiyisong <xixing.sys@gmail.com> --------- Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-04 09:03:20 +00:00
Ruihang Xia	edd563d2e1	feat: join simplifier for promql binary op (#8211 ) * basic impl Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * refactor, remove shallow methods Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * more sqlness cases Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * clear ctx Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * prefix leaf aliase Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-06-04 08:13:24 +00:00
Ruihang Xia	a6cb4d2083	feat: look up cache with range calculation (#8123 ) * feat: loop up cache with range calculation Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * copy fragements Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * release dashmap reference Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix bugs Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove dashmap Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove keys Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * clean up Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-06-04 07:41:49 +00:00
Weny Xu	3bd915865a	fix: improve remote WAL replay checkpoint handling (#8225 ) * fix: correct topic_latest_entry_id Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: use pruned wal id for replay checkpoint Signed-off-by: WenyXu <wenymedia@gmail.com> * feat: persist remote wal checkpoints periodically Signed-off-by: WenyXu <wenymedia@gmail.com> * test: cover remote wal topic latest entry id Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: merge pruned wal id for migration checkpoint Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-04 06:10:38 +00:00
Lei, HUANG	da9d314fa4	fix: align bulk insert schema handling (#8222 ) * fix: align bulk insert schema handling Carry `aligned_schema_version` through bulk insert requests so datanode can recheck stale schemas only when needed. Use the physical data region schema version for metric logical bulk inserts, while external bulk insert callers leave alignment unknown. Related files: - `Cargo.toml` - `Cargo.lock` - `src/store-api/src/region_request.rs` - `src/metric-engine/src/engine/bulk_insert.rs` - `src/mito2/src/request.rs` - `src/mito2/src/worker/handle_bulk_insert.rs` - `src/mito2/src/worker/handle_write.rs` - `src/operator/src/bulk_insert.rs` - `src/servers/src/pending_rows_batcher.rs` - `src/mito2/src/engine/edit_region_test.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * test: cover bulk insert missing columns Add Flight bulk insert coverage for batches that omit nullable columns and verify the database fills the missing values with nulls. Related files: - `tests-integration/src/grpc/flight.rs` Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * chore: update proto to main branch commit Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: align physical metric bulk insert schema Pass the physical schema version through metric bulk insert requests so physical and logical region writes share the same schema alignment behavior. Files: - \`src/metric-engine/src/engine/bulk_insert.rs\` — updates \`bulk_insert_physical_region\` and reuses \`physical_schema_version\` for bulk insert schema alignment. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * chore: add some doc Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-06-04 02:55:09 +00:00
discord9	d304df6e75	fix: run eval-interval flow without time window (#8231 ) * fix: run eval-interval flow without time window Signed-off-by: discord9 <discord9@163.com> * test: cover eval-interval flow join query Signed-off-by: discord9 <discord9@163.com> * fix: address eval interval flow review comments Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-06-03 13:06:22 +00:00
QuakeWang	ca07a53deb	feat: support CSV copy skip bad records (#8198 ) * feat: support CSV copy headers and skip bad records Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> * refactor: reuse CSV skippable error helper Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> * fix: address CSV skip bad records review Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> * docs: clarify CSV skip bad records performance tradeoff Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com> --------- Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com>	2026-06-03 12:11:02 +00:00
Weny Xu	31cdd864d2	fix: format detected IPv6 grpc address (#8221 ) Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-03 12:05:50 +00:00
shuiyisong	11f06709cd	chore: remove `HttpConfigurator` (#8224 ) Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-03 04:13:32 +00:00
jeremyhi	aaffe1a048	refactor(cli): split export-v2 verify into plan/scan/reconcile (#8206 ) * refactor(cli): split export-v2 verify into plan/scan/reconcile Signed-off-by: jeremyhi <fengjiachun@gmail.com> * fix: by AI comments Signed-off-by: jeremyhi <fengjiachun@gmail.com> --------- Signed-off-by: jeremyhi <fengjiachun@gmail.com>	2026-06-03 03:10:10 +00:00
shuiyisong	f3919a3117	chore: update vrl to 0.33 (#8219 ) * chore: update vrl to 0.33 Signed-off-by: shuiyisong <xixing.sys@gmail.com> * chore: reuse thread local runtime Signed-off-by: shuiyisong <xixing.sys@gmail.com> --------- Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-02 10:04:15 +00:00
discord9	7f76ad5439	feat: validate batching flow sink schema on create (#8176 ) * feat: check schema on create Signed-off-by: discord9 <discord9@163.com> * chore: update sqlness Signed-off-by: discord9 <discord9@163.com> * fix(flow): avoid duplicate fields when matching sink schema Signed-off-by: discord9 <discord9@163.com> * fix: null handling Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * chore: debug log Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-06-02 08:24:50 +00:00
Yingwen	50b1a07232	feat(datanode): hold query permit for stream and expose limiter timeout (#8215 ) * feat(datanode): hold query permit for stream and expose limiter timeout Signed-off-by: evenyag <realevenyag@gmail.com> * style(datanode): use as_mut() to poll inner stream in PermitGuardedStream Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-06-02 08:00:01 +00:00
Weny Xu	935ef9a361	feat: check open region requirements (#8194 ) * feat: check open region capabilities Signed-off-by: WenyXu <wenymedia@gmail.com> * test: allow file region migration tests Signed-off-by: WenyXu <wenymedia@gmail.com> * refactor: refine open region requirements Signed-off-by: WenyXu <wenymedia@gmail.com> * refactor: use fs scheme constant Signed-off-by: WenyXu <wenymedia@gmail.com> * test: cover open region requirement predicate Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: check file engine open requirements Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-02 06:43:23 +00:00
Ning Sun	c1b0418377	refactor: give instance to start plugin functions (#8208 ) * refactor: give instance to start plugin functions * refactor: remove start_standalone_plugins because it's never used * chore: make sure code is nicely formatted	2026-06-02 04:11:41 +00:00
LFC	7f6400e460	refactor: optimize compaction's pick time (#8197 ) refactor: optimize compaction's pick Signed-off-by: luofucong <luofc@foxmail.com>	2026-06-02 03:14:17 +00:00
dennis zhuang	f25ab67af1	feat: table semantic layer identity (Phase 1) (#8210 ) * feat: table semantic layer identity (Phase 1) Attach a thin layer of semantic metadata to ingested tables via the existing `table_options` slot, so machine consumers (LLM agents, alert/dashboard builders, MCP servers, ETL) can align a table with the observability concept it stands for without guessing from column names. See docs/rfcs/2026-05-28-table-semantic-layer.md. Phase 1 (identity) only: - New `table::requests::semantic` module: the `greptime.semantic.` vocabulary (signal/source/source_version/pipeline + trace/metric/log/resource-scope keys, defined now, populated by later phases), value constants, the internal `greptime.internal.semantic.per_table_index` transport key (reserved for Phase 2, deliberately outside the public namespace), and `is_semantic_option_key`. - `validate_table_option` accepts the `greptime.semantic.` prefix, so the keys are valid both on the auto-create path and on explicit `CREATE TABLE ... WITH (...)`. - `fill_table_options_for_create` copies every semantic ctx extension into the new table's options (prefix passthrough alongside the fixed allowlist). - Frontend stamps identity on the context at each ingest entry: OTLP metrics (metric/opentelemetry), traces (+pipeline, has_events/has_links/conventions for the v1 model), logs (log/opentelemetry), and Prometheus remote write (metric/prometheus, metadata_quality=inferred). OTLP metric metadata_quality is left for Phase 2 (declared). - Trace identity is stamped only on the main span table; the derived `_services` / `_operations` lookup tables keep the unstamped context and carry no semantic identity (cross-table relationships are out of scope). Semantic options appear in SHOW CREATE TABLE (like table_data_model / otlp_metric_compat) and in information_schema, so an LLM inspecting a table sees its semantics directly. Tests: unit (validation prefix + internal-key rejection, ctx passthrough) and integration assertions that the common keys land for OTLP metrics (metric-engine logical table), traces, logs, and Prometheus remote write; SHOW CREATE goldens updated. Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * fix: prom batcher not cover and white list for semantic keys/values Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * fix: typo Signed-off-by: Dennis Zhuang <killme2008@gmail.com> --------- Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-06-02 01:56:29 +00:00
Weny Xu	6e6ea4c0c7	test: fuzz unpartitioned repartition (#8195 ) * test: fuzz unpartitioned repartition Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: invalidate table info cache Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: set max actions 128 Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: use InitStrategy::VersionChecked for table info cache Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: refine logs Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-06-01 09:45:31 +00:00
shuiyisong	a796dbf1a0	chore: update opendal to 0.57 (#8204 ) * chore: update opendal to 0.57 Signed-off-by: shuiyisong <xixing.sys@gmail.com> * chore: remove duplicate compat to use opendal's Signed-off-by: shuiyisong <xixing.sys@gmail.com> --------- Signed-off-by: shuiyisong <xixing.sys@gmail.com>	2026-06-01 07:45:44 +00:00
LFC	33ffa41ac2	refactor: use last compaction scheduled time instead of finished time (#8190 ) * refactor: use last compaction scheduled time instead of finished time Signed-off-by: luofucong <luofc@foxmail.com> * add some logs for debugging Signed-off-by: luofucong <luofc@foxmail.com> --------- Signed-off-by: luofucong <luofc@foxmail.com>	2026-06-01 07:45:38 +00:00
discord9	28fd796f4e	fix(flow): harden incremental read correctness (#8196 ) * fix(flow): harden incremental read correctness Signed-off-by: discord9 <discord9@163.com> * fix(flow): propagate dirty window options Signed-off-by: discord9 <discord9@163.com> * test: more Signed-off-by: discord9 <discord9@163.com> * chore: test config api Signed-off-by: discord9 <discord9@163.com> * refactor: split gen Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * fix: allowlist key Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-06-01 02:48:00 +00:00
dennis zhuang	ed9312f8e3	feat: global switch for creating tables automatically (#8203 ) * feat: global switch for creating table automatically Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * chore: make auto_create_table as comment by default Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * feat: respect gloabl switch for metric engine Signed-off-by: Dennis Zhuang <killme2008@gmail.com> --------- Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-05-31 23:51:14 +00:00
LFC	869a584f8a	ci: add nightly jsonbench test (#7750 ) Signed-off-by: luofucong <luofc@foxmail.com>	2026-05-29 07:07:49 +00:00
discord9	ba15a9c056	feat: support pending flow metadata with defer_on_missing_source (#8124 ) * feat: support defer_on_missing_source for pending flow creation Add `defer_on_missing_source` flow option that allows creating flows even when source tables do not yet exist. The flow enters a pending state and is automatically activated when source tables become available. Key changes: - New `FlowStatus::PendingSources` and fields in `FlowInfoValue` for unresolved source table names and last activation error - `defer_on_missing_source` create-time-only option: stripped from runtime/flownode `CreateRequest` but preserved in metadata for SQL round-trip (`SHOW CREATE FLOW`, `information_schema.flows`) - `CreateFlowProcedure` creates pending metadata when sources are missing and `defer_on_missing_source=true`; falls back to `FlowType::Batching` for missing-source flows - `PendingFlowReconcileManager` in meta-srv periodically checks pending flows and activates them when source tables resolve - `ActivatePendingFlowProcedure` handles activation: allocates peers, creates flows on flownodes, updates metadata, invalidates cache - `OR REPLACE` properly handles pending<->active transitions, including peer allocation and flownode flow teardown - `FlowMetadataAllocator::alloc_peers` for peer allocation at activation time - Validated flow options: only `defer_on_missing_source` allowed; unknown options rejected - Known issue: standalone mode does not support flownodes, so pending flow flush/sink behavior covered only in distributed sqlness; operator and meta unit tests cover activation logic Tests: - operator `determine_flow_type_for_source_state` (3 passed) - common-meta `create_flow` (19 passed) including replacement - common-meta `activate_flow` (4 passed) - meta-srv `flow` (11 passed) - sqlness: `flow_pending` covers create/replace/round-trip Signed-off-by: discord9 <discord9@163.com> * chore: simplify pending flow PR scope Reduce PR #8124 to the metadata-only MVP after complexity review. Changes: - Remove automatic activation procedure and meta-srv reconcile wiring - Remove activation tests and activation-only metadata fields - Reject cross-state pending<->active `OR REPLACE` transitions for now - Keep pending metadata creation and SQL round-trip behavior - Allow `DROP FLOW` for pending flows without routes - Reduce flow_pending sqlness to metadata/round-trip/drop coverage only Deferred follow-ups are documented locally in `.tmp/tasks/pending-defer-semantics/deferred-followups.md` and intentionally not committed. Tests: - `cargo test -p operator determine_flow_type_for_source_state` - `cargo test -p common-meta create_flow` - `cargo test -p common-meta drop_flow` - `cargo sqlness bare --test-filter flow_pending --bins-dir /mnt/nvme_rust/rust-targets/pending_defer/debug` Signed-off-by: discord9 <discord9@163.com> * test: cover pending flow metadata edge cases Signed-off-by: discord9 <discord9@163.com> * test: fix pending flow metadata test lint Signed-off-by: discord9 <discord9@163.com> * docs: document pending flow metadata fields Signed-off-by: discord9 <discord9@163.com> * chore: more sleep when test Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-05-29 06:59:21 +00:00
Ruihang Xia	b5f56b92b4	fix: classify InsertInto as write request (#8200 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-05-29 04:06:00 +00:00

1 2 3 4 5 ...

4598 Commits