greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-07-04 21:10:37 +00:00

Author	SHA1	Message	Date
Yingwen	e215851c8a	refactor: unify flush and compaction to always use FlatSource (#7799 ) * feat: support write flat as primary key format Signed-off-by: evenyag <realevenyag@gmail.com> * feat: migrate flush to always use FlatSource Add FormatType propagation in SstWriteRequest and use it to choose Flat vs PrimaryKey write paths (write_all_flat vs write_all_flat_as_primary_key) in AccessLayer and WriteCache. Make compactor and flush derive the sst_write_format from region options or engine config. Simplify flush logic and remove the old memtable_source helper. Update tests to set default sst_write_format. Signed-off-by: evenyag <realevenyag@gmail.com> * refactor: compaction use flat source Signed-off-by: evenyag <realevenyag@gmail.com> * refactor: read parquet sequentially as flat batches Signed-off-by: evenyag <realevenyag@gmail.com> * refactor: remove new_batch_with_binary in favor of new_record_batch_with_binary Replace PrimaryKeyWriteFormat with FlatWriteFormat in test_read_large_binary test and use new_record_batch_with_binary directly, removing the now-unused new_batch_with_binary function and its BinaryArray import. Signed-off-by: evenyag <realevenyag@gmail.com> * test: add tests for PrimaryKeyWriteFormat::convert_flat_batch Signed-off-by: evenyag <realevenyag@gmail.com> * refactor: remove Either from SstWriteRequest Signed-off-by: evenyag <realevenyag@gmail.com> * fix: handle index build mode Signed-off-by: evenyag <realevenyag@gmail.com> * fix: consider sparse encoding and last non null in flush Signed-off-by: evenyag <realevenyag@gmail.com> * test: add unit tests for field_column_start edge cases Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-03-13 09:44:13 +00:00
LFC	74ff5c37ea	refactor: customize standalone instance build (#7807 ) * refactor: customize standalone instance build Signed-off-by: luofucong <luofc@foxmail.com> * resolve PR comments Signed-off-by: luofucong <luofc@foxmail.com> --------- Signed-off-by: luofucong <luofc@foxmail.com>	2026-03-13 09:25:21 +00:00
Weny Xu	20f38d8a6a	test(fuzz): add metric table repartition fuzz target (#7754 ) * test: add fuzz_repartition_metric_table target scaffold Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add metric logical lifecycle in repartition fuzz target Signed-off-by: WenyXu <wenymedia@gmail.com> * test: support partitioned metric tables in repartition fuzz Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add repartition loop and partition assertions for metric target Signed-off-by: WenyXu <wenymedia@gmail.com> * test: use shared timestamp clock in metric repartition writes Signed-off-by: WenyXu <wenymedia@gmail.com> * refactor: unify string value and bound generation for fuzzing Signed-off-by: WenyXu <wenymedia@gmail.com> * test: use fixed physical table name in metric repartition fuzz Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: fmt Signed-off-by: WenyXu <wenymedia@gmail.com> * ci: update ci config Signed-off-by: WenyXu <wenymedia@gmail.com> * refactor: use btreemap Signed-off-by: WenyXu <wenymedia@gmail.com> * print count result Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add csv translator for insert expr Introduce a dedicated top-level csv translator so fuzz insert expressions can be converted into writer-ready records through a structured path instead of ad-hoc formatting in targets. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add csv dump session utilities Introduce CSV dump env helpers and a session writer that creates run directories, emits seed metadata, and flushes staged CSV records for fuzz workflows. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: bound csv dump buffer with auto flush Parse readable buffer sizes from env and flush staged CSV records automatically when the in-memory threshold is reached to prevent unbounded growth during long fuzz runs. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: flush csv dump before repartition validation Wire csv dump session into the metric repartition fuzz flow so successful inserts are translated from insert expressions into CSV records during write loops and flushed to disk right before row validation. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: keep csv dumps on failure and cleanup on pass Capture run outcomes in metric repartition fuzz, remove dump directories only after successful validation, and retain dump paths on failures so CI and local investigations can use the same artifacts. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: align partial csv records with table headers Keep append payload compact by storing partial insert-expression columns, then expand to full table-context headers at flush time and fill missing values with empty strings. Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: add logs Signed-off-by: WenyXu <wenymedia@gmail.com> * dump csv Signed-off-by: WenyXu <wenymedia@gmail.com> * ci: dump csv Signed-off-by: WenyXu <wenymedia@gmail.com> * refactor Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add table-scoped sql dump writer primitives Signed-off-by: WenyXu <wenymedia@gmail.com> * test: capture table-scoped sql traces after execution Record insert and repartition SQL only after successful execution, include started_at_ms and elapsed_ms in trace comments, and broadcast repartition events into every logical-table trace file for consistent debugging context. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: harden sql trace comments and include create sql Normalize multiline trace comments into valid SQL comment lines and append logical-table CREATE SQL to per-table traces for better timeline reconstruction during repartition debugging. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: dump physical create and repartition SQL traces Signed-off-by: WenyXu <wenymedia@gmail.com> * dump repartition sql Signed-off-by: WenyXu <wenymedia@gmail.com> * test: scaffold writer control channel for barrier flow Add Barrier/Resume/Stop control skeleton and channel wiring in write_loop to prepare per-repartition validation barriers. Also align SQL dump tests with broadcast SQL payload behavior. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: implement writer barrier pause and resume control Make writer control messages effective by pausing writes on barrier, resuming on resume, and stopping via channel signaling so the next commit can enforce deterministic per-repartition validation boundaries. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: validate rows after each repartition barrier Add per-action barrier/ack synchronization with timeout, run immediate logical-table row validation after each repartition, and resume writer only after validation completes to improve minimal failure localization. Signed-off-by: WenyXu <wenymedia@gmail.com> * test: flush dump sessions before per-epoch validation Extract a shared flush-and-snapshot helper and call it before each immediate row validation so CSV/SQL artifacts are persisted at the same epoch boundary being validated. Signed-off-by: WenyXu <wenymedia@gmail.com> * fix: fix unit tests Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: add retry Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions from CR Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-13 08:00:09 +00:00
dependabot[bot]	37105c8354	chore(deps): bump quinn-proto from 0.11.12 to 0.11.14 (#7805 ) Bumps [quinn-proto](https://github.com/quinn-rs/quinn) from 0.11.12 to 0.11.14. - [Release notes](https://github.com/quinn-rs/quinn/releases) - [Commits](https://github.com/quinn-rs/quinn/compare/quinn-proto-0.11.12...quinn-proto-0.11.14) --- updated-dependencies: - dependency-name: quinn-proto dependency-version: 0.11.14 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-13 06:28:58 +00:00
dennis zhuang	0572a680af	fix: allow empty string for env values (#7803 ) * fix: allow empty string for env values Signed-off-by: Dennis Zhuang <killme2008@gmail.com> * chore: strip suffix Signed-off-by: Dennis Zhuang <killme2008@gmail.com> --------- Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-03-13 03:57:08 +00:00
Ning Sun	3cdf03d830	feat: introduce APIs for storing perses dashboard definition (#7791 ) * feat: introduce APIs for storing perses dashboard definition * test: ensure we can update dashboard * refactor: construct dashboard defnition directly * refactor: don't create table on list requests	2026-03-13 03:40:04 +00:00
discord9	3beb538aa8	fix: rm useless analyzer (#7797 ) * fix: rm useless analyzer Signed-off-by: discord9 <discord9@163.com> * test: rm related test Signed-off-by: discord9 <discord9@163.com> * test: flow tql avg Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-03-12 10:53:47 +00:00
Chengjie Jin	7866132920	feat(procedure): detect potential deadlock when parent/child procedures share lock keys (#7752 ) * feat(procedure): detect potential deadlock when parent/child share lock keys Add a deadlock detection mechanism in submit_subprocedure() to warn when a child procedure's lock_key overlaps with its parent's lock_key. When this happens, the parent holds the lock while waiting for the child to complete (at child_notify.notified().await), but the child blocks forever trying to acquire the same lock. This is a classic Hold-and-Wait deadlock. The detection: - Emits a warn! log in all builds (visible in production) - Triggers debug_assert!(false) in debug/test builds for early CI detection This partially addresses the TODO at line 121-122 and is a follow-up to the discussion in: https://github.com/GreptimeTeam/greptimedb/issues/7692 Signed-off-by: YZL0v3ZZ <2055877225@qq.com> * style: fix trailing whitespace Signed-off-by: YZL0v3ZZ <2055877225@qq.com> * refactor(procedure): extract deadlock detection into a testable pure function Signed-off-by: YZL0v3ZZ <2055877225@qq.com> * fix(procedure): preserve lock mode when detecting parent/child deadlock Signed-off-by: YZL0v3ZZ <2055877225@qq.com> * re-run ci check Signed-off-by: YZL0v3ZZ <2055877225@qq.com> --------- Signed-off-by: YZL0v3ZZ <2055877225@qq.com>	2026-03-12 09:33:55 +00:00
Ning Sun	28f97191a0	fix: make pipeline table ttl forever (#7795 ) * fix: make pipeline table ttl forever * chore: use constants when possible	2026-03-12 02:15:25 +00:00
yihong	aefe758be4	docs: update year to 2026 (#7787 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2026-03-11 07:29:35 +00:00
Weny Xu	f8376fd622	chore: bump version rc.2 (#7788 ) Signed-off-by: WenyXu <wenymedia@gmail.com> v1.0.0-rc.2	2026-03-11 06:57:07 +00:00
discord9	922f9cb3d6	feat: use dyn filter (#7545 ) * parent `b2074e3863` author discord9 <discord9@163.com> 1767869295 +0800 committer discord9 <discord9@163.com> 1772529023 +0800 feat: use dyn filter Signed-off-by: discord9 <discord9@163.com> not supported Signed-off-by: discord9 <discord9@163.com> refactor: use make_mut instead Signed-off-by: discord9 <discord9@163.com> refactor: rm need to clone stream ctx Signed-off-by: discord9 <discord9@163.com> r Signed-off-by: discord9 <discord9@163.com> pcr Signed-off-by: discord9 <discord9@163.com> test: wait for datafusion update Signed-off-by: discord9 <discord9@163.com> refactor: use arc swap for dyn filters Signed-off-by: discord9 <discord9@163.com> * test: update sqlness Signed-off-by: discord9 <discord9@163.com> * chore: comment out sqlness Signed-off-by: discord9 <discord9@163.com> * test: update sqlness Signed-off-by: discord9 <discord9@163.com> * test: sqlness fix Signed-off-by: discord9 <discord9@163.com> * refactor: predicate without option Signed-off-by: discord9 <discord9@163.com> * feat: print dyn filters& more tests Signed-off-by: discord9 <discord9@163.com> * test: sqlness vector result update Signed-off-by: discord9 <discord9@163.com> * chore: log Signed-off-by: discord9 <discord9@163.com> * test: properly redact Signed-off-by: discord9 <discord9@163.com> * test: better data dist for non empty dyn filter Signed-off-by: discord9 <discord9@163.com> * test: properly redacted Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * properly redact Signed-off-by: discord9 <discord9@163.com> * docs: explain why not do it Signed-off-by: discord9 <discord9@163.com> * chore: rename update to add as its more proper Signed-off-by: discord9 <discord9@163.com> * chore: rm no need clone Signed-off-by: discord9 <discord9@163.com> * docs: per review Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-03-11 03:06:24 +00:00
Ruihang Xia	9e95214fc8	feat: replace shadow-rs with self-maintained version info (#7782 ) * reimplement shadow-rs Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: remove timestamp build metadata * fix: refresh version build metadata * use git2 Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * warn about git failure Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-03-10 21:08:26 +00:00
Weny Xu	b1b91e88f4	fix(mito2): ensure enter staging waits for compaction (#7776 ) * fix: do not schedule compaction Signed-off-by: WenyXu <wenymedia@gmail.com> * mito2: add pending ddl queue primitives to compaction scheduler Signed-off-by: WenyXu <wenymedia@gmail.com> * mito2: hand off pending ddls on compaction finish Signed-off-by: WenyXu <wenymedia@gmail.com> * mito2: defer enter staging via compaction pending ddl queue Signed-off-by: WenyXu <wenymedia@gmail.com> * mito2: cover pending-ddl failure paths on region lifecycle events Signed-off-by: WenyXu <wenymedia@gmail.com> * mito2: replay pending ddls directly in compaction handler Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: styling Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: styling Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions from CR Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add unit test Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: add unit tests Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions from CR Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-10 13:49:40 +00:00
Yingwen	04cd2c8a05	feat: flat read path support primary_key format memtables (#7759 ) * feat: add adapter for batch to flat recordbatch Signed-off-by: evenyag <realevenyag@gmail.com> * feat: support batch to flat record batch in MemtableRange Signed-off-by: evenyag <realevenyag@gmail.com> * refactor: address review issues for BatchToRecordBatchAdapter - Extract duplicated read_column_ids computation into a shared `read_column_ids_from_projection` helper function - Cache `FormatProjection` in `BatchToRecordBatchContext::new()` instead of recomputing it on every `adapt_iter()` call - Remove unnecessary `Arc` wrapping of `read_column_ids` in `SimpleBulkMemtable::ranges()` - Fix clippy `filter_map_bool_then` warning in `batch_adapter.rs` Signed-off-by: evenyag <realevenyag@gmail.com> * chore: simplify comments Signed-off-by: evenyag <realevenyag@gmail.com> * refactor(mito2): use read column ids in batch adapter Signed-off-by: evenyag <realevenyag@gmail.com> * test: test build_record_batch_iter Signed-off-by: evenyag <realevenyag@gmail.com> * chore: fmt code Signed-off-by: evenyag <realevenyag@gmail.com> * test: test build_record_batch_iter for all old memtables Signed-off-by: evenyag <realevenyag@gmail.com> * chore: address comment Signed-off-by: evenyag <realevenyag@gmail.com> * fix: prune time range before adapter Signed-off-by: evenyag <realevenyag@gmail.com> * chore: share BatchToRecordBatchContext in simple_bulk_memtable.rs Signed-off-by: evenyag <realevenyag@gmail.com> * chore: use ScalarValue::to_array_of_size to build repeated value array Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-03-10 12:46:39 +00:00
Ruihang Xia	58528d1334	feat: fast path for empty selection files (#7780 ) * perf(mito2): skip reader context for empty selections * refactor(mito2): make parquet reader input optional	2026-03-10 12:22:03 +00:00
Lei, HUANG	9c98a3d5f6	refactor: remote write related code (#7775 ) * refactor/prom-related-code: ### Commit Message Refactor Byte Handling and Improve Decoding Logic - `prom_decode.rs`: Removed `Bytes` usage in favor of `Vec<u8>` for handling raw data, improving memory management and simplifying the decoding process. - `prom_store.rs`: Updated `try_decompress` function to return `Vec<u8>` instead of `Bytes`, aligning with the new data handling approach. - `prom_row_builder.rs`: Modified `TablesBuilder` to use `Vec<u8>` for `raw_data`, enhancing data manipulation capabilities. - `proto.rs`: Refactored `PromWriteRequest` decoding logic to use `Vec<u8>`, optimizing the buffer management and decoding flow. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor: mod structure Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor/prom-related-code: - Refactor `prom_store.rs` and `prom_remote_write/mod.rs`: Moved `decode_remote_write_request` and `try_decompress` functions from `prom_store.rs` to `prom_remote_write/mod.rs`. This change centralizes the logic related to remote write request decoding and decompression. - Update `PromValidationMode` in `validation.rs`: Implemented `Default` trait using the `#[derive(Default)]` attribute for `PromValidationMode` and updated related methods to use `Result` instead of `std::result::Result`. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor/prom-related-code: ### Remove `proto.rs` and Update References - Removed: Deleted the `proto.rs` file, which contained re-exports for Prometheus remote write decode types. - Updated References: Adjusted references to `PromSeriesProcessor` and `PromWriteRequest` in `prom_decode.rs` and `prom_store.rs` to import directly from `prom_remote_write`. - Modified Modules: Removed the `proto` module from `lib.rs`. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: lint Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: remove assert_eq Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * refactor/prom-related-code: ### Refactor Prometheus Remote Write Module - Modularization of `prom_remote_write`: - Split `PromValidationMode` and `validate_label_name` into a new `validation` module. - Moved `PromSeriesProcessor` and `PromWriteRequest` to a `decode` module. - Separated `PromLabel` into a `types` module and adjusted visibility. - Visibility Adjustments: - Changed `PromTimeSeries` and `PromLabel` structs to `pub(crate)` for internal use. - File Updates: - Updated references in `prom_decode.rs`, `http.rs`, `prom_store.rs`, `decode.rs`, `mod.rs`, `row_builder.rs`, `types.rs`, `prom_store_test.rs`, and `test_util.rs` to reflect module changes. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-03-10 12:19:08 +00:00
Ning Sun	2ff70fb3b6	chore: mark datafusion minor version as fixed (#7769 )	2026-03-10 00:02:36 +00:00
Ning Sun	f05c48d7de	chore: update oneshot (#7770 )	2026-03-09 18:44:44 +00:00
Weny Xu	4c2f056f93	fix(meta): remap route addresses when reading full table info (#7778 ) * refactor(meta): expose crate-level table route address remap helper * fix(meta): remap table route address in get_full_table_info * test(meta): cover get_full_table_info route address remap	2026-03-09 18:24:59 +00:00
Weny Xu	3e81345d7f	fix(mito2): avoid parquet redownload when write cache already contains file (#7777 ) * feat: add idempotent write cache download API * fix: skip parquet redownload in manifest edit path * test: cover download_if_absent cache hit and miss * chore: add comments Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-09 12:16:53 +00:00
Yingwen	4232ce8eaa	test: fix unstable index meta list test (#7774 ) * test: fix unstable index meta list test Signed-off-by: evenyag <realevenyag@gmail.com> * fix: raise bucket_size threshold to avoid bucketing sizes in [512, 999] to 0 Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-03-09 07:57:43 +00:00
dependabot[bot]	4dfad21883	chore(deps): bump time from 0.3.41 to 0.3.47 (#7772 ) Bumps [time](https://github.com/time-rs/time) from 0.3.41 to 0.3.47. - [Release notes](https://github.com/time-rs/time/releases) - [Changelog](https://github.com/time-rs/time/blob/main/CHANGELOG.md) - [Commits](https://github.com/time-rs/time/compare/v0.3.41...v0.3.47) --- updated-dependencies: - dependency-name: time dependency-version: 0.3.47 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-09 06:54:58 +00:00
dependabot[bot]	792b7509c3	chore(deps): bump rsa from 0.9.8 to 0.9.10 (#7771 ) Bumps [rsa](https://github.com/RustCrypto/RSA) from 0.9.8 to 0.9.10. - [Changelog](https://github.com/RustCrypto/RSA/blob/v0.9.10/CHANGELOG.md) - [Commits](https://github.com/RustCrypto/RSA/compare/v0.9.8...v0.9.10) --- updated-dependencies: - dependency-name: rsa dependency-version: 0.9.10 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-09 06:09:48 +00:00
LFC	167e60601a	chore: update dependency "tracing-subscriber" (#7766 ) Signed-off-by: luofucong <luofc@foxmail.com>	2026-03-09 02:57:38 +00:00
Ning Sun	9b1288f66a	feat: add function rewrite rule for json_get with cast (#7631 ) * feat: initial function rewriter for json_get * feat: make sure rewrite rule is applied * feat: keep analyzer's default rules * feat: implement rewriter for arrow_cast * test: add unit test for tht rewriter * chore: format * refactor: extract some more functions * Apply suggestion from @waynexia Co-authored-by: Ruihang Xia <waynestxia@gmail.com> --------- Co-authored-by: Ruihang Xia <waynestxia@gmail.com>	2026-03-09 02:39:48 +00:00
ZonaHe	44ab4c6857	feat: update dashboard to v0.11.13 (#7763 ) Co-authored-by: sunchanglong <sunchanglong@users.noreply.github.com>	2026-03-09 02:33:02 +00:00
Ruihang Xia	a71df9477a	perf(mito2): speed up parquet scan via minmax caches (#7708 ) * perf(mito2): speed up parquet scan via meta caches * fix(mito2): fix parquet pruning and metadata cache * revert config changes Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * enhance cache file enter logic Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * resolve tiny cr comments Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * only preload from fs or cache Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix vector test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-03-07 06:32:47 +00:00
Yingwen	93c48a078c	feat: implement last row cache reader for flat format (#7757 ) * feat: initial implementation Signed-off-by: evenyag <realevenyag@gmail.com> * fix: handle multiple series Signed-off-by: evenyag <realevenyag@gmail.com> * fix: reset state in finish() Signed-off-by: evenyag <realevenyag@gmail.com> * fix: handle duplicated last timestamps across batches Signed-off-by: evenyag <realevenyag@gmail.com> * perf: compact primary key array Signed-off-by: evenyag <realevenyag@gmail.com> * fix(mito2): simplify flat last timestamp selector state Signed-off-by: evenyag <realevenyag@gmail.com> * refactor(mito2): rebuild flat pk dictionary from selector state Signed-off-by: evenyag <realevenyag@gmail.com> * test: reduce tests Signed-off-by: evenyag <realevenyag@gmail.com> * chore: update comment Signed-off-by: evenyag <realevenyag@gmail.com> * chore: more logs to debug Signed-off-by: evenyag <realevenyag@gmail.com> * feat: concat batches in last row reader Signed-off-by: evenyag <realevenyag@gmail.com> * refactor(mito2): simplify flat last row selector output buffer - Replace VecDeque with BatchBuffer struct for output buffering - Remove rebuild_pk_dictionary_for_key as batches go directly into buffer - Remove unused push method and make BatchBuffer pub(crate) - Remove debug logging in maybe_update_cache Signed-off-by: evenyag <realevenyag@gmail.com> * chore: address comments Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-03-06 12:08:58 +00:00
Lei, HUANG	2000611ea1	perf(remote-write): optimize decode prom (#7761 ) * perf: optimize Prometheus label name decoding with byte-level validation Add `decode_label_name` and `validate_label_name` to skip redundant UTF-8 validation for Prometheus label names, which are guaranteed ASCII (`[a-zA-Z_][a-zA-Z0-9_]`). Rename `validate_bytes` to `validate_utf8` for clarity and add benchmarks for label name validation and UTF-8 validation (std vs simdutf8). Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> perf(servers): optimize validate_label_name with lookup table and loop unrolling Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf/decode-prom-2: - Refactor UTF-8 Validation and Label Decoding: - Removed `validate_utf8` method and integrated label name validation directly in `decode_label_name` in `http.rs`. - Updated `decode_label_name` to always enforce Prometheus label name validation across all modes. - Adjusted test cases in `http.rs` to reflect the new validation logic. - Enhance Label Validation in `prom_row_builder.rs`: - Replaced UTF-8 validation with direct label name validation using `validate_label_name`. - Updated `decode_label_name` usage to return `&str` and adjusted related logic. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf/decode-prom-2: Refactor `TableBuilder` to Use `RawBytes` for Column Indexes - Updated `TableBuilder` in `prom_row_builder.rs` to use `RawBytes` instead of `Vec<u8>` for `col_indexes`. - Modified `with_capacity` method to directly insert `RawBytes` for timestamp and value columns. - Adjusted schema handling to use `to_owned` for `tag_name` and directly insert `raw_tag_name` into `col_indexes`. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf/decode-prom-2: ### Commit Message Refactor `PromWriteRequest` Method and Enhance Data Handling - Refactor Method: Renamed the `merge` method to `decode` in `PromWriteRequest` to better reflect its functionality. Updated references in `prom_decode.rs`, `prom_store.rs`, and `prom_row_builder.rs`. - Enhance Data Handling: Introduced `raw_data` field in `PromWriteRequest` to store a clone of the buffer for potential future use. Updated the `clear` method to reset `raw_data`. Files affected: `prom_decode.rs`, `prom_store.rs`, `prom_row_builder.rs`, `proto.rs`. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf/decode-prom-2: Commit Summary: - Enhancement in `prom_row_builder.rs`: - Added a new field `raw_data` of type `Bytes` to `TablesBuilder`. - Implemented `set_raw_data` method to update `raw_data`. - Modified `clear` method to reset `raw_data`. - Refactor in `proto.rs`: - Removed `raw_data` field from `PromWriteRequest`. - Updated `decode_and_process` method to use `set_raw_data` from `TablesBuilder` for handling raw data. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * chore: remove duplicated validation Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf/decode-prom-2: ### Commit Message Refactor `TablesBuilder` and `TableBuilder` to Use Lifetime Annotations - Updated `prom_store.rs`: - Modified `PROM_WRITE_REQUEST_POOL` and `decode_remote_write_request` to use lifetime annotations for `PromWriteRequest` and `TablesBuilder`. - Updated `prom_row_builder.rs`: - Refactored `TablesBuilder` and `TableBuilder` structs to include lifetime annotations. - Adjusted methods in `TablesBuilder` and `TableBuilder` to accommodate lifetime changes. - Updated `proto.rs`: - Added lifetime annotations to `PromWriteRequest` and its methods. - Modified `add_to_table_data` to use lifetime annotations for `TablesBuilder`. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * chore: fmt Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-03-06 10:23:11 +00:00
Yingwen	33acbf985d	fix: Allow overriding sequence when writing flat SSTs (#7764 ) Signed-off-by: evenyag <realevenyag@gmail.com>	2026-03-06 09:43:58 +00:00
dennis zhuang	0ca3496fd3	docs: add o11y 2.0 link and .net ingester sdk (#7765 ) Signed-off-by: Dennis Zhuang <killme2008@gmail.com>	2026-03-06 09:41:59 +00:00
discord9	56ee8baa3f	feat: admin gc table/regions (#7619 ) * feat: gc table Signed-off-by: discord9 <discord9@163.com> * test: admin gc Signed-off-by: discord9 <discord9@163.com> * chore: after rebase fix Signed-off-by: discord9 <discord9@163.com> * refactor: GcStats Signed-off-by: discord9 <discord9@163.com> * refactor: use gc ticker for admin gc Signed-off-by: discord9 <discord9@163.com> * fix: region routes override Signed-off-by: discord9 <discord9@163.com> * test: non happy path Signed-off-by: discord9 <discord9@163.com> * refactor: gc job report enum Signed-off-by: discord9 <discord9@163.com> * test: process 0 regions Signed-off-by: discord9 <discord9@163.com> * after rebase Signed-off-by: discord9 <discord9@163.com> * feat: allow manual gc to return error Signed-off-by: discord9 <discord9@163.com> * chore: update proto Signed-off-by: discord9 <discord9@163.com> * per review Signed-off-by: discord9 <discord9@163.com> * chore: timeout and update proto Signed-off-by: discord9 <discord9@163.com> * chore: udpate proto Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-03-06 08:25:44 +00:00
discord9	5e6d2b221e	feat: gc batch delete files (#7733 ) * feat: batch delete Signed-off-by: discord9 <discord9@163.com> * chore: explict error msg Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * refactor: not fallback when batch failure Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * pcr Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * refactor: batch delete in access layer Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-03-06 07:17:20 +00:00
discord9	d1151b665b	feat: flow tql cte (#7702 ) * feat: flow tql cte Signed-off-by: discord9 <discord9@163.com> * fix: creating flow TQL CTE source tables lose cte part in query Signed-off-by: discord9 <discord9@163.com> * test: update sqlness result Signed-off-by: discord9 <discord9@163.com> * chore Signed-off-by: discord9 <discord9@163.com> * fix: properly canonicalize ident Signed-off-by: discord9 <discord9@163.com> * feat: even stricter check Signed-off-by: discord9 <discord9@163.com> * chore: sqlness update Signed-off-by: discord9 <discord9@163.com> * chore: after rebase fix Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-03-06 03:36:42 +00:00
discord9	4c30b9efaf	fix: null first for part expr as logical expr (#7747 ) * fix: null first for part expr as logical expr Signed-off-by: discord9 <discord9@163.com> * test: update tests Signed-off-by: discord9 <discord9@163.com> * chore: per review Signed-off-by: discord9 <discord9@163.com> * fix: nulll handle&non-null filter Signed-off-by: discord9 <discord9@163.com> * chore: doc test Signed-off-by: discord9 <discord9@163.com> --------- Signed-off-by: discord9 <discord9@163.com>	2026-03-06 02:53:05 +00:00
LFC	f544b02408	chore: update dependency "keccak" (#7760 ) Signed-off-by: luofucong <luofc@foxmail.com>	2026-03-05 13:42:47 +00:00
Ruihang Xia	b4f0886f80	chore: grouping batch open region logs (#7758 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-03-05 13:39:29 +00:00
Ruihang Xia	39140058d0	feat: adapt new name of holt winters fn (#7700 ) * feat: adapt new name of holt winters fn Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * update parser Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * alias old fn name Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2026-03-04 23:32:37 +00:00
LFC	284d7a63d6	chore: remove dependency on "fast-float" (#7729 ) Signed-off-by: luofucong <luofc@foxmail.com>	2026-03-04 11:37:01 +00:00
Weny Xu	b17947b5d8	feat: add COPY support for GCS/Azblob with OSS/GCS/Azblob integration coverage (#7743 ) * feat: add gcs and azblob backends to copy datasource Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add copy to/from oss integration coverage Signed-off-by: WenyXu <wenymedia@gmail.com> * test: add copy to/from gcs/azure integration coverage Signed-off-by: WenyXu <wenymedia@gmail.com> * refactor: use existing layers Signed-off-by: WenyXu <wenymedia@gmail.com> * feat: hide `credential_path` in cli Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: remove logs Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-04 04:21:21 +00:00
Weny Xu	b1b76fde0b	test(fuzz): repartition validation and add dedicated CI GC profile (#7703 ) * test(fuzz): add concurrent write loop and partition-bound value generation for repartition validation Signed-off-by: WenyXu <wenymedia@gmail.com> * ci: run repartition fuzz target with dedicated local-wal GC config Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: fix typos Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: fix typo Signed-off-by: WenyXu <wenymedia@gmail.com> * count distinct timestamp value Signed-off-by: WenyXu <wenymedia@gmail.com> * chore: apply suggestions Signed-off-by: WenyXu <wenymedia@gmail.com> --------- Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-03 13:30:34 +00:00
fys	f17a334058	chore: upgrade arrow 57.2 to 57.3 (#7744 ) * chore: upgrade arrow 57.2 to 57.3 * upgrade Cargo.lock	2026-03-03 09:04:47 +00:00
Yingwen	5c8ece27e0	feat: improve filter support for scanbench (#7736 ) * feat: cast filters type for scanbench Signed-off-by: evenyag <realevenyag@gmail.com> * chore: pub file_range mod So we can use the pub struct FileRange in other places Signed-off-by: evenyag <realevenyag@gmail.com> * fix: add api as dev-dependency to cmd for clippy Signed-off-by: evenyag <realevenyag@gmail.com> * feat: support profiling after warmup Signed-off-by: evenyag <realevenyag@gmail.com> --------- Signed-off-by: evenyag <realevenyag@gmail.com>	2026-03-03 09:00:41 +00:00
LFC	b2074e3863	chore: upgrade DataFusion family, again (#7578 ) * chore: upgrade DataFusion family Signed-off-by: luofucong <luofc@foxmail.com> * chore: switch to released version of datafusion-pg-catalog --------- Signed-off-by: luofucong <luofc@foxmail.com> Co-authored-by: Ning Sun <sunning@greptime.com> Co-authored-by: Ning Sun <sunng@protonmail.com>	2026-03-03 07:36:39 +00:00
Weny Xu	aab839b6e4	ci: refactor monitor artifact export and include otel trace derived tables (#7742 ) Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-03 06:37:22 +00:00
Lei, HUANG	7ba23a999b	perf: prom decode (#7737 ) * perf/prom-decode: Refactor `PromLabel` to Use Raw Byte Slices - Updated `PromLabel` struct in `proto.rs` to use `RawBytes` for `name` and `value` fields, replacing `Bytes` with static byte slices. - Modified test cases in `prom_row_builder.rs` to accommodate changes in `PromLabel` by using byte literals. - Simplified `merge_bytes` function in `proto.rs` to directly assign byte slices, removing unnecessary memory operations. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * perf/prom-decode: - Add UTF-8 Validation: Introduced `validate_bytes` method in `http.rs` to validate UTF-8 encoding using `simdutf8` for `PromValidationMode::Strict`. - Update Column Indexing: Modified `prom_row_builder.rs` to use `Vec<u8>` for `col_indexes` keys, ensuring UTF-8 validation for label names. - Dependency Update: Added `simdutf8` version `0.1.5` to `Cargo.toml` and updated `Cargo.lock` to include this new dependency. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * fix: style issues Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-03-03 06:29:32 +00:00
Lei, HUANG	be78137165	chore(version): refresh build info on demand (#7738 ) * chore(version): refresh build info on demand Introduce a `refresh-build-info` feature to `common-version` to control whether build timestamps are updated. By default, timestamps are no longer refreshed, and `shadow.rs` regeneration is skipped if it already exists. This prevents the build script from invalidating incremental compilation results when nothing else has changed. CI and release builds are updated to explicitly enable this feature. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> * chore/refresh-build-info-on-demand: ### Update Build Configuration - Remove `refresh-build-info` Feature: - Removed the `refresh-build-info` feature from `action.yml`, `release.yml`, and `Cargo.toml`. - Updated `build.rs` to refresh timestamps by default in release builds, with an option to disable via `DISABLE_BUILD_INFO`. - Modify GitHub Actions: - Updated `.github/actions/build-linux-artifacts/action.yml` and `.github/workflows/release.yml` to exclude `refresh-build-info` from the `features` list. - Enhance Build Script Logic: - Adjusted logic in `build.rs` to handle timestamp refreshing based on build profile and environment variables. Signed-off-by: Lei, HUANG <mrsatangel@gmail.com> --------- Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-03-03 02:53:51 +00:00
Weny Xu	0402f24ed8	fix: preserve the order of primary key indices as defined in the original table info (#7740 ) Signed-off-by: WenyXu <wenymedia@gmail.com>	2026-03-02 12:34:55 +00:00
Lei, HUANG	65dd166c44	fix: Install uv to avoid failure while creating databases (#7741 ) Signed-off-by: Lei, HUANG <mrsatangel@gmail.com>	2026-03-02 10:29:55 +00:00

1 2 3 4 5 ...

5286 Commits