greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-05-24 17:00:37 +00:00

Author	SHA1	Message	Date
Zhenchi	01e907be40	feat(bloom-filter): integrate indexer with mito2 (#5236 ) * feat(bloom-filter): integrate indexer with mito2 Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * rename skippingindextype Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * address comments Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2025-01-04 02:12:27 +08:00
Lin Yihai	e4dc5ea243	feat: Add `vec_mul` function. (#5205 )	2025-01-04 02:12:27 +08:00
discord9	3ff5754b5a	feat(flow): check sink table mismatch on flow creation (#5112 ) * tests: more mismatch errors * feat: check sink table schema if exists&prompt nice err msg * chore: rm unused variant * chore: fmt * chore: cargo clippy * feat: check schema on create * feat: better err msg when mismatch * tests: fix a schema mismatch * todo: create sink table * feat: create sink table * fix: find time index * tests: auto created sink table * fix: remove empty keys * refactor: per review * chore: fmt * test: sqlness * chore: after rebase	2025-01-04 02:12:27 +08:00
Ruihang Xia	c22ca3ebd5	feat: add some critical metrics to flownode (#5235 ) * feat: add some critical metrics to flownode Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2025-01-04 02:12:27 +08:00
zyy17	327d165ad9	feat: introduce the Limiter in frontend to limit the requests by in-flight write bytes size. (#5231 ) feat: introduce Limiter to limit in-flight write bytes size in frontend	2025-01-04 02:12:27 +08:00
Zhenchi	be81f0db5a	feat(bloom-filter): impl batch push to creator (#5225 ) Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2025-01-04 02:12:27 +08:00
Ruihang Xia	6ca7a305ae	fix: correct write cache's metric labels (#5227 ) * refactor: remove unused field in WriteCache Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * refactor: unify read and write cache path Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * update config and fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unnecessary methods and adapt test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * change the default path Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove remote-home Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2025-01-04 02:12:27 +08:00
Weny Xu	1111a8bd57	chore: add log for converting region to follower (#5222 ) * chore: add log for converting region to follower * chore: apply suggestions from CR	2025-01-04 02:12:27 +08:00
Lei, HUANG	31cfab81ad	feat(mito): parquet memtable reader (#4967 ) * wip: row group reader base * wip: memtable row group reader * Refactor MemtableRowGroupReader to streamline data fetching - Added early return when fetch_ranges is empty to optimize performance. - Replaced inline chunk data assignment with a call to `assign_dense_chunk` for cleaner code. * wip: row group reader * wip: reuse RowGroupReader * wip: bulk part reader * Enhance BulkPart Iteration with Filtering - Introduced `RangeBase` to `BulkIterContext` for improved filter handling. - Implemented filter application in `BulkPartIter` to prune batches based on predicates. - Updated `SimpleFilterContext::new_opt` to be public for broader access. * chore: add prune test * fix: clippy * fix: introduce prune reader for memtable and add more prune test * Enhance BulkPart read method to return Option<BoxedBatchIterator> - Modified `BulkPart::read` to return `Option<BoxedBatchIterator>` to handle cases where no row groups are selected. - Added logic to return `None` when all row groups are filtered out. - Updated tests to handle the new return type and added a test case to verify behavior when no row groups match the pr * refactor/separate-paraquet-reader: Add helper function to parse parquet metadata and integrate it into BulkPartEncoder * refactor/separate-paraquet-reader: Change BulkPartEncoder row_group_size from Option to usize and update tests * refactor/separate-paraquet-reader: Add context module for bulk memtable iteration and refactor part reading • Introduce context module to encapsulate context for bulk memtable iteration. • Refactor BulkPart to use BulkIterContextRef for reading operations. • Remove redundant code in BulkPart by centralizing context creation and row group pruning logic in the new context module. • Create new file context.rs with structures and logic for handling iteration context. • Adjust part_reader.rs and row_group_reader.rs to reference the new BulkIterContextRef. 
* refactor/separate-paraquet-reader: Refactor RowGroupReader traits and implementations in memtable and parquet reader modules • Rename RowGroupReaderVirtual to RowGroupReaderContext for clarity. • Replace BulkPartVirt with direct usage of BulkIterContextRef in MemtableRowGroupReader. • Simplify MemtableRowGroupReaderBuilder by directly passing context instead of creating a BulkPartVirt instance. • Update RowGroupReaderBase to use context field instead of virt, reflecting the trait renaming and usage. • Modify FileRangeVirt to FileRangeContextRef and adjust implementations accordingly. * refactor/separate-paraquet-reader: Refactor column page reader creation and remove unused code • Centralize creation of SerializedPageReader in RowGroupBase::column_reader method. • Remove unused RowGroupCachedReader and related code from MemtableRowGroupPageFetcher. • Eliminate redundant error handling for invalid column index in multiple places. * chore: rebase main and resolve conflicts * fix: some comments * chore: resolve conflicts * chore: resolve conflicts	2025-01-04 02:12:27 +08:00
Ruihang Xia	dd3a509607	chore: bump opendal to fork version to fix prometheus layer (#5223 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2025-01-04 02:12:27 +08:00
Weny Xu	d4cae6af1e	refactor: remove unnecessary wrap (#5221 ) * chore: remove unnecessary arc * chore: remove unnecessary box	2025-01-04 02:12:27 +08:00
Ruihang Xia	3fec71b5c0	feat: logs query endpoint (#5202 ) * define endpoint Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * planner Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * update lock file Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add unit test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix toml format Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * revert metric change Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Update src/query/src/log_query/planner.rs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix compile Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * refactor and tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-01-04 02:12:27 +08:00
Zhenchi	9e31a6478b	feat(index-cache): abstract `IndexCache` to be shared by multi types of indexes (#5219 ) * feat(index-cache): abstract `IndexCache` to be shared by multi types of indexes Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix typo Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix: remove added label Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * refactor: simplify cached reader impl Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * rename func Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2025-01-04 02:12:27 +08:00
Zhenchi	68a05b38bd	feat(bloom-filter): add bloom filter reader (#5204 ) * feat(bloom-filter): add bloom filter reader Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * chore: remove unused dep Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix conflict Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * address comments Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2025-01-04 02:12:27 +08:00
Zhenchi	ee72ae8bd0	feat(bloom-filter): add memory control for creator (#5185 ) * feat(bloom-filter): add memory control for creator Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * refactor: remove meaningless buf Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * feat: add codec for intermediate Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2025-01-04 02:12:27 +08:00
Ruihang Xia	1327e8809f	feat: bump opendal and switch prometheus layer to the upstream impl (#5179 ) * feat: bump opendal and switch prometheus layer to the upstream impl Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused files Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused things Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove root dir on recovering cache Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * filter out non-files entry in test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2025-01-04 02:12:27 +08:00
evenyag	1f7d9666b7	chore: Downgrade opendal for releasing 0.11.1 Revert "feat: bump opendal and switch prometheus layer to the upstream impl (#5179)" This reverts commit `422d18da8b`.	2024-12-20 14:12:19 +08:00
discord9	ed8e418716	fix: auto created table ttl check (#5203 ) * fix: auto created table ttl check * tests: with hint	2024-12-20 14:12:19 +08:00
discord9	9e7121c1bb	fix(flow): batch builder with type (#5195 ) * fix: typed builder * chore: clippy * chore: rename * fix: unit tests * refactor: per review	2024-12-20 14:12:19 +08:00
discord9	f5e743379f	feat: show flow's mem usage in INFORMATION_SCHEMA.FLOWS (#4890 ) * feat: add flow mem size to sys table * chore: rm dup def * chore: remove unused variant * chore: minor refactor * refactor: per review	2024-12-20 14:12:19 +08:00
Ruihang Xia	6735e5867e	feat: bump opendal and switch prometheus layer to the upstream impl (#5179 ) * feat: bump opendal and switch prometheus layer to the upstream impl Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused files Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused things Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove root dir on recovering cache Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * filter out non-files entry in test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-12-20 14:12:19 +08:00
Weny Xu	925525726b	fix: ensure table route metadata is eventually rolled back on failure (#5174 ) * fix: ensure table route metadata is eventually rolled back on procedure failure * fix(fuzz): enhance procedure condition checking * chore: add logs * feat: close downgraded leader region actively * chore: apply suggestions from CR	2024-12-20 14:12:19 +08:00
Ning Sun	6427682a9a	feat: show create postgresql foreign table (#5143 ) * feat: add show create table for pg in parser * feat: implement show create table operation * fix: adopt upstream changes	2024-12-20 14:12:19 +08:00
Ruihang Xia	2d84cc8d87	refactor: remove unused symbols (#5193 ) chore: remove unused symbols Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-12-20 14:12:19 +08:00
Ruihang Xia	443c600bd0	fix: validate matcher op for __name__ in promql (#5191 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-12-20 14:12:19 +08:00
jeremyhi	9b5e4e80f7	feat: extract hints from http header (#5128 ) * feat: extract hints from http header * Update src/servers/src/http/hints.rs Co-authored-by: shuiyisong <113876041+shuiyisong@users.noreply.github.com> * chore: by comment * refactor: get instead of loop --------- Co-authored-by: shuiyisong <113876041+shuiyisong@users.noreply.github.com>	2024-12-20 14:12:19 +08:00
Yingwen	041a276b66	feat: do not remove time filters in ScanRegion (#5180 ) * feat: do not remove time filters * chore: remove `time_range` from parquet reader * chore: print more message in the check script * chore: fix unused error	2024-12-20 14:12:19 +08:00
Yingwen	614a25ddc5	feat: do not keep MemtableRefs in ScanInput (#5184 )	2024-12-20 14:12:19 +08:00
dennis zhuang	4337e20010	feat: impl label_join and label_replace for promql (#5153 ) * feat: impl label_join and label_replace for promql * chore: style * fix: dst_label is equal to src_label * fix: forgot to sort the results * fix: processing empty source label	2024-12-20 14:12:19 +08:00
Lanqing Yang	65c52cc698	fix: display inverted and fulltext index in show index (#5169 )	2024-12-20 14:12:19 +08:00
Yohan Wal	50f31fd681	feat: introduce Buffer for non-continuous bytes (#5164 ) * feat: introduce Buffer for non-continuous bytes * Update src/mito2/src/cache/index.rs Co-authored-by: Weny Xu <wenymedia@gmail.com> * chore: apply review comments * refactor: use opendal::Buffer --------- Co-authored-by: Weny Xu <wenymedia@gmail.com>	2024-12-20 14:12:19 +08:00
LFC	b5af5aaf8d	refactor: produce BatchBuilder from a Batch to modify it again (#5186 ) chore: pub some mods	2024-12-20 14:12:19 +08:00
Lei, HUANG	27693c7f1e	perf: avoid holding memtable during compaction (#5157 ) * perf/avoid-holding-memtable-during-compaction: Refactor Compaction Version Handling • Introduced CompactionVersion struct to encapsulate region version details for compaction, removing dependency on VersionRef. • Updated CompactionRequest and CompactionRegion to use CompactionVersion. • Modified open_compaction_region to construct CompactionVersion without memtables. • Adjusted WindowedCompactionPicker to work with CompactionVersion. • Enhanced flush logic in WriteBufferManager to improve memory usage checks and logging. * reformat code * chore: change log level * reformat code --------- Co-authored-by: Yingwen <realevenyag@gmail.com>	2024-12-20 14:12:19 +08:00
discord9	a59fef9ffb	test: sqlness upgrade compatibility tests (#5126 ) * feat: simple version switch * chore: remove debug print * chore: add common folder * tests: add drop table * feat: pull versioned binary * chore: don't use native-tls * chore: rm outdated docs * chore: new line * fix: save old bin dir * fix: switch version restart all node * feat: use etcd * fix: wait for election * fix: normal sqlness * refactor: hashmap for bin dir * test: past 3 major version compat create table * refactor: allow using without setup etcd	2024-12-20 14:12:19 +08:00
Zhenchi	bcecd8ce52	feat(bloom-filter): add basic bloom filter creator (Part 1) (#5177 ) * feat(bloom-filter): add a simple bloom filter creator (Part 1) Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix: clippy Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix: header Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * docs: add format comment Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2024-12-20 14:12:19 +08:00
Yingwen	ffdcb8c1ac	fix: deletion between two put may not work in `last_non_null` mode (#5168 ) * fix: deletion between rows with the same key may not work * test: add sqlness test case * chore: comments	2024-12-20 14:12:19 +08:00
Yingwen	554121ad79	chore: add aquamarine to dep lists (#5181 )	2024-12-20 14:12:19 +08:00
Weny Xu	43c12b4f2c	fix: correct `set_region_role_state_gracefully` behaviors (#5171 ) * fix: reduce default max rows for fuzz testing * chore: remove Postgres setup from fuzz test workflow * chore(fuzz): increase resource limits for GreptimeDB cluster * chore(fuzz): increase resource limits for kafka * fix: correct `set_region_role_state_gracefully` behaviors * chore: remove Postgres setup from fuzz test workflow * chore(fuzz): reduce resource limits for GreptimeDB & kafka	2024-12-20 14:12:19 +08:00
ZonaHe	06d7bd99dd	feat: update dashboard to v0.7.3 (#5172 ) Co-authored-by: sunchanglong <sunchanglong@users.noreply.github.com>	2024-12-20 14:12:19 +08:00
Ruihang Xia	b71d842615	feat: introduce SKIPPING index (part 1) (#5155 ) * skip index parser Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * wip: sqlness Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * impl show create part Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add empty line Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * change keyword to SKIPPING INDEX Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * rename local variables Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-12-20 14:12:19 +08:00
Lei, HUANG	7f71693b8e	chore: gauge for flush compaction (#5156 ) * add metrics * chore/bench-metrics: Add INFLIGHT_FLUSH_COUNT Metric to Flush Process • Introduced INFLIGHT_FLUSH_COUNT metric to track the number of ongoing flush operations. • Incremented INFLIGHT_FLUSH_COUNT in FlushScheduler to monitor active flushes. • Removed redundant increment of INFLIGHT_FLUSH_COUNT in RegionWorkerLoop to prevent double counting. * chore/bench-metrics: Add Metrics for Compaction and Flush Operations • Introduced INFLIGHT_COMPACTION_COUNT and INFLIGHT_FLUSH_COUNT metrics to track the number of ongoing compaction and flush operations. • Incremented INFLIGHT_COMPACTION_COUNT when scheduling remote and local compaction jobs, and decremented it upon completion. • Added INFLIGHT_FLUSH_COUNT increment and decrement logic around flush tasks to monitor active flush operations. • Removed redundant metric updates in worker.rs and handle_compaction.rs to streamline metric handling. * chore: add metrics for remote compaction jobs * chore: format * chore: also add dashboard	2024-12-20 14:12:19 +08:00
Lin Yihai	615ea1a171	feat: Add `vector_scalar_mul` function. (#5166 )	2024-12-20 14:12:19 +08:00
shuiyisong	4e725d259d	chore: remove unused dep (#5163 ) * chore: remove unused dep * chore: remove more unused dep	2024-12-20 14:12:19 +08:00
Niwaka	dc2252eb6d	fix: support alter table ~ add ~ custom_type (#5165 )	2024-12-20 14:12:19 +08:00
localhost	6066ce2c4a	fix: loki write row len error (#5161 )	2024-12-20 14:12:19 +08:00
Yohan Wal	fdccf4ff84	refactor: cache inverted index with fixed-size page (#5114 ) * feat: cache inverted index by page instead of file * fix: add unit test and fix bugs * chore: typo * chore: ci * fix: math * chore: apply review comments * chore: renames * test: add unit test for index key calculation * refactor: use ReadableSize * feat: add config for inverted index page size * chore: update config file * refactor: handle multiple range read and fix some related bugs * fix: add config * test: turn to a fs reader to match behaviors of object store	2024-12-20 14:12:19 +08:00
localhost	8b1484c064	chore: pipeline dryrun api can currently receive pipeline raw content (#5142 ) * chore: pipeline dryrun api can currently receive pipeline raw content * chore: remove dryrun v1 and add test * chore: change dryrun pipeline api body schema * chore: remove useless struct PipelineInfo * chore: update PipelineDryrunParams doc * chore: increase code readability * chore: add some comment for pipeline dryrun test * Apply suggestions from code review Co-authored-by: shuiyisong <113876041+shuiyisong@users.noreply.github.com> * chore: format code --------- Co-authored-by: shuiyisong <113876041+shuiyisong@users.noreply.github.com>	2024-12-20 14:12:19 +08:00
Yingwen	576e20ac78	feat: collect reader metrics from prune reader (#5152 )	2024-12-20 14:12:19 +08:00
localhost	10b3e3da0f	chore: decide tag column in log api follow table schema if table exists (#5138 ) * chore: decide tag column in log api follow table schema if table exists * chore: add more test for greptime_identity pipeline * chore: change pipeline get_table function signature * chore: change identity_pipeline_inner tag_column_names type	2024-12-20 14:12:19 +08:00
Weny Xu	4a3ef2d718	feat(index): add `file_size_hint` for remote blob reader (#5147 ) feat(index): add file_size_hint for remote blob reader	2024-12-20 14:12:19 +08:00

2854 Commits