greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-07-09 07:20:39 +00:00

Author	SHA1	Message	Date
Yingwen	f02410c39b	fix: disable field pruning in last non null mode (#4740 ) * fix: don't prune fields in last non null mode * test: add sqlness test for field pruning * test: add flush * refine implementation Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Ruihang Xia <waynestxia@gmail.com>	2024-09-20 00:35:37 +00:00
Weny Xu	befb6d85f0	fix: determine region role by using is_readonly (#4725 ) fix: correct `is_writable` behavior	2024-09-18 22:17:39 +00:00
Zhenchi	3b5b906543	feat(index): add explicit adapter between `RangeReader` and `AsyncRead` (#4724 ) Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2024-09-18 03:33:55 +00:00
Yingwen	3e17c09e45	feat: skip caching uncompressed pages if they are large (#4705 ) * feat: cache each uncompressed page * chore: remove unused function * chore: log * chore: log * chore: row group pages cache kv * feat: also support row group level cache * chore: fix range count * feat: don't cache compressed page for row group cache * feat: use function to get part * chore: log whether scan is from compaction * chore: avoid get column * feat: add timer metrics * chore: Revert "feat: add timer metrics" This reverts commit 4618f57fa2ba13b1e1a8dec83afd01c00ae4c867. * feat: don't cache individual uncompressed page * feat: append in row group level under append mode Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * chore: fetch pages cost * perf: yield * Update src/mito2/src/sst/parquet/row_group.rs * refactor: cache key * feat: print file num and row groups num in explain * test: update sqlness test * chore: Update src/mito2/src/sst/parquet/page_reader.rs --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Ruihang Xia <waynestxia@gmail.com>	2024-09-10 11:52:16 +00:00
Ruihang Xia	29f215531a	feat: parallel in row group level under append mode (#4704 ) feat: append in row group level under append mode Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-09-10 07:12:23 +00:00
Yohan Wal	04e7dd6fd5	feat: add json data type (#4619 ) * feat: add json type and vector * fix: allow to create and insert json data * feat: udf to query json as string * refactor: remove JsonbValue and JsonVector * feat: show json value as strings * chore: make ci happy * test: adunit test and sqlness test * refactor: use binary as grpc value of json * fix: use non-preserve-order jsonb * test: revert changed test * refactor: change udf get_by_path to jq * chore: make ci happy * fix: distinguish binary and json in proto * chore: delete udf for future pr * refactor: remove Value(Json) * chore: follow review comments * test: some tests and checks * test: fix unit tests * chore: follow review comments * chore: corresponding changes to proto * fix: change grpc and pgsql server behavior alongside with sqlness/crud tests * chore: follow review comments * feat: udf of conversions between json and strings, used for grpc server * refactor: rename to_string to json_to_string * test: add more sqlness test for json * chore: thanks for review :) * Apply suggestions from code review --------- Co-authored-by: Weny Xu <wenymedia@gmail.com>	2024-09-09 11:41:36 +00:00
Ruihang Xia	d2d62e0c6f	fix: unconditional statistics (#4694 ) * fix: unconditional statistics Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add more sqlness case Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-09-07 04:28:11 +00:00
Yingwen	506dc20765	fix: last non null iter not init (#4687 )	2024-09-06 04:13:23 +00:00
Lanqing Yang	86cef648cd	feat: add more spans to mito engine (#4643 ) feat: add more span on mito engine	2024-09-05 06:13:22 +00:00
LFC	d43e31c7ed	feat: schedule compaction when adding sst files by editing region (#4648 ) * feat: schedule compaction when adding sst files by editing region * add minimum time interval for two successive compactions * resolve PR comments	2024-09-04 10:10:07 +00:00
Ruihang Xia	8ca35a4a1a	fix: use number of partitions as parallilism in region scanner (#4669 ) * fix: use number of partitions as parallilism in region scanner Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add sqlness Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Lei HUANG <mrsatangel@gmail.com> * order by ts Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * debug pring time range Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Lei HUANG <mrsatangel@gmail.com>	2024-09-03 13:42:38 +00:00
Ruihang Xia	93f202694c	refactor: remove unused error variants (#4666 ) * add python script Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix all negative cases Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * setup CI Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add license header Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2024-09-03 13:19:38 +00:00
Lei, HUANG	37dcf34bb9	fix(mito): avoid caching empty batches in row group (#4652 ) * fix: avoid caching empty batches in row group * fix: clippy * Update tests/cases/standalone/common/select/last_value.sql * fix: sqlness	2024-09-02 02:43:00 +00:00
Yingwen	8eda36bfe3	feat: remove files from the write cache in purger (#4655 ) * feat: remove files from the write cache in purger * chore: fix typo	2024-08-31 04:19:52 +00:00
Ruihang Xia	a37aeb2814	feat: initialize partition range from ScanInput (#4635 ) * feat: initialize partition range from ScanInput Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * use num_rows instead Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add todo Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * setup unordered scan Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Update src/mito2/src/read/scan_region.rs Co-authored-by: jeremyhi <jiachun_feng@proton.me> * leave unordered scan unchanged Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: jeremyhi <jiachun_feng@proton.me>	2024-08-30 07:30:37 +00:00
LFC	8ea4f67e4b	refactor: reduce a object store "stat" call (#4645 )	2024-08-30 03:31:19 +00:00
LFC	d45b04180c	feat: pre-download the ingested sst (#4636 ) * refactor: pre-read the ingested sst file in object store to fill the local cache to accelerate first query * feat: pre-download the ingested SST from remote to accelerate following reads * resolve PR comments * resolve PR comments	2024-08-29 08:36:41 +00:00
Weny Xu	47657ebbc8	feat: replay WAL entries respect index (#4565 ) * feat(log_store): use new `Consumer` * feat: add `from_peer_id` * feat: read WAL entries respect index * test: add test for `build_region_wal_index_iterator` * fix: keep the handle * fix: incorrect last index * fix: replay last entry id may be greater than expected * chore: remove unused code * chore: apply suggestions from CR * chore: rename `datanode_id` to `location_id` * chore: rename `from_peer_id` to `location_id` * chore: rename `from_peer_id` to `location_id` * chore: apply suggestions from CR	2024-08-28 11:37:18 +00:00
Yingwen	28bf549907	fix: fallback to window size in manifest (#4629 )	2024-08-28 06:43:56 +00:00
zyy17	5177717f71	refactor: add `fallback_to_local` region option (#4578 ) * refactor: add 'fallback_to_local_compaction' region option * refactor: use 'fallback_to_local'	2024-08-23 03:09:43 +00:00
Weny Xu	25cd61b310	chore: upgrade toolchain to nightly-2024-08-07 (#4549 ) * chore: upgrade toolchain to `nightly-2024-08-07` * chore(ci): upgrade toolchain * fix: fix unit test	2024-08-22 11:02:18 +00:00
LFC	883c5bc5b0	refactor: skip checking the existence of the SST files (#4602 ) refactor: skip checking the existence of the SST files when region is directly edited	2024-08-22 08:32:27 +00:00
Yingwen	d628079f4c	feat: collect filters metrics for scanners (#4591 ) * feat: collect filter metrics * refactor: reuse ReaderFilterMetrics * feat: record read rows from parquet by type * feat: unordered scan observe rows also fix read type * chore: rename label	2024-08-22 03:22:05 +00:00
Yingwen	a12a905578	chore: disable ttl for write cache by default (#4595 ) * chore: remove default write cache ttl * docs: update example config * chore: fix ci	2024-08-21 08:38:38 +00:00
Ran Joe	9db08dbbe0	refactor(mito2): reduce duplicate IndexOutput struct (#4592 ) * refactor(mito2): reduce duplicate IndexOutput struct * docs(mito2): add index output note	2024-08-20 12:30:17 +00:00
ozewr	30af78700f	feat: Implement the Buf to avoid extra memory allocation (#4585 ) * feat: Implement the Buf to avoid extra memory allocation * fmt toml * fmt code * mv entry.into_buffer to raw_entry_buffer * less reuse opendal * remove todo #4065 * Update src/mito2/src/wal/entry_reader.rs Co-authored-by: Weny Xu <wenymedia@gmail.com> * fmt code --------- Co-authored-by: ozewr <l19ht@google.com> Co-authored-by: Weny Xu <wenymedia@gmail.com>	2024-08-19 12:11:08 +00:00
Weny Xu	76dc906574	feat(log_store): introduce the CollectionTask (#4530 ) * feat: introduce the `CollectionTask` * feat: add config of index collector * chore: remove unused code * feat: truncate indexes * chore: apply suggestions from CR * chore: update config examples * refactor: retrieve latest offset while dumping indexes * chore: print warn	2024-08-19 03:48:35 +00:00
LFC	ec59ce5c9a	feat: able to handle concurrent region edit requests (#4569 ) * feat: able to handle concurrent region edit requests * resolve PR comments	2024-08-16 03:29:03 +00:00
Lei, HUANG	216bce6973	perf: count() for append-only tables (#4545 ) feat: support fast count() for append-only tables fix: total_rows stats in time series memtable * fix: sqlness result changes for SinglePartitionScanner -> StreamScanAdapter * fix: some cr comments	2024-08-13 09:27:50 +00:00
Weny Xu	665b7e5c6e	perf: merge small byte ranges for optimized fetching (#4520 )	2024-08-09 08:17:54 +00:00
Weny Xu	cb4cffe636	chore: bump opendal version to 0.48 (#4499 )	2024-08-04 00:46:04 +00:00
Yingwen	ded874da04	feat: enlarge default page cache size (#4490 )	2024-08-02 07:24:20 +00:00
Yingwen	b388829a96	fix: avoid total size overflow (#4487 ) feat: avoid total size overflow	2024-08-02 06:16:37 +00:00
Lei, HUANG	9d5d7c1f9a	feat(compaction): add file number limits to TWCS compaction (#4481 ) * Add file number limits to TWCS compaction - Introduce `max_active_window_files` and `max_inactive_window_files` to `TwcsOptions`. * feat/limit-files-in-windows: Add max active/inactive window files options to mito engine config * feat/limit-files-in-windows: Add Debug derive to TwcsPicker and implement max file enforcement logging in TWCS compaction * fix: clippy	2024-08-01 12:42:09 +00:00
Yingwen	6c4b8b63a5	fix: notify flush receiver after write buffer is released (#4476 ) * fix: notify the worker after write buffer is released * feat: worker level region count	2024-08-01 07:15:36 +00:00
Yingwen	f382a7695f	perf: reduce lock scope and improve log (#4453 ) * feat: refine logs for scan * feat: improve build parts and unordered scan metrics * feat: change to debug log * fix: release lock before reading part * test: replace region id * test: fix sqlness * chore: add todo Co-authored-by: dennis zhuang <killme2008@gmail.com> --------- Co-authored-by: dennis zhuang <killme2008@gmail.com>	2024-07-31 04:07:34 +00:00
LFC	6d8a502430	chore: add more metrics about parquet and cache (#4410 ) * chore: add more metrics about parquet and cache * resolve PR comments * resolve PR comments * resolve PR comments * resolve PR comments	2024-07-30 12:01:49 +00:00
Ruihang Xia	7daf24c47f	feat: remove dedicated runtime for grpc, mysql and pg protocols (#4436 ) * feat: remove dedicated runtime for grpc, mysql and pg protocols Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove other runtimes Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * spawn compact task into compact_runtime Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * refine naming Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Update src/servers/tests/mysql/mysql_server_test.rs Co-authored-by: Zhenchi <zhongzc_arch@outlook.com> * fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * turnoff fuzz test matrix fail fast option Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * chore: update rt config for ci tests --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Zhenchi <zhongzc_arch@outlook.com> Co-authored-by: Weny Xu <wenymedia@gmail.com>	2024-07-30 06:17:58 +00:00
Zhenchi	cb94bd45d3	fix(fulltext-search): prune rows in row group forget to take remainder (#4447 ) * fix(fulltext-search): prune rows in row group forget to take remainder Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * test: add unit test Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2024-07-29 06:20:07 +00:00
Lei, HUANG	bdd3d2d9ce	chore: add dynamic cache size adjustment for InvertedIndexConfig (#4433 ) * Add dynamic cache size adjustment for InvertedIndexConfig * Increase cache sizes in integration tests for HTTP - Updated `metadata_cache_size` from 32MiB to 64MiB * Remove cache size settings from config and update drop_lines_with_inconsistent_results function to handle them * Add cache size configurations for inverted index metadata and content - Introduced `metadata_cache_size` with a default of 64MiB. - Introduced `content_cache_size` with a default of 128MiB. * chore/index-content-cache-default-size: Add cache size configuration options for Mito engine's inverted index	2024-07-26 03:36:20 +00:00
Lei, HUANG	0b0ed03ee6	fix(metrics): RowGroupLastRowCachedReader metrics (#4418 ) fix/reader-metrics: Refactor cache hit/miss logic and update metrics in mito2 - Simplify cache retrieval logic in CacheManager by removing inline update_hit_miss function call. - Add separate functions for incrementing cache hit and miss metrics. - Update RowGroupLastRowCachedReader to use new cache hit/miss functions and refactor to new helper methods for creating Hit and Miss variants.	2024-07-25 06:45:43 +00:00
Zhenchi	91dbac4141	fix(fulltext-index): clean up 0-value timer (#4423 ) Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2024-07-24 15:03:36 +00:00
Lei, HUANG	c8cf3b1677	fix(wal): handle WAL deletion on region drop (#4400 ) Add LogStore trait bound to RegionWorkerLoop and handle WAL deletion on region drop.	2024-07-19 13:24:10 +00:00
Yingwen	7aae19aa8b	fix: dictionary key type use u32 (#4396 ) * fix: dictionary key type use u32 * fix: fix error whle reading content * fix: bulk memtable dictionary type	2024-07-19 09:51:29 +00:00
Lei, HUANG	2010a2a33d	feat: Add caching for last row reader and expose cache manager (#4375 ) * Add caching for last row reader and expose cache manager - Implement `RowGroupLastRowCachedReader` to handle cache hits and misses for last row reads. * Add projection field to SelectorResultValue and refactor RowGroupLastRowReader - Introduced `projection` field in `SelectorResultValue` to store projection indices.	2024-07-16 07:13:39 +00:00
Lei, HUANG	7b28da277d	refactor: LastRowReader to use LastRowSelector (#4374 ) Refactor LastRowReader to use LastRowSelector - Replaced `last_batch` in `LastRowReader` with `LastRowSelector`.	2024-07-16 03:47:41 +00:00
Lei, HUANG	9fbc4ba649	feat: add PruneReader for optimized row filtering (#4370 ) * Add PruneReader for optimized row filtering and error handling - Introduced `PruneReader` to replace `RowGroupReader` for optimized row filtering. * Commit Message: Make ReaderMetrics fields public for external access * Add row selection support to SeqScan and FileRange readers - Updated `SeqScan::build_part_sources` to accept an optional `TimeSeriesRowSelector`. * Refactor `scan_region.rs` to remove unnecessary cloning of `series_row_selector`. Enhance `file_range.rs` by adding `select_all` method to check if all rows in a row group are selected, and update the logic in `reader` method to use `LastRowReader` only when all rows are selected and no DELETE operations are present. * Commit Message: Enhance PruneReader and ParquetReader with reset functionality and metrics handling Summary: • Made Source enum public in prune.rs. * chore: Update src/mito2/src/sst/parquet/reader.rs --------- Co-authored-by: Yingwen <realevenyag@gmail.com>	2024-07-15 14:23:34 +00:00
Yingwen	2e7b12c344	feat: add a cache for last value result in row group (#4369 ) * feat: add selector result cache to cache manager * feat: expose config	2024-07-15 12:33:36 +00:00
Zhenchi	04ac0c8da0	feat(fulltext_index): integrate full-text indexer with parquet reader (#4348 ) * feat(fulltext_index): integrate full-text indexer with parquet reader Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * disable reload Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * address comments Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix: range allow exceeding total row Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * test: unit tests in index Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * test: prune row groups Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * chore: rename creator Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * test: sst fulltext index Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * chore: address comment Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2024-07-15 08:14:44 +00:00
dennis zhuang	64cad4e891	feat: tweak error and status codes (#4359 ) * feat: tweak status codes * fix: typo * fix: by cr comments	2024-07-15 07:50:16 +00:00

1 2 3 4 5 ...

393 Commits