greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2025-12-23 06:30:05 +00:00

Author	SHA1	Message	Date
dennis zhuang	fbf50c594e	fix: csv format escaping (#6061 ) * fix: csv format escaping * chore: change status code * fix: crate version	2025-05-08 05:52:20 +00:00
LFC	4b5ab75312	refactor: remove some async in ServerHandlers (#6057 ) * refactor: remove some async in ServerHandlers * address PR comments	2025-05-07 03:57:16 +00:00
shuiyisong	56f31d5933	feat(pipeline): select processor (#6019 ) * feat: support auto transform * refactor: replace hashbrown with ahash * refactor: params of run identity pipeline * refactor: minor update * test: add test for auto transform * feat: add select processor * test: select processor * chore: use include and exclude for key * fix: typos * chore: address CR comment * chore: typo * chore: typo * chore: address CR comment * chore: use with_context	2025-05-07 03:40:11 +00:00
Yingwen	07e84a28a3	fix: do not add projection to cast timestamp in label_values (#6040 ) * fix: do not add projection for cast Use cast to build time filter directly instead of adding a projection, which will cause column not found * feat: cast before creating plan	2025-05-06 23:47:41 +00:00
Lei, HUANG	f298a110f9	feat: bridge bulk insert (#5927 ) * feat/bridge-bulk-insert: ## Implement Bulk Insert and Update Dependencies - Bulk Insert Implementation: Added `handle_bulk_inserts` method in `src/operator/src/bulk_insert.rs` to manage bulk insert requests using `FlightDecoder` and `FlightData`. - Dependency Updates: Updated `Cargo.lock` and `Cargo.toml` to use the latest revision of `greptime-proto` and added new dependencies like `arrow`, `arrow-ipc`, `bytes`, and `prost`. - gRPC Enhancements: Modified `put_record_batch` method in `src/frontend/src/instance/grpc.rs` and `src/servers/src/grpc/flight.rs` to handle `FlightData` instead of `RawRecordBatch`. - Error Handling: Added new error types in `src/operator/src/error.rs` for handling Arrow operations and decoding flight data. - Miscellaneous: Updated `src/operator/src/insert.rs` to expose `partition_manager` and `node_manager` as public fields. * feat/bridge-bulk-insert: - Update `greptime-proto` Dependency: Updated the `greptime-proto` dependency to a new revision in `Cargo.lock` and `Cargo.toml`. - Refactor gRPC Query Handling: Removed `RawRecordBatch` usage from `grpc.rs`, `flight.rs`, `greptime_handler.rs`, and test files, simplifying the gRPC query handling. - Enhance Bulk Insert Logic: Improved bulk insert logic in `bulk_insert.rs` and `region_request.rs` by using `FlightDecoder` and `BooleanArray` for better performance and clarity. - Add `common-grpc` Dependency: Added `common-grpc` as a workspace dependency in `store-api/Cargo.toml` to support gRPC functionalities. * fix: clippy * fix schema serialization * feat/bridge-bulk-insert: Add error handling for encoding/decoding in `metadata.rs` and `region_request.rs` - Introduced new error variants `FlightCodec` and `Prost` in `MetadataError` to handle encoding/decoding failures in `metadata.rs`. - Updated `make_region_bulk_inserts` function in `region_request.rs` to use `context` for error handling with `ProstSnafu` and `FlightCodecSnafu`. 
- Enhanced error handling for `FlightData` decoding and `filter_record_batch` operations. * fix: test * refactor: rename * allow empty app_metadata in FlightData * feat/bridge-bulk-insert: - Remove Logging: Removed unnecessary logging of affected rows in `region_server.rs`. - Error Handling Enhancement: Improved error handling in `bulk_insert.rs` by adding context to `split_record_batch` and handling single datanode fast path. - Error Enum Cleanup: Removed unused `Arrow` error variant from `error.rs`. * fix: standalone test * feat/bridge-bulk-insert: ### Enhance Bulk Insert Handling and Metadata Management - `lib.rs`: Enabled the `result_flattening` feature for improved error handling. - `request.rs`: Made `name_to_index` and `has_null` fields public in `WriteRequest` for better accessibility. - `handle_bulk_insert.rs`: - Added `handle_record_batch` function to streamline processing of bulk insert payloads. - Improved error handling and task management for bulk insert operations. - Updated `region_metadata_to_column_schema` to return both column schemas and a name-to-index map for efficient data access. * feat/bridge-bulk-insert: - Refactor `handle_bulk_insert.rs`: - Replaced `handle_record_batch` with `handle_payload` for handling payloads. - Modified the fast path to use `common_runtime::spawn_global` for asynchronous task execution. - Optimize `multi_dim.rs`: - Added a fast path for single-region scenarios in `MultiDimPartitionRule::partition_record_batch`. * feat/bridge-bulk-insert: - Update `greptime-proto` Dependency: Updated the `greptime-proto` dependency to a new revision in both `Cargo.lock` and `Cargo.toml`. - Optimize Memory Allocation: Increased initial and builder capacities in `time_series.rs` to improve performance. - Enhance Data Handling: Modified `bulk_insert.rs` to use `Bytes` for efficient data handling. 
- Improve Bulk Insert Logic: Refined the bulk insert logic in `region_request.rs` to handle schema and payload data more effectively and optimize record batch filtering. - String Handling Improvement: Updated string conversion in `helper.rs` for better performance. * fix: clippy warnings * feat/bridge-bulk-insert: Add Metrics and Improve Error Handling - Metrics Enhancements: Introduced new metrics for bulk insert operations in `metrics.rs`, `bulk_insert.rs`, `greptime_handler.rs`, and `region_request.rs`. Added `HANDLE_BULK_INSERT_ELAPSED`, `BULK_REQUEST_MESSAGE_SIZE`, and `GRPC_BULK_INSERT_ELAPSED` histograms to monitor performance. - Error Handling Improvements: Removed unnecessary error handling in `handle_bulk_insert.rs` by eliminating redundant `let _ =` patterns. - Dependency Updates: Added `lazy_static` and `prometheus` to `Cargo.lock` and `Cargo.toml` for metrics support. - Code Refactoring: Simplified function calls in `region_server.rs` and `handle_bulk_insert.rs` for better readability. * chore: rebase main * chore: merge main	2025-05-06 09:53:25 +00:00
LFC	bb4890cff8	refactor: datanode instance builder (#6034 ) remove another piece of REPL codes	2025-05-03 00:28:32 +00:00
shuiyisong	a706edbb73	feat(pipeline): auto transform (#6013 ) * feat: support auto transform * refactor: replace hashbrown with ahash * refactor: params of run identity pipeline * refactor: minor update * test: add test for auto transform * chore: fix cr issues	2025-04-30 07:40:26 +00:00
shuiyisong	3c943be189	chore: update rust toolchain (#5818 ) * chore: update nightly version * chore: sort lint lines * chore: minor fix * chore: update nix * chore: update toolchain to 2024-04-14 * chore: update toolchain to 2024-04-15 * chore: remove unnecessory test * chore: do not assert oid in sqlness test * chore: fix margin issue * chore: fix cr issues * chore: fix cr issues --------- Co-authored-by: Ning Sun <sunning@greptime.com>	2025-04-27 09:02:36 +00:00
discord9	a0900f5b90	feat(flow): use batching mode&fix sqlness (#5903 ) * feat: use flow batching engine broken: try using logical plan fix: use dummy catalog for logical plan fix: insert plan exec&sqlness grpc addr feat: use frontend instance in flownode in standalone feat: flow type in metasrv&fix: flush flow out of sync& column name alias tests: sqlness update tests: sqlness flow rebuild udpate chore: per review refactor: keep chnl mgr refactor: use catalog mgr for get table tests: use valid sql fix: add more check refactor: put flow type determine to frontend * chore: update proto * chore: update proto to main branch * fix: add locks for create/drop flow&docs: update docs * feat: flush_flow flush all ranges now * test: add align time window test * docs: explain `nodeid` use in check task * refactor: AddAutoColumnRewriter check for Projection * refactor: per review * fix: query without time window also clean dirty time window * chore: better logging * chore: add comments per review * refactor: per review * chore: per review * chore: per review rename args * refactor: per review partially * chore: update docs * chore: use better error variant * chore: better error variant * refactor: rename FlowWorkerManager to FlowStreamingEngine * rename again * refactor: per review * chore: rebase after #5963 merged * refactor: rename all flow_worker_manager occurs * docs: rm resolved TODO	2025-04-23 15:12:16 +00:00
Ning Sun	bcefc6b83f	feat: add format support for promql http api (not prometheus) (#5939 ) * feat: add format support for promql http api (not prometheus) * test: add csv format test	2025-04-22 08:10:35 +00:00
Weny Xu	0a4594c9e2	fix: remove obsolete failover detectors after region leader change (#5944 ) * fix: remove obsolete failover detectors after region leader change * chore: apply suggestions from CR * fix: fix unit tests * fix: fix unit test * fix: failover logic	2025-04-22 06:15:47 +00:00
Ruihang Xia	e4556ce12b	fix: label values potential panic (#5921 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2025-04-17 14:01:21 +00:00
LFC	d27b9fc3a1	feat: implement Arrow Flight "DoPut" in Frontend (#5836 ) * feat: implement Arrow Flight "DoPut" in Frontend * support auth for "do_put" * set request_id in DoPut requests and responses * set "db" in request header	2025-04-17 03:46:19 +00:00
Lin Yihai	7274ceba30	feat: Add query pipeline http api (#5819 ) * feat(pipeline): add query pipeline http api. * chore(pipeline): rename get pipepile method * refactor(pipeline): Also insert string piple into cache after inserting into table. --------- Co-authored-by: shuiyisong <113876041+shuiyisong@users.noreply.github.com>	2025-04-16 10:17:20 +00:00
discord9	032df4c533	feat(flow): dual engine (#5881 ) * feat: partial use batch mode(WIP) * feat: add flow engine trait * refactor: more trait method * dual engine * feat: dual engine * refactor: flow map cache * chore: per review * chore: per review	2025-04-15 07:03:12 +00:00
zyy17	7b13376239	refactor: add `partition_rules_for_uuid()` (#5743 ) * refactor: add partition_rules_for_uuid() * refactor: support up to 65536 partitions for partition_rules_for_uuid()	2025-04-15 06:46:31 +00:00
Zhenchi	8d485e9be0	feat: support altering fulltext backend (#5896 ) * feat: add `greptime_index_type` to `information_schema.key_column_usage` Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> * fix: show create Signed-off-by: Zhenchi <zhongzc_arch@outlook.com> --------- Signed-off-by: Zhenchi <zhongzc_arch@outlook.com>	2025-04-15 06:36:06 +00:00
Ning Sun	be837ddc24	test: add tests to ensure nested data structure for identity pipeline (#5888 )	2025-04-14 03:13:46 +00:00
LFC	311727939d	chore: update datafusion family (#5814 )	2025-04-09 02:20:55 +00:00
zyy17	ee4fe9d273	refactor: improve performance for Jaeger APIs (#5838 ) * refactor: improve jaeger '/api/services' performance by adding the trace services table * chore: refine some logic * chore: compatible v0 * test: add integration test * chore: expand default limit from 100 to 2000 * test: fix integration test * refactor: make trace service table configurable * refactor: use a timestamp(2100-01-01 00:00:00) as large as possible * refactor: use '<trace_table>_services' as trace services table name	2025-04-08 02:28:06 +00:00
zyy17	cf1440fc32	refactor: add time range for jager get operations API (#5791 ) * refactor: add default time range for jager get operations API * refactor: use desc order for timestamp colomn * chore: modify http header name	2025-04-07 09:07:31 +00:00
fys	7b48ef1e97	chore: remove patch.crates-io for rustls (#5832 ) * chore: remove patch.crates-io for rustls * enable default-rustls-ring feature for mysql_sync * fix: build error * add comment * update comment	2025-04-07 07:51:50 +00:00
Ning Sun	f2907bb009	refactor!: make pipeline a required parameter when ingesting trace (#5828 ) * feat: make pipeline a required header for trace * test: add test case without pipeline	2025-04-07 06:18:17 +00:00
fys	33c9fb737c	refactor: remove mode option in configuration files (#5809 ) * refactor: remove mode option in configuration files * chore: remove mode in configuration file * remvoe mode field in FlownodeOptions * add comment for test * update config.md * remove mode field in standalone options * fix: ci	2025-04-01 07:14:10 +00:00
Weny Xu	d701c18150	feat: introduce `CustomizedRegionLeaseRenewer` (#5762 ) * feat: add manifest_version to `GrantedRegion` * chore: upgrade proto * chore: apply review suggestions * chore: apply suggestions from CR * feat: introduce `CustomizedRegionLeaseRenewerRef` * chore: upgrade to `103948`	2025-03-31 13:25:05 +00:00
Weny Xu	d3a60d8821	feat: add limit for the number of running procedures (#5793 ) * refactor: remove unused `messages` * feat: introduce running procedure num limit * feat: update config * chore: apply suggestions from CR * feat: impl `status_code` for `log-store` crate	2025-03-31 06:14:21 +00:00
shuiyisong	bef45ed0e8	feat(pipeline): support table name suffix templating in pipeline (#5775 ) * chore: add table name template in pipeline yaml * chore: implement apply function and add simple test * chore: add comment and integration test * chore: minor update * fix: typos * chore: change to table suffix * chore: update comment and test * chore: change name to table_suffix	2025-03-28 18:12:46 +00:00
Yingwen	737558ef53	fix: support __name__ matcher in label values (#5773 )	2025-03-28 02:18:59 +00:00
fys	2b2ea5bf72	chore: upgrade some dependencies (#5777 ) * chore: upgrade some dependencies * chore: upgrade some dependencies * fix: cr * fix: ci * fix: test * fix: cargo fmt	2025-03-27 02:48:44 +00:00
fys	9f9307de73	refactor: make frontend instance clear (#5754 ) * refactor: the startup of frontend * remove unnecessary error type * fix: cr * remove unnecessary trait FrontendInstance * fix: cr * fix: cr * adjust the startup order of services	2025-03-24 06:08:02 +00:00
shuiyisong	c77ce958a3	chore: support custom time index selector for identity pipeline (#5750 ) * chore: minor refactor * chore: minor refactor * chore: support custom ts for identity pipeline * chore: fix clippy * chore: minor refactor & update tests * chore: use ref on identity pipeline param	2025-03-24 04:27:22 +00:00
zyy17	a19441bed8	refactor: remove trace id from primary key in `opentelemetry_traces` table (#5733 ) * refactor: remove trace id in primary key * refactor: remove trace id in primary key in v0 model * refactor: add span id in v1 * fix: integration test	2025-03-19 06:17:58 +00:00
Ning Sun	1ab4ddab8d	feat: update pipeline header name to x-greptime-pipeline-name (#5710 ) * feat: update pipeline header name to x-greptime-pipeline-name * refactor: update string_value_from_header	2025-03-18 02:39:54 +00:00
Ning Sun	9e63018198	feat: disable http timeout (#5721 ) * feat: update to disable http timeout by default * feat: make http timeout default to 0 * test: correct test case * chore: generate new config doc * test: correct tests	2025-03-18 01:18:56 +00:00
yihong	16fddd97a7	chore: revert commit update flate2 version (#5706 )" (#5715 ) Revert "chore: update flate2 version (#5706)" This reverts commit `a5df3954f3`.	2025-03-17 12:16:26 +00:00
Ning Sun	2260782c12	refactor: update jaeger api implementation for new trace modeling (#5655 ) * refactor: update jaeger api implementation * test: add tests for v1 data model * feat: customize trace table name * fix: update column requirements to use Column type instead of String Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: lint fix * refactor: accumulate resource attributes for v1 * fix: add empty check for additional string * feat: add table option to mark data model version * fix: do not overwrite all tags * feat: use table option to mark table data model version and process accordingly * chore: update comments to reflect query changes * feat: use header for jaeger table name * feat: update index for service_name, drop index for span_name --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: zyy17 <zyylsxm@gmail.com>	2025-03-17 07:31:32 +00:00
yihong	a5df3954f3	chore: update flate2 version (#5706 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-03-14 02:15:27 +00:00
shuiyisong	fcb898e9a4	chore: support `inverted` index in pipeline (#5700 ) chore: rebase main	2025-03-13 08:30:29 +00:00
Ning Sun	8fa2fdfc42	feat: make empty parent_span_id null for v1 (#5690 )	2025-03-13 07:48:15 +00:00
shuiyisong	4dc1a1d60f	chore: support `tag` in transform (#5701 ) chore: support tag in transform to specify tag	2025-03-13 07:27:12 +00:00
Yingwen	25645a3303	feat: expose virtual_host_style config for s3 storage (#5696 ) * feat: expose enable_virtual_host_style for s3 storage * docs: update examples * test: fix config test	2025-03-12 13:46:56 +00:00
Yohan Wal	af1920defc	feat: add mysql kvbackend (#5528 ) * feat: add mysql kvbackend txn support * chore: error handling * chore: follow review comments * chore: follow review comments * chore: follow review comments * revert: mysql QAQ * revert: revert changes to sqls This reverts commit `cf98c50dd9`. * chore: add comments	2025-03-12 06:52:56 +00:00
Lin Yihai	2cbf51d0be	refactor!: Remove `Value::DateTime` and `ValueRef::DateTime`. (#5616 ) * refactor: Remove Value::DateTime and ValueRef::DateTime * fix: don't panic if arrow cast field. * fix: map `ColumnDataType::Datetime` to `ConcreteDataType::timestamp_microsecond_datatype` * fix: Map `ValueData::DatetimeValue` correctly. * refactor: Replace `datetime` with `timestamp_micro_second`	2025-03-11 07:03:27 +00:00
Weny Xu	0bd322a078	perf(prom): optimize label values query (#5653 ) perf: optimize label values query	2025-03-10 13:20:47 +00:00
shuiyisong	448e588fa7	chore: improve `/v1/jaeger/api/trace/{trace_id}`'s resp (#5663 ) * chore: improve jaeger trace api resp * chore: fix timestamp type * chore: fix timestamp type * chore: complete more fields * chore: change to microseconds * chore: add empty check & span status code * chore: minor update * chore: update test	2025-03-07 04:31:42 +00:00
Lei, HUANG	a56030e6a5	refactor: remove cluster id field (#5610 ) * chore: resolve conflicts * chore: merge main * test: add compatibility test for DatanodeLeaseKey with missing cluster_id * test: add compatibility test for DatanodeLeaseKey without cluster_id * refactor/remove-cluster-id: - Update `greptime-proto` Dependency: Updated the `greptime-proto` dependency in `Cargo.lock` and `Cargo.toml` to a new revision. - Remove `cluster_id` Usage: Removed the `cluster_id` field and its related logic from various files, including `cluster.rs`, `datanode.rs`, `rpc.rs`, `adapter.rs`, `client.rs`, `ask_leader.rs`, `heartbeat.rs`, `procedure.rs`, `store.rs`, `handler.rs`, `response_header_handler.rs`, `key.rs`, `datanode.rs`, `lease.rs`, `metrics.rs`, `cluster.rs`, `heartbeat.rs`, `procedure.rs`, and `store.rs`. - Refactor Tests: Updated tests in `client.rs`, `response_header_handler.rs`, `store.rs`, and `service` modules to reflect the removal of `cluster_id`. * fix: clippy * refactor/remove-cluster-id: Refactor and Cleanup in Meta Server - `response_header_handler.rs`: Removed unused import of `HeartbeatResponse` and cleaned up the test function by eliminating the creation of an unused `HeartbeatResponse` object. - `node_lease.rs`: Simplified parameter handling in `HttpHandler` implementation by using an underscore for unused parameters. * refactor/remove-cluster-id: ### Remove `TableMetadataAllocatorContext` and Refactor Code - Removed `TableMetadataAllocatorContext`: Eliminated the `TableMetadataAllocatorContext` struct and its usage across multiple files, including `ddl.rs`, `create_table.rs`, `create_view.rs`, `table_meta.rs`, `test_util.rs`, `create_logical_tables.rs`, `drop_table.rs`, and `table_meta_alloc.rs`. - Refactored Function Signatures: Updated function signatures to remove the `TableMetadataAllocatorContext` parameter in methods like `create`, `create_view`, and `alloc` in `table_meta.rs` and `table_meta_alloc.rs`. 
- Updated Imports: Adjusted import statements to reflect the removal of `TableMetadataAllocatorContext` in affected files. These changes simplify the codebase by removing an unnecessary context struct and updating related function calls. * refactor/remove-cluster-id: ### Update `datanode.rs` to Modify Key Prefix - File Modified: `src/common/meta/src/datanode.rs` - Key Changes: - Updated `DatanodeStatKey::prefix_key` and `From<DatanodeStatKey>` to remove the cluster ID from the key prefix. - Adjusted comments to reflect the changes in key prefix handling. * reformat code * refactor/remove-cluster-id: ### Commit Summary - Refactor `Pusher` Initialization: Removed the `RequestHeader` parameter from the `Pusher::new` method across multiple files, including `handler.rs`, `test_util.rs`, and `heartbeat.rs`. This change simplifies the `Pusher` initialization process by eliminating th unnecessary parameter. - Update Imports: Adjusted import statements in `handler.rs` and `test_util.rs` to remove unused `RequestHeader` references, ensuring cleaner and more efficient code. * chore: update proto	2025-03-05 08:22:18 +00:00
Ning Sun	37f8341963	feat: opentelemetry trace new data modeling (#5622 ) * feat: include trace v1 encoding * feat: add trace ingestion in inserter * feat: add partition rules and index for trace_id * chore: format * chore: fmt * fix: issue introduced with merge * feat: adjust index and add integration test for v1 * refactor: remove comment key * fix: update default value of skip index granularity * fix: update default value of skip index granularity * refactor: rename some functions * feat: remove skipping index from span_id * refactor: made span_id part of primary key for potential dedup purpose * feat: move the special attribute resource_attribute.service.name to top level --------- Co-authored-by: shuiyisong <113876041+shuiyisong@users.noreply.github.com>	2025-03-05 04:08:52 +00:00
shuiyisong	31f29d8a77	chore: support specifying `skipping` index in pipeline (#5635 ) * chore: support setting skipping index in pipeline * chore: fix typo key * chore: add test * chore: fix typo	2025-03-03 18:37:13 +00:00
Ruihang Xia	d69e93b91a	feat: support to generate json output for explain analyze in http api (#5567 ) * impl Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * integration test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Update src/servers/src/http/hints.rs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * refactor: with FORMAT option for explain format * lift some well-known metrics Signed-off-by: Ruihang Xia <waynestxia@gmail.com> --------- Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Ning Sun <sunning@greptime.com>	2025-02-21 05:13:09 +00:00
shuiyisong	53b25c04a2	chore: support Loki's structured metadata for ingestion (#5541 ) * chore: support loki's structured metadata * test: update test * chore: revert some code change * chore: address CR comment	2025-02-19 16:44:26 +00:00

711 Commits