greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-07-04 21:10:37 +00:00

Author	SHA1	Message	Date
LFC	8149932bad	feat: local catalog drop table (#913 ) * feat: local catalog drop table * Update src/catalog/src/local/manager.rs Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com> * Update src/catalog/src/local/manager.rs Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com> * fix: resolve PR comments --------- Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com>	2023-01-31 14:44:03 +08:00
Lei, HUANG	43aefc5d74	feat: prunine sst files according to time range in filters (#887 ) * 1. Reimplement Eq for Timestamp 2. Add and/or for GenericRange * feat: extract time range from filters * feat: select sst files according to time range * fix: clippy * fix: empty value in range * fix: some cr comments * fix: return optional timestamp range * fix: cr comments	2023-01-28 15:16:41 +08:00
Zheming Li	0959c1d16b	feat: support default value when inserting data (#854)	2023-01-13 14:49:05 +08:00
Yingwen	b39dbcbda9	fix: Fix deleting table with non null column (#849 ) If the table has a non-null column, we need to use default value instead of null to fill the value columns in the record batch for deletion. Otherwise, we can't create the record batch since the schema check doesn't allow null in the non-null column.	2023-01-11 20:06:46 +08:00
Lei, HUANG	8ffc078f88	fix: license header (#815)	2023-01-03 15:09:49 +08:00
Yingwen	4d56d896ca	feat: Implement delete for the storage engine (#777 ) * docs: Fix incorrect comment of Vector::only_null * feat: Add delete to WriteRequest and WriteBatch * feat: Filter deleted rows * fix: Fix panic after reopening engine This is detected by adding a reopen step to the delete test for region. * fix: Fix OpType::min_type() * test: Add delete absent key test * chore: Address CR comments	2022-12-30 17:12:18 +08:00
Ruihang Xia	90990584b7	feat: Prom `SeriesNormalize` plan (#787 ) * feat: impl SeriesNormalize plan Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * some tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: add metrics Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * add license header Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * resolve CR comments Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * update tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * make time index column a parameter Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * precompute time index column index Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * sign the TODO Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-12-27 22:59:53 +08:00
Ruihang Xia	26a3e93ca7	chore: util workspace deps in more places (#792 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-12-27 16:26:59 +08:00
LFC	dc52a51576	chore: upgrade to Arrow 29.0 and use workspace package and dependencies (#782 ) * chore: upgrade to Arrow 29.0 and use workspace package and dependencies * fix: resolve PR comments Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-23 14:28:37 +08:00
LFC	ea9af42091	chore: upgrade Rust to nightly 2022-12-20 (#772 ) * chore: upgrade Rust to nightly 2022-12-20 * chore: upgrade Rust to nightly 2022-12-20 Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-21 19:32:30 +08:00
LFC	77182f5024	chore: upgrade Arrow to version 28, and DataFusion to 15 (#771 ) Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-21 17:02:11 +08:00
Lei, HUANG	0653301754	feat: replace arrow2 with official implementation 🎉 (#753 ) * chore: kick off. change datafusion/arrow/parquet to target version Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * chore: replace one last datafusion dep Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: arrow_array switch to arrow * chore: update dep of binary vector * chore: fix wrong merge commit Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: Switch to datatypes2 * feat: Make recordbatch compile * chore: sort Cargo.toml * feat: Fix common::recordbatch compiler errors * feat: Fix recordbatch test compiling issue * fix: api crate (#708) * fix: rename ConcreteDataType::timestamp_millis_type to ConcreteDataType::timestamp_millisecond_type. fix other warnings regarding timestamp * fix: revert changes in datatypes2 * fix: helper * chore: delete datatypes based on arrow2 * feat: Fix some compiler errors in common::query (#710) * feat: Fix some compiler errors in common::query * feat: test_collect use vectors api * fix: common-query subcrate (#712) * fix: record batch adapter Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix error enum Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix common::query compiler errors (#713) * feat: Move conversion to ScalarValue to value.rs * fix: Fix common::query compiler errors This commit also make InnerError pub(crate) * feat: Implements diff accumulator using WrapperType (#715) * feat: Remove usage of opaque error from common::recordbatch * feat: Remove opaque error from common::query * feat: Fix diff compiler errors Now common_function just use common_query's Error and Result. Adds a LargestType associated type to LogicalPrimitiveType to get the largest type a logical primitive type can cast to. 
* feat: Remove LargestType from NativeType trait * chore: Update comments * feat: Restrict Scalar::RefType of WrapperType to itself Add trait bound `for<'a> Scalar<RefType<'a> = Self>` to WrapperType * chore: Address CR comments * chore: Format codes * fix: fix compile error for mean/polyval/pow/interp ops Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Revert "fix: fix compile error for mean/polyval/pow/interp ops" This reverts commit `fb0b4eb826`. * fix: Fix compiler errors in argmax/rate/median/norm_cdf (#716) * fix: Fix compiler errors in argmax/rate/median/norm_cdf * chore: Address CR comments * fix: fix compile error for mean/polyval/pow/interp ops (#717) * fix: fix compile error for mean/polyval/pow/interp ops Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * simplify type bounds Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: fix argmin/percentile/clip/interp/scipy_stats_norm_pdf errors (#718) fix: fix argmin/percentile/clip/interp/scipy_stats_norm_pdf compiler errors * fix: fix other compile error in common-function (#719) * further fixing Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix all compile errors in common function Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix tests and clippy for common-function subcrate (#726) * further fixing Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix all compile errors in common function Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * revert test changes Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: row group pruning (#725) * fix: row group pruning * chore: use macro to simplify stats implemetation * fxi: CR comments * fix: row group metadata length mismatch * fix: simplify code * fix: Fix 
common::grpc compiler errors (#722) * fix: Fix common::grpc compiler errors This commit refactors RecordBatch and holds vectors in the RecordBatch struct, so we don't need to cast the array to vector when doing serialization or iterating the batch. Now we use the vector API instead of the arrow API in grpc crate. * chore: Address CR comments * fix common record batch Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix compile error in server subcrate (#727) * fix: Fix compile error in server subcrate Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused type alias Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * explicitly panic Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Update src/storage/src/sst/parquet.rs Co-authored-by: Yingwen <realevenyag@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Yingwen <realevenyag@gmail.com> * fix: Fix common grpc expr (#730) * fix compile errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * rename fn names Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix styles Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix wranings in common-time Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: pre-cast to avoid tremendous match arms (#734) Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: upgrade storage crate to arrow and parquet offcial impl (#738) * fix: compile erros * fix: parquet reader and writer * fix: parquet reader and writer * fix: WriteBatch IPC encode/decode * fix: clippy errors in storage subcrate * chore: remove suspicious unwrap * fix: some cr comments * fix: CR comments * fix: CR comments * fix: Fix compiler errors in catalog and mito crates (#742) * fix: Fix compiler errors in mito * fix: Fix compiler errors in catalog crate * style: Fix clippy * chore: Fix use * Merge pull request #745 * fix nyc-taxi and util * Merge branch 
'replace-arrow2' into fix-others * fix substrait * fix warnings and error in test * fix: Fix imports in optimizer.rs * fix: errors in optimzer * fix: remove unwrap * fix: Fix compiler errors in query crate (#746) * fix: Fix compiler errors in state.rs * fix: fix compiler errors in state * feat: upgrade sqlparser to 0.26 * fix: fix datafusion engine compiler errors * fix: Fix some tests in query crate * fix: Fix all warnings in tests * feat: Remove `Type` from timestamp's type name * fix: fix query tests Now datafusion already supports median, so this commit also remove the median function * style: Fix clippy * feat: Remove RecordBatch::pretty_print * chore: Address CR comments * Update src/query/src/query_engine/state.rs Co-authored-by: Ruihang Xia <waynestxia@gmail.com> * fix: frontend compile errors (#747) fix: fix compile errors in frontend * fix: Fix compiler errors in script crate (#749) * fix: Fix compiler errors in state.rs * fix: fix compiler errors in state * feat: upgrade sqlparser to 0.26 * fix: fix datafusion engine compiler errors * fix: Fix some tests in query crate * fix: Fix all warnings in tests * feat: Remove `Type` from timestamp's type name * fix: fix query tests Now datafusion already supports median, so this commit also remove the median function * style: Fix clippy * feat: Remove RecordBatch::pretty_print * chore: Address CR comments * feat: Add column_by_name to RecordBatch * feat: modify select_from_rb * feat: Fix some compiler errors in vector.rs * feat: Fix more compiler errors in vector.rs * fix: fix table.rs Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix compiler errors in coprocessor * fix: Fix some compiler errors * fix: Fix compiler errors in script * chore: Remove unused imports and format code * test: disable interval tests * test: Fix test_compile_execute test * style: Fix clippy * feat: Support interval * feat: Add RecordBatch::columns and fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: 
Ruihang Xia <waynestxia@gmail.com> * fix: Fix All The Tests! (#752) * fix: Fix several tests compile errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: some compile errors in tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: compile errors in frontend tests * fix: compile errors in frontend tests * test: Fix tests in api and common-query * test: Fix test in sql crate * fix: resolve substrait error Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * chore: add more test * test: Fix tests in servers * fix instance_test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * test: Fix tests in tests-integration Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Lei, HUANG <mrsatangel@gmail.com> Co-authored-by: evenyag <realevenyag@gmail.com> * fix: clippy errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: evenyag <realevenyag@gmail.com>	2022-12-15 18:49:12 +08:00
LFC	8959dbcef8	feat: Substrait logical plan (#704 ) * feat: use Substrait logical plan to query data from Datanode in Frontend in distributed mode * fix: resolve PR comments * fix: resolve PR comments * fix: resolve PR comments Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-06 19:21:57 +08:00
Yingwen	0791c65149	refactor: replace some usage of MutableBitmap by BitVec (#610)	2022-11-21 17:36:53 +08:00
Dongxu Wang	b6fa316c65	chore: correct typos (#589) (#592)	2022-11-21 14:07:45 +08:00
Yingwen	22ae983280	refactor: Use re-exported arrow mod from datatypes crate (#571)	2022-11-18 18:38:07 +08:00
Igor Morozov	e1f326295f	feat: implement DESCRIBE TABLE (#558 ) Also need to support describe table in other catalog/schema	2022-11-18 16:34:00 +08:00
LFC	872ac8058f	feat: distributed execute gRPC and Prometheus query in Frontend (#520 ) * feat: distributed execute GRPC and Prometheus query in Frontend * feat: distributed execute GRPC and Prometheus query in Frontend * Apply suggestions from code review Co-authored-by: Lei, Huang <6406592+v0y4g3r@users.noreply.github.com> * feat: distributed execute GRPC and Prometheus query in Frontend * fix: do not convert timestamp to string when converting logical plan to SQL * fix: tests * refactor: no mock * refactor: 0.0.0.0 -> 127.0.0.1 * refactor: 0.0.0.0 -> 127.0.0.1 * refactor: 0.0.0.0 -> 127.0.0.1 Co-authored-by: luofucong <luofucong@greptime.com> Co-authored-by: Lei, Huang <6406592+v0y4g3r@users.noreply.github.com>	2022-11-16 14:59:48 +08:00
Ruihang Xia	7ba512980a	chore: add APACHE-2.0 license header (#518 ) * feat: add license checker workflow Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix existing header Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * specify license for internal sub-crate Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix rustfmt Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-15 18:05:46 +08:00
Ruihang Xia	1565c8d236	chore: specify import style in rustfmt (#460 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-15 15:58:54 +08:00
LFC	2c0d2da5a7	feat: Frontend show tables and databases (#504 ) * feat: Frontend show tables and databases Co-authored-by: luofucong <luofucong@greptime.com>	2022-11-15 14:21:50 +08:00
Yingwen	281eae9f44	fix: Fix filtering out rows incorrectly during dedup phase (#484 ) * fix: dedup should not mark element as unneeded It should only mark element as selected, because some column of different rows may have same value. * refactor: Rename dedup to find_unique As the original `dedup` method only mark bitmap to true when it finds the element is unique, so `find_unique` is more appropriate for its name. * test: Renew bitmap in test_batch_find_unique * chore: Update comments	2022-11-14 21:40:17 +08:00
Lei, Huang	2d869e1e43	refactor: datanode starts frontend (#471 ) * refactor: dependency, from frontend depends on datanode to datanode depends on frontend * wip: start frontend in datanode * wip: migrate create database to frontend * wip: impl alter table * fix: CR comments	2022-11-12 21:07:18 +08:00
dennis zhuang	74ea529d1a	feat: move time index metadata from schema into field (#444 ) * feat: move time index metadata from schema into field * chore: remove useless code * test: test select with column alias * fix: conflicts with develop branch * test: add test * test: order by timestamp to ensure query results order * fix: comment	2022-11-11 15:36:27 +08:00
dennis zhuang	e62b302fb2	feat: some improvements on python coprocessor (#423 ) * feat: supports list array in arrow_array_get * feat: supports string and list type conversions in python coprocessor * test: add test cases for returning list in coprocessor	2022-11-10 11:53:27 +08:00
Yingwen	cefdffff09	fix: CURRENT_TIMESTAMP supports int64 type (#436 ) * fix: Fix int64 type not considered in DEFAULT CURRENT_TIMESTAMP() constraint Also avoid using `ConstantVector` in default constraint, as other user may try to downcast it to a concrete type, and sometimes may forget to check whether it is a constant vector. * test: Add test for writing default value	2022-11-10 11:35:16 +08:00
Yingwen	cba611b9f5	refactor: Serialize RawSchema/RawTableMeta/RawTableInfo (#382 ) * refactor: Serialize Schema/TableMeta/TableInfo to raw structs * test: Add tests for raw struct conversion * style: Fix clippy * refactor: SchemaBuilder::timestamp_index takes Option<usize> So caller could chain the timestamp_index method call where there is no timestamp index. * style(datatypes): Chains SchemaBuilder method calls	2022-11-04 11:25:17 +08:00
Lei, Huang	81716d622e	feat: timestamp column support i64 (#325 ) * feat: align_bucket support i64 and timestamp values * feat: add Int64 to timestamp * feat: support query i64 timestamp vector * test: fix failling tests * refactor: simplify some code * fix: CR comments and add insert and query test for i64 timestamp column	2022-10-28 18:39:11 +08:00
Yingwen	64dac51e83	feat: Holds ColumnMetadata in StoreSchema (#333 ) * chore: Update StoreSchema comment * feat: Add metadata to ColumnSchema * feat: Impl conversion between ColumnMetadata and ColumnSchema We could use this feature to store the ColumnMetadata as arrow's Schema, since the ColumnSchema could be further converted to an arrow schema. Then we could use ColumnMetadata in StoreSchema, which contains more information, especially the column id. * feat(storage): Merge schema::Error to metadata::Error To avoid cyclic dependency of two Errors * feat(storage): Store ColumnMetadata in StoreSchema * feat(storage): Use StoreSchemaRef to avoid cloning the whole StoreSchema struct * test(storage): Fix test_store_schema * feat(datatypes): Return error on duplicate meta key * chore: Address CR comments	2022-10-25 11:06:22 +08:00
LFC	6b0c5281d4	feat: try from DataFusion's ScalarValue for our Value (#329 ) * feat: try from DataFusion's ScalarValue for our Value * Update src/datatypes/src/value.rs Co-authored-by: Lei, Huang <6406592+v0y4g3r@users.noreply.github.com> * fix: resolve CR comments Co-authored-by: luofucong <luofucong@greptime.com> Co-authored-by: Lei, Huang <6406592+v0y4g3r@users.noreply.github.com>	2022-10-20 20:22:40 +08:00
Yingwen	cdf3280fcf	feat: Region supports write requests with old schema (#297 ) * feat: Adds ColumnDefaultConstraint::create_default_vector ColumnDefaultConstraint::create_default_vector is ported from MitoTable::try_get_column_default_constraint_vector. * refactor: Replace try_get_column_default_constraint_vector by create_default_vector * style: Remove unnecessary map_err in MitoTable::insert * feat: Adds compat_write For column in `dest_schema` but not in `write_batch`, this method would insert a vector with default value to the `write_batch`. If there are columns not in `dest_schema`, an error would be returned. * chore: Add info log to RegionInner::alter * feat(storage): RegionImpl::write support request with old version * feat: Add nullable check when creating default value * feat: Validate nullable and default value * chore: Modify PutOperation comments * chore: Make ColumnDescriptor::is_nullable readonly and validate name * feat: Use CompatWrite trait to replace campat::compat_write method Adds a CompactWrite trait to support padding columns to WriteBatch: - The WriteBatch and PutData implements this trait - Fix the issue that WriteBatch::schema is not updated to the schema after compat - Also validate the created column when adding to PutData The WriteBatch would also pad default value to missing columns in PutData, so the memtable inserter don't need to manually check whether the column is nullable and then insert a NullVector. All WriteBatch is ensured to have all columns defined by the schema in its PutData. * feat: Validate constraint by ColumnDefaultConstraint::validate() The ColumnDefaultConstraint::validate() would also ensure the default value has the same data type as the column's. 
* feat: Use NullVector for null columns * fix: Fix BinaryType returns wrong logical_type_id * fix: Fix tests and revert NullVector for null columns NullVector doesn't support custom logical type make it hard to encode/decode, which also cause the arrow/protobuf codec of write batch fail. * fix: create_default_vector use replicate to create vector with default value This would fix the test_codec_with_none_column_protobuf test, as we need to downcast the vector to construct the protobuf values. * test: add tests for column default constraints * test: Add tests for CompatWrite trait impl * test: Test write region with old schema * fix(storage): Fix replay() applies metadata too early The committed sequence of the RegionChange action is the sequence of the last entry that use the old metadata (schema). During replay, we should apply the new metadata after we see an entry that has sequence greater than (not equals to) the `RegionChange::committed_sequence` Also remove duplicate `set_committed_sequence()` call in persist_manifest_version() * chore: Removes some unreachable codes Also add more comments to document codes in these files * refactor: Refactor MitoTable::insert Return error if we could not create a default vector for given column, instead of ignoring the error * chore: Fix incorrect comments * chore: Fix typo in error message	2022-10-18 10:47:24 +08:00
dennis zhuang	25a16875b6	feat: create table and add new columns automatically in gRPC (#310 ) * fix: readme * feat: change Column's datatype in protobuf from optional to required * feat: supports creating table and adding new columns automatically in gRPC, #279, #283 * fix: test * refactor: execute_grpc_insert * refactor: clean code and add test * fix: test after rebasing develop branch * test: test grpc server with different ports * fix: typo Co-authored-by: Ruihang Xia <waynestxia@gmail.com> * fix: typo Co-authored-by: Ruihang Xia <waynestxia@gmail.com> * chore: minor changes * chore: build_alter_table_request Co-authored-by: Ruihang Xia <waynestxia@gmail.com>	2022-10-17 10:34:52 +08:00
evenyag	a8a6426abf	fix: Fix replicate_primitive doesn't consider null values (#306)	2022-10-12 16:52:09 +08:00
Lei, Huang	25078e821b	feat: type rewrite optimizer (#272 ) * feat: add type conversion optimizer * feat: add expr rewrite logical plan optimizer * chore: add some doc * fix: unit test * fix: time zone issue in unit tests * chore: add more tests * fix: some CR comments * chore: rebase develop * chore: fix unit tests * fix: unit test use timestamp with time zone * chore: add more tests	2022-09-28 13:56:13 +08:00
dennis zhuang	5f322ba16e	feat: impl default constraint for column (#273 ) * feat: impl default value for column in schema * test: add test for column's default value * refactor: rename ColumnDefaultValue to ColumnDefaultConstraint * fix: timestamp column may be a constant vector * fix: test_shutdown_pg_server * fix: typo Co-authored-by: LFC <bayinamine@gmail.com> * fix: typo Co-authored-by: LFC <bayinamine@gmail.com> * fix: typo Co-authored-by: LFC <bayinamine@gmail.com> * chore: use table_info directly Co-authored-by: LFC <bayinamine@gmail.com> * refactor: by CR comments Co-authored-by: LFC <bayinamine@gmail.com>	2022-09-22 10:43:21 +08:00
evenyag	a954ba862a	feat: Implement dedup reader (#270 ) * feat: Handle empty NullVector in replicate_null * chore: Rename ChunkReaderImpl::sst_reader to batch_reader * feat: dedup reader wip * feat: Add BatchOp Add BatchOp to support dedup/filter Batch and implement BatchOp for ProjectedSchema. Moves compare_row_of_batch to BatchOp::compare_row. * feat: Allow Batch has empty columns * feat: Implement DedupReader Also add From<MutableBitmap> for BooleanVector * test: Test dedup reader Fix issue that compare_row compare by full key not row key * chore: Add comments to BatchOp * feat: Dedup results from merge reader * test: Test merge read after flush * test: Test merge read after flush and reopen * test: Test replicate empty NullVector * test: Add tests for `ProjectedSchema::dedup/filter` * feat: Filter empty batch in DedepReader Also fix clippy warnings and refactor some codes	2022-09-21 17:49:53 +08:00
Ning Sun	8a400669aa	feat: postgre wire protocol for frontend (#269)	2022-09-19 15:39:53 +08:00
evenyag	e697ba975b	feat: Implement dedup and filter for vectors (#245 ) * feat: Dedup vector * refactor: Re-export Date/DateTime/Timestamp * refactor: Named field for ListValueRef::Ref Use field val instead of tuple for variant ListValueRef::Ref to keep consistence with ListValueRef::Indexed * feat: Implement ScalarVector for ListVector Also implements ScalarVectorBuilder for ListVectorBuilder, Scalar for ListValue and ScalarRef for ListValueRef * test: Add tests for ScalarVector implementation of ListVector * feat: Implement dedup using match_scalar_vector * refactor: Move dedup func to individual mod * chore: Update ListValueRef comments * refactor: Move replicate to VectorOp Move compute operations to VectorOp trait and acts as an super trait of Vector. So we could later put dedup/filter methods to VectorOp trait, avoid to define too many methods in Vector trait. * refactor: Move scalar bounds to PrimitiveElement Move Scalar and ScalarRef trait bounds to PrimitiveElement, so for each native type which implements PrimitiveElement, its PrimitiveVector always implements ScalarVector, so we could use it as ScalarVector without adding additional trait bounds * refactor: Move dedup to VectorOp Remove compute mod and move dedup logic to operations::dedup * feat: Implement VectorOp::filter * test: Move replicate test of primitive to replicate.rs * test: Add more replicate tests * test: Add tests for dedup and filter Also fix NullVector::dedup and ConstantVector::dedup * style: fix clippy * chore: Remove unused scalar.rs * test: Add more tests for VectorOp and fix failed tests Also fix TimestampVector eq not implemented. * chore: Address CR comments * chore: mention vector should be sorted in comment * refactor: slice the vector directly in replicate_primitive_with_type	2022-09-19 14:05:02 +08:00
LFC	a649f34832	fix: select empty table (#268 ) * fix: select empty table Co-authored-by: luofucong <luofucong@greptime.com>	2022-09-19 11:28:12 +08:00
Lei, Huang	1770079691	fix: slice implementation for DateVector/DateTimeVector… (#266 ) * fix: replicate and slice implementation for DateVector/DateTimeVector/TimestampVector * chore: rebase develop	2022-09-16 16:38:46 +08:00
Ning Sun	e67b0eb259	feat: Initial support of postgresql wire protocol (#229 ) * feat: initial commit of postgres protocol adapter * initial commit of postgres server * feat: use common_io runtime and correct testcase * fix previous tests * feat: adopt pgwire api changes and add support for text encoded data * feat: initial integration with datanode * test: add feature flag to test * fix: resolve lint warnings * feat: add postgres feature flags for datanode * feat: add support for newly introduced timestamp type * feat: adopt latest datanode changes * fix: address clippy warning for flattern scenario * fix: make clippy great again * fix: address issues found in review * chore: sort dependencies by name * feat: adopt new Output api * fix: return error on unsupported data types * refactor: extract common code dealing with record batches * fix: resolve clippy warnings * test: adds some unit tests postgres handler * test: correct test for cargo update * fix: update query module name * test: add assertion for error content	2022-09-15 21:39:05 +08:00
LFC	fb6153f7e0	feat: a new type for supplying `Ord` to `Primitive` (#255 ) Co-authored-by: luofucong <luofucong@greptime.com>	2022-09-15 18:32:55 +08:00
dennis zhuang	c8cb705d9e	ci: pre-commit configuration and hooks (#261 ) * feat: adds pre-commit config and hooks * refactor: sort all Cargo.toml by cargo-sort * ci: adds conventional-pre-commit hook to pre-commit * fix: remove .pre-commit-hooks.yaml * fix: readme * Update .pre-commit-config.yaml Co-authored-by: Lei, Huang <6406592+v0y4g3r@users.noreply.github.com> * ci: move clippy hook to push stage * docs: install pre-push github hook Co-authored-by: Lei, Huang <6406592+v0y4g3r@users.noreply.github.com>	2022-09-15 11:30:08 +08:00
Lei, Huang	2dbaad9770	fix: forbid use int64 as timestamp column data type (#248 ) * fix: forbid use int64 as timestamp column data type * fix unit test * fix unit tests * change gmt_created and gmt_modified data type in system tables to timestamp * also change data type in readme	2022-09-14 12:03:16 +08:00
LFC	ec99eb0cd0	feat: frontend instance (#238 ) * feat: frontend instance * no need to carry column length in `Column` proto * add more tests * rebase develop * create a new variant with already provisioned RecordBatches in Output * resolve code review comments * new frontend instance does not connect datanode grpc * add more tests * add more tests * rebase develop Co-authored-by: luofucong <luofucong@greptime.com>	2022-09-13 17:10:22 +08:00
evenyag	d52d1eb122	fix: Only convert LogicalTypeId to ConcreteDataType in tests (#241 ) LogicalTypeId to ConcreteDataType is only allowed in tests, since some additional info is not stored in LogicalTypeId now. It is just an id, or kind, not contains full type info.	2022-09-09 17:48:59 +08:00
Lei, Huang	9366e77407	feat: impl timestamp type, value and vectors (#226 ) * wip: impl timestamp data type * add timestamp vectors * adapt to recent changes to vector module * fix all unit test * rebase develop * fix slice * change default time unit to millisecond * add more tests * fix some CR comments * fix some CR comments * fix clippy * fix some cr comments * fix some CR comments * fix some CR comments * remove time unit in LogicalTypeId::Timestamp	2022-09-09 11:43:30 +08:00
evenyag	7f8195861e	feat: Adds push_value_ref and extend_slice_of to MutableVector (#215 ) * feat: Impl cmp_element() for Vector * chore: Add doc comments to MutableVector * feat: Add create_mutable() to DataType Add `create_mutable()` to create a MutableVector for each DataType. Implement ListVectorBuilder and NullVectorBuilder for ListType and NullType. * feat: Add ValueRef ValueRef is a reference to value, could be used to avoid some allocation when getting data from Vector. To support ValueRef, also implement a ListValueRef for ListValue, but comparision of ListValueRef still requires some allocation, due to the complexity of ListValue and ListVector. Impl some From trait for ValueRef * feat: Implement get_ref for Vector * feat: Remove cmp_element from Vector `cmp_element` could be replaced by `get_ref` and then compare * feat: Implement push/extend for PrimitiveVectorBuilder Implement push_value_ref() and extend_slice_of() for PrimitiveVectorBuilder. Also refactor the DataTypeBuilder trait for primitive types to PrimitiveElement trait, adds necessary cast helper methods to it. - Cast a reference to Vector to reference arrow's primitive array - Cast a ValueRef to primitive type - Also make PrimitiveElement super trait of Primitive * feat: Implement push/extend for all vector builders Implement push_value_ref() and extend_slice_of() for remaining vector builders. Add some helpful cast method to ValueRef and a method to cast Value to ValueRef. Change the behavior of PrimitiveElement::cast_xxx to panic when unable to cast, since push_value_ref() and extend_slice_of() always panic when given invalid input data type. 
* feat: MutableVector returns error if data type unmatch * test: Add tests for ValueRef * feat: Add tests for Vector::get_ref * feat: NullVector returns error if data type unmatch * test: Add tests for vector builders * fix: Fix compile error in python coprocessor * refactor: Add lifetime param to IntoValueRef The Primitive trait just use the `IntoValueRef<'static>` bound. Also rename create_mutable to create_mutable_vector. * chore: Address CR comments * feat: Customize PartialOrd/Ord for Value/ValueRef Panics if values/refs have different data type * style: Fix clippy * refactor: Use macro to generate body of ValueRef::as_xxx	2022-09-06 13:44:48 +08:00
Lei, Huang	3f9144a2e3	fix: StringVector use Utf8Array (#222)	2022-09-02 11:25:33 +08:00
evenyag	d71ae7934e	feat: Upgrade rust to nightly-2022-07-14 (#217 ) * feat: upgrade rust to nightly-2022-07-14 * style: Fix some clippy warnings * style: clippy fix * style: fix clippy * style: Fix clippy Some PartialEq warnings have been work around using cfg_attr test * feat: Implement Eq and PartialEq for PrimitiveType * chore: Remove unnecessary allow * chore: Remove usage of cfg_attr for PartialEq	2022-09-01 17:50:48 +08:00

95 Commits