greptimedb

mirror of https://github.com/GreptimeTeam/greptimedb.git synced 2026-01-17 02:32:56 +00:00

Author	SHA1	Message	Date
Lei, HUANG	e77a7f253c	feat: L0 to L1 compaction strategy (#964 ) * feat: impl simple compaction strategy * chore: rebase to develop and fix clippy warnings * chore: simplify time bucket strcut * chore: some typos	2023-02-11 21:10:24 +08:00
Lei, HUANG	1e9918ddf9	feat: compaction scheduler and rate limiter (#947 ) * wip: compaction schdduler * feat: imple simple compaction scheduler * fix: typo * feat: add generic parameter to make scheduler friendly to tests * chore: add more tests * fix: CR comments * fix: CR comments * fix: ensure idempotency for rate limit token * fix: Cr ct omments	2023-02-09 11:43:20 +08:00
Yingwen	b2ad0e972b	feat: Define procedure related traits (#904 ) * chore: Move uuid to workspace.dependencies * feat: Define procedure related traits * test: Add tests * chore: Update imports * feat: Submit ProcedureWithId to manager * chore: pub ProcedureId::parse_str * refactor: ProcedureId::parse_str returns Result * chore: Address CR comments Also implements FromStr for ProcedureId	2023-01-31 14:17:28 +08:00
Lei, HUANG	43aefc5d74	feat: prunine sst files according to time range in filters (#887 ) * 1. Reimplement Eq for Timestamp 2. Add and/or for GenericRange * feat: extract time range from filters * feat: select sst files according to time range * fix: clippy * fix: empty value in range * fix: some cr comments * fix: return optional timestamp range * fix: cr comments	2023-01-28 15:16:41 +08:00
Lei, HUANG	4015dd8075	feat: record sst file time range in FileMeta (#860 ) * feat: record sst file time range in FileMeta * fix: clippy * chore: add some log and doc	2023-01-11 21:16:07 +08:00
Yingwen	b39dbcbda9	fix: Fix deleting table with non null column (#849 ) If the table has a non-null column, we need to use default value instead of null to fill the value columns in the record batch for deletion. Otherwise, we can't create the record batch since the schema check doesn't allow null in the non-null column.	2023-01-11 20:06:46 +08:00
LFC	72f05a3137	feat: flight aboard (#840 ) feat: replace old GRPC interface with Arrow Flight	2023-01-09 17:06:24 +08:00
Lei, HUANG	fa54870197	fix: parquet native row group pruning support (#845 ) * fix: parquet native row group pruning support * fix: use filter_map instead of flat_map	2023-01-09 12:10:14 +08:00
Xuanwo	777a3182c5	feat: Bump OpenDAL to 0.24 for better seekable support (#847 ) * deps: Bump OpenDAL to 0.24 for better seekable support Signed-off-by: Xuanwo <github@xuanwo.io> * fix: test Signed-off-by: Xuanwo <github@xuanwo.io> Co-authored-by: Lei, HUANG <mrsatangel@gmail.com>	2023-01-09 11:37:43 +08:00
Lei, HUANG	627d444723	fix: remove start from LogStore; fix error message (#837 )	2023-01-06 12:21:00 +08:00
Lei, HUANG	8f5ecefc90	feat: use raft-engine crate to reimplement logstore (#799 ) * chore: remove useless method in Entry trait, add proto definition for entry and namespace * feat: add proto definition for raft-engine based logstore * feat: introduce RaftEngineLogstore * feat: impl read for raft engine log store * feat: impl raft engine logstore * feat: raft engine logstore start and stop * feat: add purge bg task * fix: license header * fix: clippy * fix: toml files * feat: add some test cases * fix: CR comments * fix: CR comments * fix: check namespace validity and state of logstore * fix: CR comments; add config item to control sync/async flush per write * fix: remove unused error variants * fix: unit tests * fix: use compare and exchange to stop logstore * fix: CR comments	2023-01-05 17:18:51 +08:00
LFC	50cc0e9b51	feat: Impl Insert functionality of Arrow Flight service for Frontend Instance (#821 ) * feat: Implement Insert functionality of Arrow Flight service for Frontend Instance * fix: update license content * Update src/common/grpc-expr/src/alter.rs Co-authored-by: fys <40801205+Fengys123@users.noreply.github.com> * fix: resolve PR comments * fix: resolve PR comments Co-authored-by: fys <40801205+Fengys123@users.noreply.github.com>	2023-01-04 17:48:59 +08:00
Lei, HUANG	8ffc078f88	fix: license header (#815 )	2023-01-03 15:09:49 +08:00
Yingwen	4d56d896ca	feat: Implement delete for the storage engine (#777 ) * docs: Fix incorrect comment of Vector::only_null * feat: Add delete to WriteRequest and WriteBatch * feat: Filter deleted rows * fix: Fix panic after reopening engine This is detected by adding a reopen step to the delete test for region. * fix: Fix OpType::min_type() * test: Add delete absent key test * chore: Address CR comments	2022-12-30 17:12:18 +08:00
LFC	04df80e640	fix: further ease the restriction of executing SQLs in new GRPC interface (#797 ) * fix: carry not recordbatch result in FlightData, to allow executing SQLs other than selection in new GRPC interface * Update src/datanode/src/instance/flight/stream.rs Co-authored-by: Jiachun Feng <jiachun_feng@proton.me>	2022-12-28 16:43:21 +08:00
Ruihang Xia	26a3e93ca7	chore: util workspace deps in more places (#792 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-12-27 16:26:59 +08:00
Yingwen	f8500e54c1	refactor: Remove PutOperation and Simplify WriteRequest API (#775 ) * chore: Remove unused MutationExtra * refactor(storage): Refactor Mutation and Payload Change Mutation from enum to a struct that holds op type and record batches so the encoder don't need to convert the mutation into record batch. Now The Payload is no more an enum, it just holds the data, to be serialized to the WAL, of the WriteBatch. The encoder and decoder now deal with the Payload instead of the WriteBatch, so we could hold more information not necessary to be stored to the WAL in the WriteBatch. This commit also merge variants in write_batch::Error to storage::Error as some variants of them denote the same error. * test(storage): Pass all tests in storage * chore: Remove unused codes then format codes * test(storage): Fix test_put_unknown_column test * style(storage): Fix clippy * chore: Remove some unused codes * chore: Rebase upstream and fix clippy * chore(storage): Remove unused codes * chore(storage): Update comments * feat: Remove PayloadType from wal.proto * chore: Address CR comments * chore: Remove unused write_batch.proto	2022-12-26 13:11:24 +08:00
LFC	dc52a51576	chore: upgrade to Arrow 29.0 and use workspace package and dependencies (#782 ) * chore: upgrade to Arrow 29.0 and use workspace package and dependencies * fix: resolve PR comments Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-23 14:28:37 +08:00
LFC	ea9af42091	chore: upgrade Rust to nightly 2022-12-20 (#772 ) * chore: upgrade Rust to nightly 2022-12-20 * chore: upgrade Rust to nightly 2022-12-20 Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-21 19:32:30 +08:00
LFC	77182f5024	chore: upgrade Arrow to version 28, and DataFusion to 15 (#771 ) Co-authored-by: luofucong <luofucong@greptime.com>	2022-12-21 17:02:11 +08:00
Yingwen	7c16a4a17b	refactor(storage): Move write_batch::codec to a separate file (#757 ) * refactor(storage): Move write_batch::codec to a separate file * chore: move new_test_batch to write_batch mod	2022-12-16 15:32:59 +08:00
dennis zhuang	28bd7404ad	feat: change column's default property to nullable (#751 ) * feat: change column's default property to nullable * chore: use all instead of any * fix: compile error * fix: dependencies order in cargo	2022-12-16 11:17:01 +08:00
Lei, HUANG	0653301754	feat: replace arrow2 with official implementation 🎉 (#753 ) * chore: kick off. change datafusion/arrow/parquet to target version Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * chore: replace one last datafusion dep Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: arrow_array switch to arrow * chore: update dep of binary vector * chore: fix wrong merge commit Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: Switch to datatypes2 * feat: Make recordbatch compile * chore: sort Cargo.toml * feat: Fix common::recordbatch compiler errors * feat: Fix recordbatch test compiling issue * fix: api crate (#708) * fix: rename ConcreteDataType::timestamp_millis_type to ConcreteDataType::timestamp_millisecond_type. fix other warnings regarding timestamp * fix: revert changes in datatypes2 * fix: helper * chore: delete datatypes based on arrow2 * feat: Fix some compiler errors in common::query (#710) * feat: Fix some compiler errors in common::query * feat: test_collect use vectors api * fix: common-query subcrate (#712) * fix: record batch adapter Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix error enum Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix common::query compiler errors (#713) * feat: Move conversion to ScalarValue to value.rs * fix: Fix common::query compiler errors This commit also make InnerError pub(crate) * feat: Implements diff accumulator using WrapperType (#715) * feat: Remove usage of opaque error from common::recordbatch * feat: Remove opaque error from common::query * feat: Fix diff compiler errors Now common_function just use common_query's Error and Result. Adds a LargestType associated type to LogicalPrimitiveType to get the largest type a logical primitive type can cast to. * feat: Remove LargestType from NativeType trait * chore: Update comments * feat: Restrict Scalar::RefType of WrapperType to itself Add trait bound `for<'a> Scalar<RefType<'a> = Self>` to WrapperType * chore: Address CR comments * chore: Format codes * fix: fix compile error for mean/polyval/pow/interp ops Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Revert "fix: fix compile error for mean/polyval/pow/interp ops" This reverts commit `fb0b4eb826`. * fix: Fix compiler errors in argmax/rate/median/norm_cdf (#716) * fix: Fix compiler errors in argmax/rate/median/norm_cdf * chore: Address CR comments * fix: fix compile error for mean/polyval/pow/interp ops (#717) * fix: fix compile error for mean/polyval/pow/interp ops Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * simplify type bounds Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: fix argmin/percentile/clip/interp/scipy_stats_norm_pdf errors (#718) fix: fix argmin/percentile/clip/interp/scipy_stats_norm_pdf compiler errors * fix: fix other compile error in common-function (#719) * further fixing Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix all compile errors in common function Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix tests and clippy for common-function subcrate (#726) * further fixing Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix all compile errors in common function Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * revert test changes Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: row group pruning (#725) * fix: row group pruning * chore: use macro to simplify stats implemetation * fxi: CR comments * fix: row group metadata length mismatch * fix: simplify code * fix: Fix common::grpc compiler errors (#722) * fix: Fix common::grpc compiler errors This commit refactors RecordBatch and holds vectors in the RecordBatch struct, so we don't need to cast the array to vector when doing serialization or iterating the batch. Now we use the vector API instead of the arrow API in grpc crate. * chore: Address CR comments * fix common record batch Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix compile error in server subcrate (#727) * fix: Fix compile error in server subcrate Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove unused type alias Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * explicitly panic Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * Update src/storage/src/sst/parquet.rs Co-authored-by: Yingwen <realevenyag@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Yingwen <realevenyag@gmail.com> * fix: Fix common grpc expr (#730) * fix compile errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * rename fn names Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix styles Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix wranings in common-time Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: pre-cast to avoid tremendous match arms (#734) Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * feat: upgrade storage crate to arrow and parquet offcial impl (#738) * fix: compile erros * fix: parquet reader and writer * fix: parquet reader and writer * fix: WriteBatch IPC encode/decode * fix: clippy errors in storage subcrate * chore: remove suspicious unwrap * fix: some cr comments * fix: CR comments * fix: CR comments * fix: Fix compiler errors in catalog and mito crates (#742) * fix: Fix compiler errors in mito * fix: Fix compiler errors in catalog crate * style: Fix clippy * chore: Fix use * Merge pull request #745 * fix nyc-taxi and util * Merge branch 'replace-arrow2' into fix-others * fix substrait * fix warnings and error in test * fix: Fix imports in optimizer.rs * fix: errors in optimzer * fix: remove unwrap * fix: Fix compiler errors in query crate (#746) * fix: Fix compiler errors in state.rs * fix: fix compiler errors in state * feat: upgrade sqlparser to 0.26 * fix: fix datafusion engine compiler errors * fix: Fix some tests in query crate * fix: Fix all warnings in tests * feat: Remove `Type` from timestamp's type name * fix: fix query tests Now datafusion already supports median, so this commit also remove the median function * style: Fix clippy * feat: Remove RecordBatch::pretty_print * chore: Address CR comments * Update src/query/src/query_engine/state.rs Co-authored-by: Ruihang Xia <waynestxia@gmail.com> * fix: frontend compile errors (#747) fix: fix compile errors in frontend * fix: Fix compiler errors in script crate (#749) * fix: Fix compiler errors in state.rs * fix: fix compiler errors in state * feat: upgrade sqlparser to 0.26 * fix: fix datafusion engine compiler errors * fix: Fix some tests in query crate * fix: Fix all warnings in tests * feat: Remove `Type` from timestamp's type name * fix: fix query tests Now datafusion already supports median, so this commit also remove the median function * style: Fix clippy * feat: Remove RecordBatch::pretty_print * chore: Address CR comments * feat: Add column_by_name to RecordBatch * feat: modify select_from_rb * feat: Fix some compiler errors in vector.rs * feat: Fix more compiler errors in vector.rs * fix: fix table.rs Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix compiler errors in coprocessor * fix: Fix some compiler errors * fix: Fix compiler errors in script * chore: Remove unused imports and format code * test: disable interval tests * test: Fix test_compile_execute test * style: Fix clippy * feat: Support interval * feat: Add RecordBatch::columns and fix clippy Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Ruihang Xia <waynestxia@gmail.com> * fix: Fix All The Tests! (#752) * fix: Fix several tests compile errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: some compile errors in tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: compile errors in frontend tests * fix: compile errors in frontend tests * test: Fix tests in api and common-query * test: Fix test in sql crate * fix: resolve substrait error Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * chore: add more test * test: Fix tests in servers * fix instance_test Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * test: Fix tests in tests-integration Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Lei, HUANG <mrsatangel@gmail.com> Co-authored-by: evenyag <realevenyag@gmail.com> * fix: clippy errors Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: evenyag <realevenyag@gmail.com>	2022-12-15 18:49:12 +08:00
Lei, HUANG	756c068166	feat: logstore compaction (#740 ) * feat: add benchmark for wal * add bin * feat: impl wal compaction * chore: This reverts commit ef9f2326 * chore: This reverts commit 9142ec0e * fix: remove empty files * fix: failing tests * fix: CR comments * fix: Mark log as stable after writer applies manifest * fix: some cr comments and namings * chore: rename all stable_xxx to obsolete_xxx * chore: error message	2022-12-14 16:15:29 +08:00
Yingwen	a17dcbc511	chore: fix SequenceNotMonotonic error message (#664 ) * chore: fix SequenceNotMonotonic error message previous sequence should greater than or equal to given sequence * Apply suggestions from code review Co-authored-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-30 11:58:43 +08:00
Xuanwo	4085fc7899	chore: Bump OpenDAL to v0.21.1 (#639 ) * deps: Bump OpenDAL to v0.21.1 Signed-off-by: Xuanwo <github@xuanwo.io> * Avoid using raw types when not needed Signed-off-by: Xuanwo <github@xuanwo.io> Signed-off-by: Xuanwo <github@xuanwo.io>	2022-11-27 10:18:39 +08:00
Yingwen	0791c65149	refactor: replace some usage of MutableBitmap by BitVec (#610 )	2022-11-21 17:36:53 +08:00
Dongxu Wang	b6fa316c65	chore: correct typos (#589 ) (#592 )	2022-11-21 14:07:45 +08:00
Xuanwo	1f0b39cc8d	chore: Bump OpenDAL to v0.20 (#569 ) Signed-off-by: Xuanwo <github@xuanwo.io>	2022-11-18 14:17:38 +08:00
Ruihang Xia	7ba512980a	chore: add APACHE-2.0 license header (#518 ) * feat: add license checker workflow Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix existing header Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * specify license for internal sub-crate Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix rustfmt Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-15 18:05:46 +08:00
Ruihang Xia	1565c8d236	chore: specify import style in rustfmt (#460 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-15 15:58:54 +08:00
Yingwen	281eae9f44	fix: Fix filtering out rows incorrectly during dedup phase (#484 ) * fix: dedup should not mark element as unneeded It should only mark element as selected, because some column of different rows may have same value. * refactor: Rename dedup to find_unique As the original `dedup` method only mark bitmap to true when it finds the element is unique, so `find_unique` is more appropriate for its name. * test: Renew bitmap in test_batch_find_unique * chore: Update comments	2022-11-14 21:40:17 +08:00
dennis zhuang	68b299e04a	fix: apply recovered metadata after last WAL entry (#461 ) * fix: apply recovered metadata after last WAL entry * fix: condition error	2022-11-14 20:43:47 +08:00
Ruihang Xia	e30879f638	feat: Remove memtable's time bucket (#442 ) * refactor: partially replace MemtableSet with Memtable Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove MemtableWithMeta and MemtableSet in non-test mod Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove dead code Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * make test compile 🤣 Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix broken tests Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * make all tests pass Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix clippys Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * remove redundant clone Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * update comment Co-authored-by: Yingwen <realevenyag@gmail.com> * resolve review comment Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Co-authored-by: Yingwen <realevenyag@gmail.com>	2022-11-11 18:02:34 +08:00
dennis zhuang	74ea529d1a	feat: move time index metadata from schema into field (#444 ) * feat: move time index metadata from schema into field * chore: remove useless code * test: test select with column alias * fix: conflicts with develop branch * test: add test * test: order by timestamp to ensure query results order * fix: comment	2022-11-11 15:36:27 +08:00
Ruihang Xia	af1df2066c	perf: enlarge write row group size (#413 ) Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-08 11:23:10 +08:00
Ruihang Xia	89a3b39728	perf: improve table scan performance (#407 ) * refactor: improve table scan performance Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * use BufReader to avoid pre-loading all content Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-07 17:28:53 +08:00
Ruihang Xia	ae147c2a74	chore: refine some unnecessary log (#410 ) * remove some unnecessary informations in log Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * further cleaning Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-11-07 16:36:27 +08:00
Yingwen	cba611b9f5	refactor: Serialize RawSchema/RawTableMeta/RawTableInfo (#382 ) * refactor: Serialize Schema/TableMeta/TableInfo to raw structs * test: Add tests for raw struct conversion * style: Fix clippy * refactor: SchemaBuilder::timestamp_index takes Option<usize> So caller could chain the timestamp_index method call where there is no timestamp index. * style(datatypes): Chains SchemaBuilder method calls	2022-11-04 11:25:17 +08:00
Yingwen	f4e22282a4	feat: Region supports reading data with different schema (#342 ) * feat(storage): Implement skeleton of ReadResolver ReadResolver is used to resolve difference between schemas * feat(storage): Add user_column_end to ReadResover * feat(storage): Implement Batch::batch_from_parts Used to construct Batch from parts according to the schema that user expects to read. * feat(storage): Compat memtable schema * feat(storage): Compat parquet file schema * fix(storage): ReadResolver supports projection under same schema version Now ReadResolver takes ProjectedSchemaRef as dest schema, and checks whether a value column is needed by the schema after projection. * feat(storage): Check whether columns are same columns is_source_column_readable() takes ColumnMetadata instead of ColumnSchema, and compares their column id to check whether they are same columns. * refactor(storage): Use row_key_end/user_column_end in source_schema Rename ReadResolver::is_needed to ReadResolver::is_source_needed, and remove row_key_end/user_column_end from ReadResolver, since they should be same as source_schema's * chore(storage): Remove unused codes * test(storage): Add tests for the resolver * feat(storage): Returns error on different source and dest column names * style(storage): Fix clippy * refactor: Rename ReadResolver to ReadAdapter * chore(table): Removed unused comment * refactor: rename to is_source_column_compatible	2022-10-31 11:42:07 +08:00
Lei, Huang	81716d622e	feat: timestamp column support i64 (#325 ) * feat: align_bucket support i64 and timestamp values * feat: add Int64 to timestamp * feat: support query i64 timestamp vector * test: fix failling tests * refactor: simplify some code * fix: CR comments and add insert and query test for i64 timestamp column	2022-10-28 18:39:11 +08:00
Yingwen	64dac51e83	feat: Holds ColumnMetadata in StoreSchema (#333 ) * chore: Update StoreSchema comment * feat: Add metadata to ColumnSchema * feat: Impl conversion between ColumnMetadata and ColumnSchema We could use this feature to store the ColumnMetadata as arrow's Schema, since the ColumnSchema could be further converted to an arrow schema. Then we could use ColumnMetadata in StoreSchema, which contains more information, especially the column id. * feat(storage): Merge schema::Error to metadata::Error To avoid cyclic dependency of two Errors * feat(storage): Store ColumnMetadata in StoreSchema * feat(storage): Use StoreSchemaRef to avoid cloning the whole StoreSchema struct * test(storage): Fix test_store_schema * feat(datatypes): Return error on duplicate meta key * chore: Address CR comments	2022-10-25 11:06:22 +08:00
Yingwen	a457c49d99	refactor: Remove column_null_mask in MutationExtra (#314 ) * refactor: Remove column_null_mask in MutationExtra MutationExtra::column_null_mask is no longer needed as we could ensure there is no missing column in WriteBatch. * feat(storage): Remove MutationExtra Just stores MutationType in the WalHeader, no longer needs MutationExtra	2022-10-24 14:53:35 +08:00
Ruihang Xia	fbea07ea83	chore: remove unused dependencies (#319 ) * chore: remove unused dependences Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: recover some dev-deps Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-10-19 14:08:54 +08:00
Yingwen	4d08ee6fbb	fix: Fix broken wal and memtable benchmarks (#320 )	2022-10-19 10:54:01 +08:00
Yingwen	c6d91edb83	refactor(storage): Split schema mod into multiple sub-mods (#318 )	2022-10-18 18:56:52 +08:00
Yingwen	cdf3280fcf	feat: Region supports write requests with old schema (#297 ) * feat: Adds ColumnDefaultConstraint::create_default_vector ColumnDefaultConstraint::create_default_vector is ported from MitoTable::try_get_column_default_constraint_vector. * refactor: Replace try_get_column_default_constraint_vector by create_default_vector * style: Remove unnecessary map_err in MitoTable::insert * feat: Adds compat_write For column in `dest_schema` but not in `write_batch`, this method would insert a vector with default value to the `write_batch`. If there are columns not in `dest_schema`, an error would be returned. * chore: Add info log to RegionInner::alter * feat(storage): RegionImpl::write support request with old version * feat: Add nullable check when creating default value * feat: Validate nullable and default value * chore: Modify PutOperation comments * chore: Make ColumnDescriptor::is_nullable readonly and validate name * feat: Use CompatWrite trait to replace campat::compat_write method Adds a CompactWrite trait to support padding columns to WriteBatch: - The WriteBatch and PutData implements this trait - Fix the issue that WriteBatch::schema is not updated to the schema after compat - Also validate the created column when adding to PutData The WriteBatch would also pad default value to missing columns in PutData, so the memtable inserter don't need to manually check whether the column is nullable and then insert a NullVector. All WriteBatch is ensured to have all columns defined by the schema in its PutData. * feat: Validate constraint by ColumnDefaultConstraint::validate() The ColumnDefaultConstraint::validate() would also ensure the default value has the same data type as the column's. * feat: Use NullVector for null columns * fix: Fix BinaryType returns wrong logical_type_id * fix: Fix tests and revert NullVector for null columns NullVector doesn't support custom logical type make it hard to encode/decode, which also cause the arrow/protobuf codec of write batch fail. * fix: create_default_vector use replicate to create vector with default value This would fix the test_codec_with_none_column_protobuf test, as we need to downcast the vector to construct the protobuf values. * test: add tests for column default constraints * test: Add tests for CompatWrite trait impl * test: Test write region with old schema * fix(storage): Fix replay() applies metadata too early The committed sequence of the RegionChange action is the sequence of the last entry that use the old metadata (schema). During replay, we should apply the new metadata after we see an entry that has sequence greater than (not equals to) the `RegionChange::committed_sequence` Also remove duplicate `set_committed_sequence()` call in persist_manifest_version() * chore: Removes some unreachable codes Also add more comments to document codes in these files * refactor: Refactor MitoTable::insert Return error if we could not create a default vector for given column, instead of ignoring the error * chore: Fix incorrect comments * chore: Fix typo in error message	2022-10-18 10:47:24 +08:00
Ning Sun	f243649971	refactor: Removed openssl from build requirement (#308 ) * refactor:replace another axum-test-helper branch * refactor: upgrade opendal version * refactor: use cursor for file buffer * refactor:remove native-tls in mysql_async * refactor: use async block and pipeline for newer opendal api * chore: update Cargo.lock * chore: update dependencies * docs: removed openssl from build requirement * fix: call close on pipe writer to flush reader for parquet streamer * refactor: remove redundant return * chore: use pinned revision for our forked mysql_async * style: avoid wild-card import in test code * Apply suggestions from code review Co-authored-by: Yingwen <realevenyag@gmail.com> * style: use chained call for builder Co-authored-by: liangxingjian <965662709@qq.com> Co-authored-by: Yingwen <realevenyag@gmail.com>	2022-10-17 19:29:17 +08:00
dennis zhuang	494a93c4f2	feat: manifest improvements (#303 ) * feat: adds commited_sequence to RegionChange action, #281 * refactor: saving protocol action when writer version is changed * feat: recover all region medata in manifest and replay them when replaying WAL, #282 * refactor: minor change and test recovering metadata after altering table schema * fix: write wrong min_reader_version into manifest for region * refactor: move up DataRow * refactor: by CR comments * test: assert recovered metadata * refactor: by CR comments * fix: comment	2022-10-13 15:43:35 +08:00
Ruihang Xia	b61d5989b7	fix: flaky parquet predicate suits (#307 ) * fix: flaky parquet predicate suits Signed-off-by: Ruihang Xia <waynestxia@gmail.com> * fix: change ParquetWriter::write_rows as well Signed-off-by: Ruihang Xia <waynestxia@gmail.com> Signed-off-by: Ruihang Xia <waynestxia@gmail.com>	2022-10-13 14:00:42 +08:00

1 2 3

111 Commits