tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-01-03 15:52:55 +00:00

Author	SHA1	Message	Date
PSeitz	e1679f3fb9	compact doc (#2402 ) * compact doc * add any value type * pass references when building CompactDoc * remove OwnedValue from API * clippy * clippy * fail on large documents * fmt * cleanup * cleanup * implement Value for different types fix serde_json date Value implementation * fmt * cleanup * fmt * cleanup * store positions instead of pos+len * remove nodes array * remove mediumvec * cleanup * infallible serialize into vec * remove positions indirection * remove 24MB limitation in document use u32 for Addr Remove the 3 byte addressing limitation and use VInt instead * cleanup * extend test * cleanup, add comments * rename, remove pub	2024-05-21 10:16:08 +02:00
PSeitz	74940e9345	clippy (#2349 ) * fix clippy * fix clippy * fix duplicate imports	2024-04-09 07:54:44 +02:00
PSeitz	1a9fc10be9	add fields_metadata to SegmentReader, add columnar docs (#2222 ) * add fields_metadata to SegmentReader, add columnar docs * use schema to resolve field, add test * normalize paths * merge for FieldsMetadata, add fields_metadata on Index * Update src/core/segment_reader.rs Co-authored-by: Paul Masurel <paul@quickwit.io> * merge code paths * add Hash * move function oustide --------- Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-11-22 12:29:53 +01:00
PSeitz	054f49dc31	support escaped dot, add agg test (#2250 ) add agg test for nested JSON allow escaping of dot	2023-11-20 03:00:57 +01:00
Chris Tam	6d9a7b7eb0	Derive Debug for SchemaBuilder (#2254 )	2023-11-15 01:03:44 +01:00
PSeitz	28dd6b6546	collect json paths in indexing (#2231 ) * collect json paths in indexing * remove unsafe iter_mut_keys	2023-11-01 11:25:17 +01:00
PSeitz	03a1f40767	rename DocValue to Value (#2197 ) rename DocValue to Value to avoid confusion with lucene DocValues rename Value to OwnedValue	2023-10-02 17:03:00 +02:00
Harrison Burt	1c7c6fd591	POC: Tantivy documents as a trait (#2071 ) * fix windows build (#1) * Fix windows build * Add doc traits * Add field value iter * Add value and serialization * Adjust order * Fix bug * Correct type * Fix generic bugs * Reformat code * Add generic to index writer which I forgot about * Fix missing generics on single segment writer * Add missing type export * Add default methods for convenience * Cleanup * Fix more-like-this query to use standard types * Update API and fix tests * Add doc traits * Add field value iter * Add value and serialization * Adjust order * Fix bug * Correct type * Rebase main and fix conflicts * Reformat code * Merge upstream * Fix missing generics on single segment writer * Add missing type export * Add default methods for convenience * Cleanup * Fix more-like-this query to use standard types * Update API and fix tests * Add tokenizer improvements from previous commits * Add tokenizer improvements from previous commits * Reformat * Fix unit tests * Fix unit tests * Use enum in changes * Stage changes * Add new deserializer logic * Add serializer integration * Add document deserializer * Implement new (de)serialization api for existing types * Fix bugs and type errors * Add helper implementations * Fix errors * Reformat code * Add unit tests and some code organisation for serialization * Add unit tests to deserializer * Add some small docs * Add support for deserializing serde values * Reformat * Fix typo * Fix typo * Change repr of facet * Remove unused trait methods * Add child value type * Resolve comments * Fix build * Fix more build errors * Fix more build errors * Fix the tests I missed * Fix examples * fix numerical order, serialize PreTok Str * fix coverage * rename Document to TantivyDocument, rename DocumentAccess to Document add Binary prefix to binary de/serialization * fix coverage --------- Co-authored-by: Pascal Seitz <pascal.seitz@gmail.com>	2023-10-02 10:01:16 +02:00
PSeitz	c4e2708901	fix clippy, fmt (#2162 )	2023-08-30 08:04:26 +02:00
Paul Masurel	7ee78bda52	Readding s in datetime precision variant names (#2065 ) There is no clear win and it change some serialization in quickwit.	2023-06-01 06:39:46 +02:00
Adrien Guillo	a789ad9aee	Rename `DatePrecision` to `DateTimePrecision` (#2051 )	2023-05-23 17:09:11 +02:00
Paul Masurel	d7e97331e5	Minor refactoring find field (#2055 ) * Minor refactoring Moving find_field_with_default to Schema. * Clippy comments	2023-05-22 15:00:48 +09:00
Paul Masurel	4417be165d	Minor refactoring (#2054 ) Moving find_field_with_default to Schema.	2023-05-22 14:56:38 +09:00
Paul Masurel	bd5eea9852	Integrated columnar work.	2023-02-09 13:14:31 +01:00
PSeitz	f687b3a5aa	start migrate Field to &str (#1772 ) start migrate Field to &str in preparation of columnar return Result for get_field	2023-01-18 16:12:07 +09:00
boraarslan	495824361a	Move `split_full_path` to `Schema` (#1692 )	2022-11-29 20:56:13 +09:00
PSeitz	0c2bd36fe3	Panic on duplicate field names (#1647 ) fixes #1601	2022-10-26 16:17:33 +09:00
Pascal Seitz	5c9cbee29d	handle IpV4 serialization case	2022-10-07 19:52:00 +08:00
Pascal Seitz	0b86658389	rename ip addr, use buffer	2022-10-07 16:25:01 +08:00
Pascal Seitz	4d29ff4d01	finalize ip addr rename	2022-10-07 16:25:01 +08:00
Pascal Seitz	400a20b7af	add ip field add u128 multivalue reader and writer add ip to schema add ip writers, handle merge	2022-10-07 16:25:01 +08:00
Bruce Mitchener	97ccd6d712	Avoid slicing a string in DocParsingError. (#1559 ) Fixes #1339.	2022-09-26 20:27:15 +09:00
Bruce Mitchener	cb252a42af	docs: "associated to" -> "associated with" (#1557 ) This reads better this way.	2022-09-26 20:23:37 +09:00
Evance Soumaoro	a4be239d38	Updated DateTime to hold timestamp in microseconds, while making date field precision configurable (#1396 )	2022-07-12 10:04:28 +09:00
Antoine G	11e4225f23	doc fix (#1391 ) Documentation fix.	2022-06-21 15:53:33 +09:00
boraarslan	811b91ecb3	Edit and add tests	2022-06-07 10:09:37 +03:00
boraarslan	ef2492dba6	Broken commit	2022-06-07 10:09:37 +03:00
Pascal Seitz	bb5254de12	always serialize, use enum as param	2022-04-04 13:50:23 +08:00
Paul Masurel	d7b46d2137	Added JSON Type (#1270 ) - Removed useless copy when ingesting JSON. - Bugfix in phrase query with a missing field norms. - Disabled range query on default fields Closes #1251	2022-02-24 16:25:22 +09:00
Pascal Seitz	704498a1ac	rename IntOptions to NumericOptions keep IntOptions with deprecation warning Fixes #1286	2022-02-21 22:20:07 +01:00
Paul Masurel	d37633e034	Minor changes in indexing. (#1285 )	2022-02-21 17:16:52 +09:00
Paul Masurel	bdedefe07d	Adding an IndexingContext object (#1268 )	2022-02-04 15:08:01 +09:00
Paul Masurel	eca6628b3c	Minor refactoring (#1266 )	2022-01-28 15:55:55 +09:00
Paul Masurel	c81b3030fa	Issue/922b (#1233 ) * Add a NORMED options on field Make fieldnorm indexation optional: * for all types except text => added a NORMED options * for text field if STRING, field has not fieldnorm retained if TEXT, field has fieldnorm computed * Finalize making fieldnorm optional for all field types. - Using Option for fieldnorm readers.	2021-12-10 21:12:29 +09:00
PSeitz	c503c6e4fa	Switch to non-strict schema (#1216 ) Fixes #1211	2021-11-29 10:38:59 +09:00
Pascal Seitz	1e4df54ab3	fix clippy	2021-07-01 17:41:53 +02:00
Paul Masurel	39dd8cfe24	Cargo clippy. Acronym should not be full uppercase apparently.	2021-04-26 11:49:18 +09:00
Evance Souamoro	f82922b354	added a scratched of implementation but still need to craft one detail and write test to validate	2021-04-06 11:46:17 +00:00
Laurent Pouget	4b34231f28	Make facet indexation and storage optional Added a FacetOptions for HierarchicalFacet which add indexed and stored flags to it. Propagate change and update tests accordingly Added a test to ensure that a not indexed flag was taken care of. Added on Value implem the `path()` function to return the stored facet.	2021-03-24 14:56:27 +01:00
Paul Masurel	fe3faf5b3f	Cargo fmt	2021-02-22 14:29:03 +09:00
Paul Masurel	96f946d4c3	Raultang master (#879 ) * add support for indexed bytes fast field * remove backup code file * refine test cases * Simplified unit test. Renamed it as it is testing the storable part. Not the indexed part. * Small refactoring and added unit test. If multivalued we only retain the first FAST value. Co-authored-by: Raul <raul.tang.lc@gmail.com>	2020-10-01 18:03:18 +09:00
Paul Masurel	838c476733	Hirevo move to thiserror (#889 ) * Migrated from `failure` to `thiserror` * Refactoring Co-authored-by: Nicolas Polomack <nicolas@polomack.eu>	2020-09-30 16:34:10 +09:00
Minoru Osuka	749432f949	Make SchemaBuilder::add_field() public (#742 ) * Make add_field() to public * cargo format	2019-12-25 20:37:34 +09:00
Paul Masurel	7b21b3f25a	Refactoring around Field (#673 ) * Refactoring around Field Removing the contract about the order of the field, and the field id allocation. * Update delete_queue.rs * Update field.rs	2019-10-25 09:06:44 +09:00
Paul Masurel	5196ca41d8	Small code clean up	2019-09-03 09:22:32 +09:00
Paul Masurel	b3b0138b82	Change for tantivy-py Schema.convert_named_doc Better Debug string for Terms and TermQueries	2019-08-14 17:44:25 +09:00
Paul Masurel	941f06eb9f	Added Schema.from_named_doc	2019-08-11 16:50:32 +09:00
Paul Masurel	280ea1209c	Changes required for python binding (#610 )	2019-08-01 17:26:21 +09:00
fdb-hiroshima	6eb4e08636	add support for float (#603 ) * add basic support for float as for i64, they are mapped to u64 for indexing query parser don't work yet * Update value.rs * implement support for float in query parser * Update README.md	2019-07-27 17:57:33 +09:00
Paul Masurel	0bc2c64a53	2018 (#585 ) * removing macro import for fail-rs * Downcast-rs * matches	2019-07-07 17:09:04 +09:00

1 2 3

122 Commits