tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-05-27 05:30:45 +00:00

Author	SHA1	Message	Date
PSeitz	1a9fc10be9	add fields_metadata to SegmentReader, add columnar docs (#2222 ) * add fields_metadata to SegmentReader, add columnar docs * use schema to resolve field, add test * normalize paths * merge for FieldsMetadata, add fields_metadata on Index * Update src/core/segment_reader.rs Co-authored-by: Paul Masurel <paul@quickwit.io> * merge code paths * add Hash * move function oustide --------- Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-11-22 12:29:53 +01:00
Harrison Burt	1c7c6fd591	POC: Tantivy documents as a trait (#2071 ) * fix windows build (#1) * Fix windows build * Add doc traits * Add field value iter * Add value and serialization * Adjust order * Fix bug * Correct type * Fix generic bugs * Reformat code * Add generic to index writer which I forgot about * Fix missing generics on single segment writer * Add missing type export * Add default methods for convenience * Cleanup * Fix more-like-this query to use standard types * Update API and fix tests * Add doc traits * Add field value iter * Add value and serialization * Adjust order * Fix bug * Correct type * Rebase main and fix conflicts * Reformat code * Merge upstream * Fix missing generics on single segment writer * Add missing type export * Add default methods for convenience * Cleanup * Fix more-like-this query to use standard types * Update API and fix tests * Add tokenizer improvements from previous commits * Add tokenizer improvements from previous commits * Reformat * Fix unit tests * Fix unit tests * Use enum in changes * Stage changes * Add new deserializer logic * Add serializer integration * Add document deserializer * Implement new (de)serialization api for existing types * Fix bugs and type errors * Add helper implementations * Fix errors * Reformat code * Add unit tests and some code organisation for serialization * Add unit tests to deserializer * Add some small docs * Add support for deserializing serde values * Reformat * Fix typo * Fix typo * Change repr of facet * Remove unused trait methods * Add child value type * Resolve comments * Fix build * Fix more build errors * Fix more build errors * Fix the tests I missed * Fix examples * fix numerical order, serialize PreTok Str * fix coverage * rename Document to TantivyDocument, rename DocumentAccess to Document add Binary prefix to binary de/serialization * fix coverage --------- Co-authored-by: Pascal Seitz <pascal.seitz@gmail.com>	2023-10-02 10:01:16 +02:00
PSeitz	44850e1036	move fail dep to dev only (#2094 ) wasm compilation fails with dep only	2023-06-22 06:59:11 +02:00
PSeitz	9e2faecf5b	add memory limit for aggregations (#1942 ) * add memory limit for aggregations introduce AggregationLimits to set memory consumption limit and bucket limits memory limit is checked during aggregation, bucket limit is checked before returning the aggregation request. * Apply suggestions from code review Co-authored-by: Paul Masurel <paul@quickwit.io> * add ByteCount with human readable format --------- Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-03-16 06:21:07 +01:00
Paul Masurel	7fae4d98d7	Adapting for quickwit2 (#1912 ) * Adapting tantivy to make it possible to be plugged to quickwit. * Apply suggestions from code review Co-authored-by: PSeitz <PSeitz@users.noreply.github.com> * Added unit test --------- Co-authored-by: PSeitz <PSeitz@users.noreply.github.com>	2023-03-01 16:27:46 +09:00
Paul Masurel	66ff53b0f4	Various minor code cleanup (#1909 )	2023-02-27 13:48:34 +09:00
Paul Masurel	405e2cf4d9	Merge with main	2023-02-09 14:28:57 +01:00
Paul Masurel	bd5eea9852	Integrated columnar work.	2023-02-09 13:14:31 +01:00
PSeitz	0f20787917	fix doc store cache docs (#1821 ) * fix doc store cache docs addresses an issue reported in #1820 * rename doc_store_cache_size	2023-01-23 07:06:49 +01:00
PSeitz	f687b3a5aa	start migrate Field to &str (#1772 ) start migrate Field to &str in preparation of columnar return Result for get_field	2023-01-18 16:12:07 +09:00
Bruce Mitchener	b3bf9a5716	Documentation improvements.	2022-10-05 14:18:10 +07:00
Bruce Mitchener	cb252a42af	docs: "associated to" -> "associated with" (#1557 ) This reads better this way.	2022-09-26 20:23:37 +09:00
boraarslan	d4b2b7de8b	Expose inner file slice	2022-08-04 18:13:17 +03:00
Pascal Seitz	5750224d4c	set docstore cache size at construction	2022-07-04 14:27:55 +08:00
Antoine G	11e4225f23	doc fix (#1391 ) Documentation fix.	2022-06-21 15:53:33 +09:00
Kanji Yomoda	83d0c13fb0	Fix outdated variable naming and comments to alive bitset (#1387 ) * Fix outdated variables and comments for alive bitset * Fix expired link to delete bitset	2022-06-14 15:59:15 +09:00
Pascal Seitz	8807bfd13d	fast field on string enables FAST on string fields, which creates a fastfield containing the term ordinals	2022-03-29 12:40:10 +08:00
Antoine G	e37775fe21	iff->if or if and only if (#1298 ) * has_xxx is_xxx -> if, these function usualy define equivalence xxx returns bool -> specify equivalence when appropriate * fix doc	2022-03-02 11:00:00 +09:00
Paul Masurel	d7b46d2137	Added JSON Type (#1270 ) - Removed useless copy when ingesting JSON. - Bugfix in phrase query with a missing field norms. - Disabled range query on default fields Closes #1251	2022-02-24 16:25:22 +09:00
Paul Masurel	eca6628b3c	Minor refactoring (#1266 )	2022-01-28 15:55:55 +09:00
Shikhar Bhushan	99d4b1a177	Searcher Warming API (#1261 ) Adds an API to register Warmers in the IndexReader. Co-authored-by: Paul Masurel <paul@quickwit.io>	2022-01-20 23:40:25 +09:00
Paul Masurel	732f6847c0	Field type with codes (#1255 ) * Term are now typed. This change is backward compatible: While the Term has a byte representation that is modified, a Term itself is a transient object that is not serialized as is in the index. Its .field() and .value_bytes() on the other hand are unchanged. This change offers better Debug information for terms. While not necessary it also will help in the support for JSON types. * Renamed Hierarchical Facet -> Facet	2022-01-07 20:49:00 +09:00
Paul Masurel	c81b3030fa	Issue/922b (#1233 ) * Add a NORMED options on field Make fieldnorm indexation optional: * for all types except text => added a NORMED options * for text field if STRING, field has not fieldnorm retained if TEXT, field has fieldnorm computed * Finalize making fieldnorm optional for all field types. - Using Option for fieldnorm readers.	2021-12-10 21:12:29 +09:00
Paul Masurel	7234bef0eb	Issue/1198 (#1201 ) * Unit test reproducing #1198 * Fixing unit test to handle the error from add_document. * Bump project version	2021-11-11 16:42:19 +09:00
Paul Masurel	02cffa4dea	Code simplification. (#1169 ) Code simplification and Clippy	2021-10-07 14:11:44 +09:00
PSeitz	352e0cc58d	Adde demux operation (#1150 ) * add merge for DeleteBitSet, allow custom DeleteBitSet on merge * forward delete bitsets on merge, add tests * add demux operation and tests	2021-10-06 16:05:16 +09:00
Paul Masurel	0855649986	Leaning more on the alive (vs delete) semantics. (#1164 )	2021-10-05 18:53:29 +09:00
Pascal Seitz	5ee5037934	create and use ReadSerializedBitSet	2021-09-24 12:53:33 +08:00
Pascal Seitz	d7a6a409a1	renames	2021-09-23 20:33:11 +08:00
Pascal Seitz	a1f5cead96	AliveBitSet instead of DeleteBitSet	2021-09-23 20:03:57 +08:00
Pascal Seitz	93cbd52bf0	move code to biset, add inline, add benchmark	2021-09-18 17:35:22 +08:00
Pascal Seitz	c22177a005	add iterator	2021-09-17 15:29:27 +08:00
Pascal Seitz	4ae1d87632	add DeleteBitSet iterator	2021-09-15 23:10:04 +08:00
Pascal Seitz	ee0881712a	move bitset to common crate, move composite file to directory	2021-08-19 17:45:09 +01:00
Shikhar Bhushan	4e3771bffc	stale comments in segment_reader.rs	2021-07-15 22:47:32 -04:00
Pascal Seitz	8526434b63	add dynamic fastfield case add dynamic fastfield for single fast field unsorted fix scary documentation bug add num_len instead of len	2021-06-30 08:57:55 +02:00
Paul Masurel	aead5d4068	First stab	2021-04-26 12:46:06 +09:00
Paul Masurel	39dd8cfe24	Cargo clippy. Acronym should not be full uppercase apparently.	2021-04-26 11:49:18 +09:00
Laurent Pouget	4b34231f28	Make facet indexation and storage optional Added a FacetOptions for HierarchicalFacet which add indexed and stored flags to it. Propagate change and update tests accordingly Added a test to ensure that a not indexed flag was taken care of. Added on Value implem the `path()` function to return the stored facet.	2021-03-24 14:56:27 +01:00
Paul Masurel	52b1eb2c37	Clippy fix	2021-03-10 14:35:51 +09:00
Paul Masurel	94d3d7a89a	Rename FastFieldReaders::load_all	2021-01-21 18:38:48 +09:00
Paul Masurel	aa9e79f957	Clippy warnings.	2021-01-21 18:23:20 +09:00
Paul Masurel	1b4be24dca	Fast field are not loaded on the opening of a segment. They are instead loaded lazily when they are request.	2021-01-21 18:13:08 +09:00
Paul Masurel	c23a03ad81	Large API Change in the Directory API. (#901 ) Tantivy used to assume that all files could be somehow memory mapped. After this change, Directory return a `FileSlice` that can be reduced and eventually read into an `OwnedBytes` object. Long and blocking io operation are still required by they do not span over the entire file.	2020-10-08 16:36:51 +09:00
Paul Masurel	ad82b455a3	Minor change	2020-10-01 20:45:07 +09:00
Paul Masurel	848afa43ee	Merge branch 'issue/896' into main	2020-10-01 20:43:42 +09:00
Paul Masurel	7720d21265	Closes #896 - Facet reader related Bugfix. Acquiring a facet reader on a segment that does not contain any doc with this facet returns `None`.	2020-10-01 20:25:28 +09:00
Paul Masurel	96f946d4c3	Raultang master (#879 ) * add support for indexed bytes fast field * remove backup code file * refine test cases * Simplified unit test. Renamed it as it is testing the storable part. Not the indexed part. * Small refactoring and added unit test. If multivalued we only retain the first FAST value. Co-authored-by: Raul <raul.tang.lc@gmail.com>	2020-10-01 18:03:18 +09:00
Paul Masurel	439d6956a9	Returning Result in some of the API (#880 ) * Returning Result in some of the API * Introducing `.writer_for_test(..)`	2020-09-07 15:52:34 +09:00
Paul Masurel	2481c87be8	Block wand (#856 )	2020-08-19 22:36:36 +09:00

1 2 3 4

175 Commits