tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-01-16 14:02:55 +00:00

Author	SHA1	Message	Date
PSeitz	e1679f3fb9	compact doc (#2402 ) * compact doc * add any value type * pass references when building CompactDoc * remove OwnedValue from API * clippy * clippy * fail on large documents * fmt * cleanup * cleanup * implement Value for different types fix serde_json date Value implementation * fmt * cleanup * fmt * cleanup * store positions instead of pos+len * remove nodes array * remove mediumvec * cleanup * infallible serialize into vec * remove positions indirection * remove 24MB limitation in document use u32 for Addr Remove the 3 byte addressing limitation and use VInt instead * cleanup * extend test * cleanup, add comments * rename, remove pub	2024-05-21 10:16:08 +02:00
PSeitz	0e9fced336	remove JsonTermWriter (#2238 ) * remove JsonTermWriter remove JsonTermWriter remove path truncation logic, add assertion * fix json_path_writer add sep logic	2024-04-18 16:28:05 +02:00
PSeitz	398817ce7b	add index sorting deprecation warning (#2353 ) * add index sorting deprecation warning * remove deprecated IntOptions and DatePrecision	2024-04-10 08:09:09 +02:00
PSeitz	6097235eff	fix numeric order, refactor Document (#2209 ) fix numeric order to prefer i64 rename and move Document stuff	2023-10-05 16:39:56 +02:00
PSeitz	03a1f40767	rename DocValue to Value (#2197 ) rename DocValue to Value to avoid confusion with lucene DocValues rename Value to OwnedValue	2023-10-02 17:03:00 +02:00
Harrison Burt	1c7c6fd591	POC: Tantivy documents as a trait (#2071 ) * fix windows build (#1) * Fix windows build * Add doc traits * Add field value iter * Add value and serialization * Adjust order * Fix bug * Correct type * Fix generic bugs * Reformat code * Add generic to index writer which I forgot about * Fix missing generics on single segment writer * Add missing type export * Add default methods for convenience * Cleanup * Fix more-like-this query to use standard types * Update API and fix tests * Add doc traits * Add field value iter * Add value and serialization * Adjust order * Fix bug * Correct type * Rebase main and fix conflicts * Reformat code * Merge upstream * Fix missing generics on single segment writer * Add missing type export * Add default methods for convenience * Cleanup * Fix more-like-this query to use standard types * Update API and fix tests * Add tokenizer improvements from previous commits * Add tokenizer improvements from previous commits * Reformat * Fix unit tests * Fix unit tests * Use enum in changes * Stage changes * Add new deserializer logic * Add serializer integration * Add document deserializer * Implement new (de)serialization api for existing types * Fix bugs and type errors * Add helper implementations * Fix errors * Reformat code * Add unit tests and some code organisation for serialization * Add unit tests to deserializer * Add some small docs * Add support for deserializing serde values * Reformat * Fix typo * Fix typo * Change repr of facet * Remove unused trait methods * Add child value type * Resolve comments * Fix build * Fix more build errors * Fix more build errors * Fix the tests I missed * Fix examples * fix numerical order, serialize PreTok Str * fix coverage * rename Document to TantivyDocument, rename DocumentAccess to Document add Binary prefix to binary de/serialization * fix coverage --------- Co-authored-by: Pascal Seitz <pascal.seitz@gmail.com>	2023-10-02 10:01:16 +02:00
trinity-1686a	b92082b748	implement lenient parser (#2129 ) * move query parser to nom * add suupport for term grouping * initial work on infallible parser * fmt * add tests and fix minor parsing bugs * address review comments * add support for lenient queries in tantivy * make lenient parser report errors * allow mixing occur and bool in query	2023-08-08 15:41:29 +02:00
Adrien Guillo	a789ad9aee	Rename `DatePrecision` to `DateTimePrecision` (#2051 )	2023-05-23 17:09:11 +02:00
PSeitz	74f9eafefc	refactor Term (#2006 ) * refactor Term add ValueBytes for serialized term values add missing debug for ip skip unnecessary json path validation remove code duplication add DATE_TIME_PRECISION_INDEXED constant add missing Term clarification remove weird value_bytes_mut() API * fix naming	2023-04-20 15:31:43 +02:00
PSeitz	faa706d804	add coerce option for text and numbers types (#1904 ) * add coerce option for text and numbers types allow to coerce the field type when indexing if the type does not match * Apply suggestions from code review Co-authored-by: Paul Masurel <paul@quickwit.io> * add tests,add COERCE flag, include bool in coercion --------- Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-03-01 11:36:59 +01:00
Paul Masurel	f537334e4f	Adding a write schema to columnar's merge operations. (#1884 ) * Adding a write schema to columnar's merge operations. * Added unit test checking min/max when columns are empty. * CR comment * Rename to value_type_to_column_type	2023-02-21 18:25:16 +09:00
Paul Masurel	bd5eea9852	Integrated columnar work.	2023-02-09 13:14:31 +01:00
Pascal Seitz	6800fdec9d	add indexing for ip field Closes #1595	2022-10-18 10:07:48 +08:00
Pascal Seitz	4d29ff4d01	finalize ip addr rename	2022-10-07 16:25:01 +08:00
Bruce Mitchener	b3bf9a5716	Documentation improvements.	2022-10-05 14:18:10 +07:00
Pascal Seitz	f757471077	prepare for ip field	2022-09-26 16:27:35 +08:00
Bruce Mitchener	cf02e32578	Improvements to doc linking, grammar, etc.	2022-09-19 18:10:22 +07:00
Evance Soumaoro	a4be239d38	Updated DateTime to hold timestamp in microseconds, while making date field precision configurable (#1396 )	2022-07-12 10:04:28 +09:00
Antoine G	11e4225f23	doc fix (#1391 ) Documentation fix.	2022-06-21 15:53:33 +09:00
Paul Masurel	d7b46d2137	Added JSON Type (#1270 ) - Removed useless copy when ingesting JSON. - Bugfix in phrase query with a missing field norms. - Disabled range query on default fields Closes #1251	2022-02-24 16:25:22 +09:00
Pascal Seitz	704498a1ac	rename IntOptions to NumericOptions keep IntOptions with deprecation warning Fixes #1286	2022-02-21 22:20:07 +01:00
Paul Masurel	eca6628b3c	Minor refactoring (#1266 )	2022-01-28 15:55:55 +09:00
sigaloid	096ce7488e	Resolve some clippys, format (#1144 ) * cargo +nightly clippy --fix -Z unstable-options	2021-08-26 08:46:00 +09:00
François Massot	f4b2e71800	Handle field names with any characters with a known set of special (#1109 ) * Handle field names with any characters with a known set of special characters and an escape one * Update field name validation rule to check only if it has at least one character and does not start with `-` Closes #1087.	2021-07-05 22:31:36 +09:00
Moriyoshi Koizumi	4afba005f9	Provide a means to deal with malformed facet text representation for the query parser (#1056 ) * Provide a means to deal with malformed facet text representation for the query parser. * Specific error enum for the facet parse error.	2021-05-27 12:16:49 +09:00
Laurent Pouget	4b34231f28	Make facet indexation and storage optional Added a FacetOptions for HierarchicalFacet which add indexed and stored flags to it. Propagate change and update tests accordingly Added a test to ensure that a not indexed flag was taken care of. Added on Value implem the `path()` function to return the stored facet.	2021-03-24 14:56:27 +01:00
Paul Masurel	96f946d4c3	Raultang master (#879 ) * add support for indexed bytes fast field * remove backup code file * refine test cases * Simplified unit test. Renamed it as it is testing the storable part. Not the indexed part. * Small refactoring and added unit test. If multivalued we only retain the first FAST value. Co-authored-by: Raul <raul.tang.lc@gmail.com>	2020-10-01 18:03:18 +09:00
Paul Masurel	439d6956a9	Returning Result in some of the API (#880 ) * Returning Result in some of the API * Introducing `.writer_for_test(..)`	2020-09-07 15:52:34 +09:00
Paul Masurel	3a72b1cb98	Accept dash within field names. (#874 ) Accept dash in field names and enforce field names constraint at the creation of the schema. Closes #796	2020-09-01 13:38:52 +09:00
Ximo Guanter	00816f5529	Fix outdated reference in documentation (#720 )	2019-12-08 18:10:50 +09:00
Paul Masurel	5c6580eb15	fmt (#661 )	2019-10-04 12:10:01 +09:00
fdb-hiroshima	6eb4e08636	add support for float (#603 ) * add basic support for float as for i64, they are mapped to u64 for indexing query parser don't work yet * Update value.rs * implement support for float in query parser * Update README.md	2019-07-27 17:57:33 +09:00
Paul Masurel	4867be3d3b	Kompass master (#590 ) * Use once_cell in place of lazy_static * Minor changes	2019-07-10 19:24:54 +09:00
Paul Masurel	94f1885334	Issue/513 (#514 ) * Closes #513 * Clean up and doc * Updated changelog	2019-03-07 09:39:30 +09:00
Paul Masurel	07d87e154b	Collector refactoring and multithreaded search (#437 ) * Split Collector into an overall Collector and a per-segment SegmentCollector. Precursor to cross-segment parallelism, and as a side benefit cleans up any per-segment fields from being Option<T> to just T. * Attempt to add MultiCollector back * working. Chained collector is broken though * Fix chained collector * Fix test * Make Weight Send+Sync for parallelization purposes * Expose parameters of RangeQuery for external usage * Removed &mut self * fixing tests * Restored TestCollectors * blop * multicollector working * chained collector working * test broken * fixing unit test * blop * blop * Blop * simplifying APi * blop * better syntax * Simplifying top_collector * refactoring * blop * Sync with master * Added multithread search * Collector refactoring * Schema::builder * CR and rustdoc * CR comments * blop * Added an executor * Sorted the segment readers in the searcher * Update searcher.rs * Fixed unit testst * changed the place where we have the sort-segment-by-count heuristic * using crossbeam::channel * inlining * Comments about panics propagating * Added unit test for executor panicking * Readded default * Removed Default impl * Added unit test for executor	2018-11-30 22:46:59 +09:00
Paul Masurel	78673172d0	Cargo fmt	2018-04-21 20:05:36 +09:00
pmasurel	0804b42afa	Checking the type of range queries	2018-04-16 14:01:10 +09:00
Paul Masurel	3edb3dce6a	Test not passing	2018-01-25 12:46:32 +09:00
Paul Masurel	1e55189db1	NOBUG rustfmt	2017-12-14 19:30:31 +09:00
Paul Masurel	05ce093f97	doc	2017-11-26 11:43:11 +09:00
Paul Masurel	974c321153	cargo fmt	2017-11-26 11:02:02 +09:00
Paul Masurel	ac4d433fad	Renamed analyzer to tokenizer	2017-11-24 16:50:32 +09:00
Paul Masurel	2c9302290f	#191 Analyzer	2017-09-20 22:56:55 +09:00
Paul Masurel	426cc436da	Test passing	2017-09-10 17:48:41 +09:00
Paul Masurel	ca49d6130f	Test not passing	2017-09-09 17:32:47 +09:00
Dru Sellers	2bb85ed575	Minor Doc Changes (#206 ) * Various small documentation tweaks * walking through the docs * Update lib.rs * Update lib.rs * Update mod.rs	2017-08-06 09:22:03 +09:00
Paul Masurel	ac0b1a21eb	Term as a wrapper Small changes Plastic	2017-05-25 23:49:54 +09:00
Paul Masurel	7b2b181652	Merge branch 'master' into issue/136 Conflicts: src/datastruct/stacker/hashmap.rs src/datastruct/stacker/heap.rs src/datastruct/stacker/mod.rs src/indexer/index_writer.rs src/indexer/merger.rs src/indexer/segment_updater.rs src/indexer/segment_writer.rs src/postings/postings_writer.rs src/postings/recorder.rs src/schema/term.rs	2017-05-17 18:40:09 +09:00
Laurentiu Nicola	3dde748b25	Make rustfmt happy	2017-05-16 00:49:05 +03:00
Paul Masurel	4c8f9742f8	format	2017-05-15 22:30:18 +09:00

1 2

71 Commits