* prepare for multiple fastfield codecs
prepare for multiple fastfield codecs by wrapping the codecs in an enum #1042
* add FastFieldSerializer trait, add DynamicFastFieldSerializer
add FastFieldSerializer trait
add DynamicFastFieldSerializer enum to wrap all implementors of the FastFieldSerializer trait
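A hedged sketch of the enum-wrapping pattern these commits describe, reusing the names from the commits; the `serialize` method is a hypothetical stand-in, not the real trait surface.
```rust
trait FastFieldSerializer {
    // Hypothetical stand-in method; the real trait surface differs.
    fn serialize(&mut self, value: u64);
}

struct BitpackedFastFieldSerializer;

impl FastFieldSerializer for BitpackedFastFieldSerializer {
    fn serialize(&mut self, _value: u64) {
        // bitpack the value (elided)
    }
}

// Wrapping all implementors in one enum keeps dispatch static over a
// closed set of codecs, rather than boxing a trait object.
enum DynamicFastFieldSerializer {
    Bitpacked(BitpackedFastFieldSerializer),
}

impl FastFieldSerializer for DynamicFastFieldSerializer {
    fn serialize(&mut self, value: u64) {
        match self {
            DynamicFastFieldSerializer::Bitpacked(inner) => inner.serialize(value),
        }
    }
}
```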
* add estimation for fastfield bitpacker
Added a FacetOptions for HierarchicalFacet, which adds indexed and stored flags to it.
Propagated the change and updated tests accordingly.
Added a test to ensure that a facet field without the indexed flag is handled correctly.
Added a `path()` function to the `Value` implementation to return the stored facet.
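A minimal usage sketch, assuming the schema builder takes the new FacetOptions; exact flag setters may differ across versions.
```rust
use tantivy::schema::{FacetOptions, Schema};

fn main() {
    let mut schema_builder = Schema::builder();
    // FacetOptions carries the new indexed/stored flags for facet fields.
    let _category = schema_builder.add_facet_field(
        "category",
        FacetOptions::default().set_stored(),
    );
    let _schema = schema_builder.build();
}
```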
Tantivy used to assume that all files could somehow be memory-mapped. After this change, Directory returns a `FileSlice` that can be reduced and eventually read into an `OwnedBytes` object. Long, blocking IO operations are still required, but they no longer span the entire file.
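A short sketch of the resulting read path, assuming the `FileSlice` API introduced here (`open_read`, `slice`, `read_bytes`); exact signatures may differ across versions.
```rust
use std::path::Path;
use tantivy::directory::Directory;

fn read_prefix(dir: &dyn Directory, n: usize) -> tantivy::Result<()> {
    // open_read no longer implies that the whole file gets memory-mapped.
    let file_slice = dir.open_read(Path::new("some_segment_file"))?;
    // Narrow the slice to the range we actually need...
    let prefix = file_slice.slice(0..n);
    // ...and read only that range into memory as OwnedBytes.
    let bytes = prefix.read_bytes()?;
    assert_eq!(bytes.len(), n);
    Ok(())
}
```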
go_to_first_doc was typically calling seek with a target smaller than
the current doc.
Since SegmentPostings typically does a linear search on the full block,
regardless of the current position, this could make our segment postings
go backward.
- Change in the DocSet and Scorer API. (@fulmicoton).
A freshly created DocSet points directly to its first doc. A sentinel value called TERMINATED marks the end of a DocSet.
`.advance()` returns the new DocId. `Scorer::skip(target)` has been replaced by `Scorer::seek(target)` and returns the resulting DocId.
As a result, iterating through a DocSet now looks as follows:
```rust
let mut doc = docset.doc();
while doc != TERMINATED {
    // ...
    doc = docset.advance();
}
```
The change made it possible to greatly simplify a lot of the docset's code.
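A small sketch of seeking under the same API, assuming `seek` positions the DocSet on the first doc greater than or equal to the target and returns it:
```rust
// seek never goes backward: target must be >= the current doc.
let doc = docset.seek(target);
if doc == TERMINATED {
    // no document at or beyond target in this docset
}
```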
- Misc internal optimization and introduction of the `Scorer::for_each_pruning` function. (@fulmicoton)
* Make TweakScore and CustomScore mutable
Make TweakScore and CustomScore mutable at the segment level.
Addresses issue #806
* Add example to show tweak_score working for facets
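A hedged sketch of what this enables, following the shape of `TopDocs::tweak_score`: the per-segment closure may now capture and mutate state. Exact trait bounds may differ across versions.
```rust
use tantivy::collector::TopDocs;
use tantivy::{DocId, Score, SegmentReader};

fn main() {
    let _collector = TopDocs::with_limit(10).tweak_score(
        |_segment_reader: &SegmentReader| {
            // Per-segment mutable state; allowed now that the inner
            // closure may be FnMut.
            let mut docs_scored: u64 = 0;
            move |_doc: DocId, original_score: Score| {
                docs_scored += 1;
                original_score
            }
        },
    );
}
```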
* Prevent tokens from being stored in the document store.
The commit adds a prepare_for_store method to Document, which changes all
PreTokenizedString values into String values. The method is called
before adding a document to the document store, to prevent tokens from
being saved there. The commit also makes small changes to comments in the
pre_tokenized_text example.
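A minimal sketch of the idea, assuming tantivy's `Value::PreTokStr` variant and the `PreTokenizedString { text, tokens }` layout; the actual prepare_for_store implementation may differ.
```rust
use tantivy::schema::Value;

// Replace a pre-tokenized value with its plain text so that only the
// text, not the token list, ends up in the document store.
fn strip_tokens(value: Value) -> Value {
    match value {
        Value::PreTokStr(pre_tokenized) => Value::Str(pre_tokenized.text),
        other => other,
    }
}
```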
* Avoid storing the pretokenized text.
* Added handling of pre-tokenized text fields (#642).
* Updated changelog and examples concerning #642.
* Added tokenized_text method to Value implementation.
* Implemented From<TokenizedString> for TokenizedStream.
* Removed tokenized flag from TextOptions and code reliance on the flag.
* Changed naming to use word "pre-tokenized" instead of "tokenized".
* Updated example code.
* Fixed comments.
* Minor code refactoring. Test improvements.
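A short usage sketch of the feature from #642, assuming the `PreTokenizedString`/`Token` types and `Document::add_pre_tokenized_text`; exact shapes may differ across versions.
```rust
use tantivy::schema::Field;
use tantivy::tokenizer::{PreTokenizedString, Token};
use tantivy::Document;

fn pre_tokenized_doc(text_field: Field) -> Document {
    // The tokens are supplied by the caller instead of a tantivy tokenizer.
    let pre_tokenized = PreTokenizedString {
        text: "hello world".to_string(),
        tokens: vec![
            Token {
                offset_from: 0,
                offset_to: 5,
                position: 0,
                text: "hello".to_string(),
                position_length: 1,
            },
            Token {
                offset_from: 6,
                offset_to: 11,
                position: 1,
                text: "world".to_string(),
                position_length: 1,
            },
        ],
    };
    let mut doc = Document::default();
    doc.add_pre_tokenized_text(text_field, pre_tokenized);
    doc
}
```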
* Split Collector into an overall Collector and a per-segment SegmentCollector. This is a precursor to cross-segment parallelism and, as a side benefit, cleans up per-segment fields from being Option<T> to just T.
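A sketch of the shape of the split; the signatures follow tantivy's later published traits and may have differed at the time of this change.
```rust
use tantivy::{DocId, Score, SegmentReader};

// Lives for the whole search: builds one child per segment, then merges.
trait Collector: Send + Sync {
    type Child: SegmentCollector;
    type Fruit;

    fn for_segment(
        &self,
        segment_ord: u32,
        reader: &SegmentReader,
    ) -> tantivy::Result<Self::Child>;

    fn merge_fruits(
        &self,
        fruits: Vec<<Self::Child as SegmentCollector>::Fruit>,
    ) -> tantivy::Result<Self::Fruit>;
}

// Per-segment state: plain T instead of Option<T>, because each child is
// constructed for exactly one segment.
trait SegmentCollector: 'static {
    type Fruit: Send;
    fn collect(&mut self, doc: DocId, score: Score);
    fn harvest(self) -> Self::Fruit;
}
```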
* Attempt to add MultiCollector back
* Working. Chained collector is broken, though
* Fix chained collector
* Fix test
* Make Weight Send+Sync for parallelization purposes
* Expose parameters of RangeQuery for external usage
* Removed &mut self
* fixing tests
* Restored TestCollectors
* multicollector working
* chained collector working
* test broken
* fixing unit test
* simplifying API
* better syntax
* Simplifying top_collector
* refactoring
* Sync with master
* Added multithread search
* Collector refactoring
* Schema::builder
* CR and rustdoc
* CR comments
* Added an executor (see the sketch after this list)
* Sorted the segment readers in the searcher
* Update searcher.rs
* Fixed unit tests
* Moved the sort-segment-by-count heuristic
* using crossbeam::channel
* inlining
* Comments about panics propagating
* Added unit test for executor panicking
* Readded default
* Removed Default impl
* Added unit test for executor
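An illustrative fan-out in the spirit of the executor mentioned above, using crossbeam::channel as in these commits; this is a hypothetical sketch, not tantivy's actual Executor API.
```rust
use crossbeam::channel;
use std::thread;

// Fan per-segment work out over threads and gather results in input order.
fn map_parallel<T, R, F>(f: F, inputs: Vec<T>) -> Vec<R>
where
    T: Send,
    R: Send,
    F: Fn(T) -> R + Sync,
{
    let (tx, rx) = channel::unbounded();
    thread::scope(|scope| {
        for (idx, input) in inputs.into_iter().enumerate() {
            let tx = tx.clone();
            let f = &f;
            scope.spawn(move || {
                // A panic here propagates when the scope joins the thread.
                tx.send((idx, f(input))).expect("receiver is alive");
            });
        }
        drop(tx); // close the channel once all work is queued
    });
    let mut results: Vec<(usize, R)> = rx.into_iter().collect();
    results.sort_by_key(|&(idx, _)| idx);
    results.into_iter().map(|(_, result)| result).collect()
}
```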