Tantivy used to assume that all files could be somehow memory mapped. After this change, `Directory` returns a `FileSlice` that can be reduced and eventually read into an `OwnedBytes` object. Long, blocking IO operations are still required, but they no longer span the entire file.
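A self-contained sketch of the idea, using stand-in types rather than tantivy's actual signatures: a `FileSlice` is a cheap handle plus a byte range, and only `read_bytes` performs the blocking IO, over just the sliced range.
```rust
use std::io;
use std::sync::Arc;

// Stand-in types mirroring the description above, not tantivy's real API.
#[derive(Clone)]
struct FileSlice {
    data: Arc<Vec<u8>>, // stand-in for an underlying file handle
    start: usize,
    end: usize,
}

struct OwnedBytes(Vec<u8>);

impl FileSlice {
    // Reducing the slice is cheap: no IO happens here.
    fn slice(&self, from: usize, to: usize) -> FileSlice {
        FileSlice {
            data: Arc::clone(&self.data),
            start: self.start + from,
            end: self.start + to,
        }
    }

    // Only this call performs the (potentially long, blocking) read,
    // and it covers just the sliced range, never the whole file.
    fn read_bytes(&self) -> io::Result<OwnedBytes> {
        Ok(OwnedBytes(self.data[self.start..self.end].to_vec()))
    }
}
```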
`go_to_first_doc` was typically calling `seek` with a target smaller than
the current doc.
Since `SegmentPostings` typically does a linear search on the full block,
regardless of the current position, this could make our segment postings
go backward.
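In other words, `seek` should only ever be called with a forward target. An illustrative guard, not the actual patch:
```rust
// Only seek forward: SegmentPostings' block-level linear search does not
// account for the current position, so a backward target can move the cursor back.
if target > docset.doc() {
    docset.seek(target);
}
```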
- Change in the DocSet and Scorer API. (@fulmicoton).
A freshly created DocSet points directly to its first doc. A sentinel value called TERMINATED marks the end of a DocSet.
`.advance()` returns the new DocId. `Scorer::skip(target)` has been replaced by `Scorer::seek(target)`, which returns the resulting DocId.
As a result, iterating through a DocSet now looks as follows:
```rust
let mut doc = docset.doc();
while doc != TERMINATED {
    // ...
    doc = docset.advance();
}
```
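Seeking follows the same pattern; a small sketch, assuming `seek` is exposed on the DocSet as described above:
```rust
// Position on the first doc >= target, then iterate as before.
let mut doc = docset.seek(target);
while doc != TERMINATED {
    // ...
    doc = docset.advance();
}
```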
This change made it possible to greatly simplify much of the DocSet code.
- Misc internal optimizations and introduction of the `Scorer::for_each_pruning` function. (@fulmicoton)
* Make TweakScore and CustomScore mutable
Make TweakScore and CustomScore mutable at the segment level.
Addresses issue #806
* Add example to show tweak_score working for facets
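A rough sketch of the closure shape, assuming the `TopDocs::tweak_score` API; with this change, the per-segment closure may carry mutable state:
```rust
use tantivy::collector::TopDocs;
use tantivy::{DocId, Score, SegmentReader};

let collector = TopDocs::with_limit(10).tweak_score(
    |_segment_reader: &SegmentReader| {
        let mut docs_seen: u64 = 0; // mutable per-segment state, now allowed
        move |_doc: DocId, original_score: Score| {
            docs_seen += 1;
            original_score * 1.5
        }
    },
);
```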
* Prevent tokens from being stored in the document store.
This commit adds a `prepare_for_store` method to `Document`, which converts all
`PreTokenizedString` values into plain `String` values. The method is called
before a document is added to the document store, so tokens are never
saved there (see the sketch below). The commit also makes small changes to
comments in the pre_tokenized_text example.
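Conceptually, with a stand-in enum rather than tantivy's actual `Value` type:
```rust
// Stand-in for the stored value type; illustrative only.
enum FieldValue {
    Str(String),
    PreTokStr { text: String, tokens: Vec<String> },
}

// What prepare_for_store boils down to: keep the original text and drop
// the token list, so tokens never reach the document store.
fn prepare_for_store(values: Vec<FieldValue>) -> Vec<FieldValue> {
    values
        .into_iter()
        .map(|value| match value {
            FieldValue::PreTokStr { text, .. } => FieldValue::Str(text),
            other => other,
        })
        .collect()
}
```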
* Avoid storing the pre-tokenized text.
* Added handling of pre-tokenized text fields (#642).
* Updated changelog and examples concerning #642.
* Added tokenized_text method to Value implementation.
* Implemented From<TokenizedString> for TokenizedStream.
* Removed tokenized flag from TextOptions and code reliance on the flag.
* Changed naming to use word "pre-tokenized" instead of "tokenized".
* Updated example code.
* Fixed comments.
* Minor code refactoring. Test improvements.
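A hedged sketch of what a pre-tokenized value looks like after the renaming; the field layout follows the pre_tokenized_text example's general shape, and exact signatures may differ by version:
```rust
use tantivy::tokenizer::{PreTokenizedString, Token};

// Text plus ready-made tokens: tantivy indexes the supplied tokens as-is
// instead of running its own tokenizer over the text.
let pre_tok = PreTokenizedString {
    text: "hello world".to_string(),
    tokens: vec![
        Token {
            offset_from: 0,
            offset_to: 5,
            position: 0,
            text: "hello".to_string(),
            position_length: 1,
        },
        Token {
            offset_from: 6,
            offset_to: 11,
            position: 1,
            text: "world".to_string(),
            position_length: 1,
        },
    ],
};
```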
* Split Collector into an overall Collector and a per-segment SegmentCollector. This is a precursor to cross-segment parallelism and, as a side benefit, cleans up per-segment fields from Option<T> to just T. A sketch of the split follows.
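A minimal sketch with hypothetical trait shapes; the real tantivy traits carry more methods and associated types:
```rust
// Per-index collector: builds one SegmentCollector per segment.
trait Collector {
    type Child: SegmentCollector;
    // Per-segment state lives in Child, so it no longer needs to be
    // Option<T> on the top-level collector.
    fn for_segment(&self, segment_ord: u32) -> Self::Child;
}

// Per-segment collector: owns its segment-local state outright.
trait SegmentCollector {
    fn collect(&mut self, doc: u32, score: f32);
}
```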
* Attempt to add MultiCollector back
* MultiCollector working; the chained collector is still broken, though
* Fix chained collector
* Fix test
* Make Weight Send+Sync for parallelization purposes
* Expose parameters of RangeQuery for external usage
* Removed &mut self
* Fixing tests
* Restored TestCollectors
* MultiCollector working
* Chained collector working
* Fixed broken unit test
* Simplifying API
* Better syntax
* Simplifying top_collector
* Refactoring
* Sync with master
* Added multithreaded search
* Collector refactoring
* Schema::builder
* CR and rustdoc
* CR comments
* Added an executor
* Sorted the segment readers in the searcher
* Update searcher.rs
* Fixed unit tests
* Changed where the sort-segment-by-count heuristic is applied
* Using crossbeam::channel
* Inlining
* Comments about panics propagating (see the sketch below)
* Added unit test for executor panicking
* Readded default
* Removed Default impl
* Added unit test for executor
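The executor entries above boil down to this pattern; a self-contained sketch using std threads, not tantivy's actual Executor API:
```rust
use std::panic;
use std::thread;

// Run one task per segment, possibly across threads, and propagate any
// panic from a worker thread to the calling thread.
fn map_parallel<T: Send>(tasks: Vec<Box<dyn FnOnce() -> T + Send>>) -> Vec<T> {
    thread::scope(|scope| {
        let handles: Vec<_> = tasks
            .into_iter()
            .map(|task| scope.spawn(task))
            .collect();
        handles
            .into_iter()
            .map(|handle| match handle.join() {
                Ok(value) => value,
                // Re-raise the worker's panic payload in the caller.
                Err(payload) => panic::resume_unwind(payload),
            })
            .collect()
    })
}
```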
* Pull all creation methods next to each other
The goal here is to make it clear which methods perform the same
function, and to help standardize the API calls.
* Make `from_directory` private
This seems to be an internal function, so let's make it private.
* Rename `create` to `create_in_dir`
This lets the name match the `create_in_ram` pattern and opens up
`create` for the generic implementation.
* Implement the generic create function
All of the create methods now delegate to the common `create` function,
and future `create_in_*` functions now have a clear pattern
to follow as well (sketched below).
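A self-contained sketch of the delegation pattern, using stand-in types (`MyIndex` and the `Dir` trait are illustrative, not tantivy's actual types):
```rust
use std::io;
use std::path::Path;

// Stand-in directory abstraction.
trait Dir {}
struct RamDir;
impl Dir for RamDir {}
struct MmapDir;
impl MmapDir {
    fn open(_path: &Path) -> io::Result<MmapDir> {
        Ok(MmapDir)
    }
}
impl Dir for MmapDir {}

struct MyIndex {
    dir: Box<dyn Dir>,
}

impl MyIndex {
    // The generic entry point: every creation path funnels through here.
    fn create<D: Dir + 'static>(dir: D) -> MyIndex {
        MyIndex { dir: Box::new(dir) }
    }

    // Convenience constructors simply delegate to the generic `create`.
    fn create_in_ram() -> MyIndex {
        MyIndex::create(RamDir)
    }

    fn create_in_dir(path: &Path) -> io::Result<MyIndex> {
        Ok(MyIndex::create(MmapDir::open(path)?))
    }
}
```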