tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-01-08 10:02:55 +00:00

Author	SHA1	Message	Date
Paul Masurel	92dac7af5c	Return an error instead of panicking when sorting by a non fast field. (#748 ) Closes #747	2020-01-08 13:41:02 +09:00
Paul Masurel	801905d77f	Davide romanini arm atomic mutex (#746 ) * Add atomic mutex implementation for ARM. * Applied rustfmt. * rustfmt Co-authored-by: davide-romanini <davide.romanini@gmail.com>	2019-12-30 23:42:11 +09:00
Paul Horn	8f5ac86f30	Expose UserOperation as a public type. (#744 ) In order to make `IndexWriter::run` callable from outside of the create, the `UserOperation` type needs to be publicly available. Since the `indexer` module is private, we just export the `UserOperation` type directly.	2019-12-29 22:37:13 +09:00
Paul Masurel	d12a06b65b	Tiny code simplification.	2019-12-26 09:33:17 +09:00
Minoru Osuka	749432f949	Make SchemaBuilder::add_field() public (#742 ) * Make add_field() to public * cargo format	2019-12-25 20:37:34 +09:00
Paul Masurel	c1400f25a7	Handle facet search in the QueryParser. (#741 ) Closes #738	2019-12-25 17:43:33 +09:00
Paul Masurel	87120acf7c	Bump version 0.11.3	2019-12-20 21:22:43 +09:00
Paul Masurel	401f74f7ae	Implement fast field for DateTime. (#736 )	2019-12-20 21:20:15 +09:00
Paul Masurel	03d31f6713	Update CHANGELOG	2019-12-19 10:07:43 +09:00
Paul Masurel	a57faf07f6	Added a constructor for `WatchHandle` (#734 ) Closes #731	2019-12-19 10:06:02 +09:00
Paul Masurel	562ea9a839	Merge branch 'master' of github.com:tantivy-search/tantivy	2019-12-19 09:32:50 +09:00
Paul Masurel	cf92cc1ada	Closes #732 (#733 ) The future returned by `IndexWriter::merge` does not borrow `&mut self`	2019-12-18 23:25:22 +09:00
Paul Masurel	f6000aece7	Closes #732 The future returned by `IndexWriter::merge` does not borrow `&mut self`	2019-12-18 21:48:51 +09:00
Paul Masurel	2b3fe3a2b5	Bumped version for hotfix 0.11.1	2019-12-17 21:10:50 +09:00
Paul Masurel	0fde90faac	Closes #729 (#730 ) Bug related with merge and deletes...	2019-12-17 21:09:08 +09:00
Paul Masurel	5838644b03	Added README in tantivy-query-grammar 0.11	2019-12-16 08:41:21 +09:00
Paul Masurel	c0011edd05	Added version for tantivy-grammar before publish	2019-12-16 08:35:17 +09:00
petr-tik	431c187a60	Make error handling richer in Footer::is_compatible (#724 ) * WIP implemented is_compatible hide Footer::from_bytes from public consumption - only found Footer::extract used outside the module Add a new error type for IncompatibleIndex add a prototypical call to footer.is_compatible() in ManagedDirectory::open_read to make sure we error before reading it further * Make error handling more ergonomic Add an error subtype for OpenReadError and converters to TantivyError * Remove an unnecessary assert it's follower by the same check that Errors instead of panicking * Correct the compatibility check logic Leave a defensive versioned footer check to make sure we add new logic handling when we add possible footer versions Restricted VersionedFooter::from_bytes to be used inside the crate only remove a half-baked test * WIP. * Return an error if index incompatible - closes #662 Enrich the error type with incompatibility Change return type to Result<bool, TantivyError>, instead of bool Add an Incompatibility enum that enriches the IncompatibleIndex error variant with information, which then allows us to generate a developer-friendly hint how to upgrade library version or switch feature flags for a different compression algorithm Updated changelog Change the signature of is_compatible Added documentation to the Incompatibility Added a conditional test on a Footer with lz4 erroring	2019-12-14 09:14:33 +09:00
Caio Romão	392abec420	Make u64_lenient() handle f64 fast fields too (#726 ) * Make u64_lenient() handle f64 fast fields too Without this, we get a panic during merge since the merger will get a `None` where it expects something. Prior to this patch, you can reproduce the panic with: use tantivy::{ self, schema::{SchemaBuilder, FAST}, Document, Index, Result, }; #[test] fn pass() -> Result<()> { let mut builder = SchemaBuilder::new(); let field = builder.add_f64_field("f64", FAST); let index = Index::create_in_ram(builder.build()); let mut writer = index.writer_with_num_threads(1, 50_000_000)?; for i in 0..1000 { let mut doc = Document::new(); doc.add_f64(field, 0.42); writer.add_document(doc); if i % 5 == 0 { writer.commit()?; } } writer.commit()?; Ok(()) } * Add test to verify that f64 fields are merged * Ensure multi-valued fast fields can be merged too	2019-12-13 23:41:22 +09:00
Paul Masurel	dfbe337fe2	Optimize deletes (#723 ) Closes #710	2019-12-13 09:50:00 +09:00
Paul Masurel	b9896c4962	Cleanup	2019-12-10 23:01:07 +09:00
Paul Masurel	afa5715e56	Added unit test.	2019-12-10 22:49:32 +09:00
Paul Masurel	79474288d0	Some clippy minor fixes (#722 )	2019-12-09 13:40:04 +09:00
Paul Masurel	daf64487b4	Fixing JSON se/deserialization of dates. (#721 ) Closes #719	2019-12-09 13:31:35 +09:00
Ximo Guanter	00816f5529	Fix outdated reference in documentation (#720 )	2019-12-08 18:10:50 +09:00
Paul Masurel	f73787e6e5	Merge branch 'master' of github.com:tantivy-search/tantivy	2019-12-06 10:06:09 +09:00
Paul Masurel	5cffa71467	Using census 0.4	2019-12-06 10:04:01 +09:00
Christian Hunstad	02af28b3b7	add norwegian stemmer (#717 )	2019-11-27 21:08:59 +09:00
Paul Masurel	afe0134d0f	Kkoziara remove tokens from doc store (#715 ) * Prevent tokens from being stored in the document store. Commit adds prepare_for_store method to Document, which changes all PreTokenizedString values into String values. The method is called before adding document to the document store to prevent tokens from being saved there. Commit also adds small changes to comments in pre_tokenized_text example. * Avoid storing the pretokenized text.	2019-11-25 22:39:12 +09:00
Christian Hunstad	db9e81d0f9	Updated rust-stemmers version to 1.2 (#716 ) * Updated rust-stemmers version to 1.2 * 1.2.0 -> 1.2	2019-11-25 22:38:48 +09:00
Paul Masurel	3821f57ecc	Closes #712 (#714 ) Fixing the memory leak in the DeleteQueue.	2019-11-25 15:57:29 +09:00
Paul Masurel	d379f98b22	Waiting for indexing threads when dropping IndexWriter	2019-11-23 15:00:27 +09:00
Paul Masurel	ef3eddf3da	clippy first stab (#711 )	2019-11-22 13:09:35 +09:00
Paul Masurel	08a2368845	Closes #708 (#709 ) Fixes a race condition in the test.	2019-11-21 11:41:59 +09:00
Paul Masurel	1868fc1e2c	Text fix	2019-11-20 23:00:39 +09:00
Paul Masurel	451a0252ab	thread pool merge (#704 )	2019-11-20 21:18:05 +09:00
Paul Masurel	42756c7474	Removing futures-cpupool and upgrading to futures-0.3	2019-11-15 18:35:31 +09:00
Paul Masurel	598b076240	Making some of the IndexWriter's method public.	2019-11-11 12:41:45 +09:00
Paul Masurel	f1f96fc417	Updating some doc.	2019-11-11 10:04:12 +09:00
Paul Masurel	9c941603f5	Petr tik n662 errror incompatible footer version (#696 ) * code tidy-up Replace `20` magic constant with COMMON_FOOTER_SIZE Add a docstring showing how footer is serialised Add a test for footer length checking * Add more tests for VersionedFooter successful and panicking .to_bytes() calls * Minor changes in footer.rs	2019-11-10 14:40:06 +09:00
Paul Masurel	fb3d6fa332	Adding Value::From<PretokenizedText> (#697 )	2019-11-10 14:39:44 +09:00
Paul Masurel	88fd7f091a	SegmentUpdater.add_segment does not need to return true (#693 ) 0.1	2019-11-09 21:18:51 +09:00
Jacob Brown	6e4fdfd4bf	replace scoped_pool (#685 )	2019-11-07 10:26:08 +09:00
kkoziara	0519056bd8	Added handling of pre-tokenized text fields (#642 ). (#669 ) * Added handling of pre-tokenized text fields (#642). * * Updated changelog and examples concerning #642. * Added tokenized_text method to Value implementation. * Implemented From<TokenizedString> for TokenizedStream. * * Removed tokenized flag from TextOptions and code reliance on the flag. * Changed naming to use word "pre-tokenized" instead of "tokenized". * Updated example code. * Fixed comments. * Minor code refactoring. Test improvements.	2019-11-07 10:10:56 +09:00
dependabot-preview[bot]	7305ad575e	Update smallvec requirement from 0.6 to 1.0 (#686 ) Updates the requirements on [smallvec](https://github.com/servo/rust-smallvec) to permit the latest version. - [Release notes](https://github.com/servo/rust-smallvec/releases) - [Commits](https://github.com/servo/rust-smallvec/compare/v0.6.0...v1.0.0) Signed-off-by: dependabot-preview[bot] <support@dependabot.com>	2019-11-07 09:55:33 +09:00
Paul Masurel	79f64ac2f4	Create FUNDING.yml	2019-11-05 16:26:12 +09:00
Paul Masurel	67bce6cbf2	Fixing the construction of the DeleteBitset. (#683 ) Closes #681	2019-11-04 15:39:11 +09:00
xiaoniu-578fa6bff964d005	e5316a4388	Reduce unnecessary clone. (#684 )	2019-11-04 13:57:59 +09:00
Mathias Svensson	6a8a8557d2	Use `slice::iter` instead of `into_iter` to avoid future breakage (#679 ) * Use `slice::iter` instead of `into_iter` to avoid future breakage `an_array.into_iter()` currently just works because of the autoref feature, which then calls `<[T] as IntoIterator>::into_iter`. But in the future, arrays will implement `IntoIterator`, too. In order to avoid problems in the future, the call is replaced by `iter()` which is shorter and more explicit. * cargo fmt	2019-10-31 20:59:50 +09:00
Alberto Piai	3a65dc84c8	TopDocs: ensure stable sorting on equal score (#675 ) * TopDocs: ensure stable sorting on equal score When selecting the top K documents by score, we need to ensure stable sorting. Until now, for documents with the same score, we were relying on the (arbitrary) order returned by the BinaryHeap used to implement the collectors. This patch fixes the problem by explicitly using the doc address when harvesting the `TopSegmentCollector` and when merging the results in `TopCollector::merge_fruits()`. This is important (for example) to implement pagination correctly using the TopDocs collector. If sorting isn't stable, documents that have the same score might be ranked in different positions depending on the specific K that was used, thus appearing in two different pages, or in none at all. Fixes gh-671 * TMP: alternative solution (see previous commit) If we add the constrait that D is also PartialOrd in ComparableDoc<T, D>, then we can move the comparison by doc address directly in the cmp implementation of ComparableDoc. * TMP rebase as first commit: add benchmarks for TopSegmentCollector * fixup! TMP: alternative solution (see previous commit) * TMP add changelog entry * TMP run cargo fmt	2019-10-26 15:27:25 +09:00

1 2 3 4 5 ...

1523 Commits