tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-01-13 04:22:54 +00:00

Author	SHA1	Message	Date
PSeitz	f9171a3981	fix clippy (#1725 ) * fix clippy * fix clippy fastfield codecs * fix clippy bitpacker * fix clippy common * fix clippy stacker * fix clippy sstable * fmt	2022-12-20 07:30:06 +01:00
Paul Masurel	136a8f4124	Isolating sstable and stacker in independent crates. (#1718 ) Both crates will be used in the new (optional + dynamic) fastfield work.	2022-12-13 11:44:17 +09:00
Adam Reichold	bbb058d976	Replace FNV by rustc-hash Both constructions have similar goals, but rustc-hash is better suited to contemporary CPUs, as it works one word at a time instead of byte by byte.	2022-10-27 00:35:09 +02:00
Paul Masurel	483b1d13d4	Added unit test for long tokens (#1635 ) * Bugfix on long tokens and multivalue text fields. Fixes a minor bug for the strong edge case in which a tokenizer would emit tokens where the last token does not cover the last position. More importantly, this adds unit tests. Closes #1634 * Update src/indexer/segment_writer.rs Co-authored-by: PSeitz <PSeitz@users.noreply.github.com> Co-authored-by: PSeitz <PSeitz@users.noreply.github.com>	2022-10-20 15:05:37 +09:00
Paul Masurel	94313b62f8	Hotfix issue/1629 - position broken (#1633 ) * Bugfix position broken. For a Field with several FieldValues, with a value that contained no token at all, the token position was reinitialized to 0. As a result, PhraseQueries can show some false positives. In addition, after the computation of the position delta, we can underflow u32 and end up with a gigantic delta. We haven't been able to actually explain the bug in 1629, but it is assumed that in some corner cases these deltas can cause a panic. Closes #1629	2022-10-20 11:03:55 +09:00
Christoph Herzog	96c3d54ac7	fix: Fix power of two computation on 32bit architectures (#1624 ) The current `compute_previous_power_of_two()` implementation used for TermHashmap takes and returns `usize`, but actually only works correctly on 64 bit architectures (aka usize == u64). On other architectures the leading_zeros computation is run on the wrong type (must be u64), and leads to overflows. Fixed by simply computing the leading_zeros based on a u64 value.	2022-10-18 11:55:02 +09:00
Pascal Seitz	8d75e451bd	fix truncate, remove mutable access from term	2022-10-17 12:14:35 +08:00
PSeitz	77a415cbe4	rename NothingRecorder to DocIdRecorder (#1615 )	2022-10-13 15:43:40 +09:00
Pascal Seitz	309449dba3	rename to IpAddr	2022-10-07 16:25:01 +08:00
Pascal Seitz	6113e0408c	remove comment	2022-10-07 16:25:01 +08:00
Pascal Seitz	400a20b7af	add ip field add u128 multivalue reader and writer add ip to schema add ip writers, handle merge	2022-10-07 16:25:01 +08:00
Bruce Mitchener	b3bf9a5716	Documentation improvements.	2022-10-05 14:18:10 +07:00
Bruce Mitchener	44e03791f9	Fix warnings when doc'ing private items. (#1579 ) This also fixes a couple of typos, but plenty remain!	2022-10-03 14:24:00 +09:00
Bruce Mitchener	cb252a42af	docs: "associated to" -> "associated with" (#1557 ) This reads better this way.	2022-09-26 20:23:37 +09:00
Bruce Mitchener	ea8e6d7b1d	Tidy up clippy config. (#1547 ) * Checking cfg_attr is no longer necessary. * Don't need multiple `clippy::` prefixes on a name.	2022-09-26 09:37:55 +09:00
Bruce Mitchener	6a88ac3fe3	Documentation improvements. Fix some linking, some grammar, some typos, etc.	2022-09-18 18:05:37 +07:00
Paul Masurel	4e350c5f1b	Clippy	2022-09-02 13:05:00 +09:00
Paul Masurel	08c4412d73	Adding dragon API to build index without any thread. (#1496 ) Closes #1487	2022-09-01 10:32:36 +09:00
Paul Masurel	a451f6d60d	Minor refactoring. (#1495 )	2022-08-31 12:00:58 +09:00
Kian-Meng Ang	014b1adc3e	cargo +nightly fmt	2022-08-17 22:33:44 +08:00
Kian-Meng Ang	84295d5b35	cargo fmt	2022-08-15 21:07:01 +08:00
Kian-Meng Ang	625bcb4877	Fix typos and markdowns Found via these commands: codespell -L crate,ser,panting,beauti,hart,ue,atleast,childs,ond,pris,hel,mot markdownlint .md doc/src/.md --disable MD013 MD025 MD033 MD001 MD024 MD036 MD041 MD003	2022-08-13 18:25:47 +08:00
Kanji Yomoda	af84e74284	Replace deprecated std package's constants on floats and integers (#1420 )	2022-07-22 08:05:08 +09:00
Antoine G	11e4225f23	doc fix (#1391 ) Documentation fix.	2022-06-21 15:53:33 +09:00
boraarslan	26a0fd1fbe	cargo fmt	2022-06-07 10:09:37 +03:00
boraarslan	2981e6c1df	First commit	2022-06-07 10:09:37 +03:00
Pascal Seitz	8807bfd13d	fast field on string enables FAST on string fields, which creates a fastfield containing the term ordinals	2022-03-29 12:40:10 +08:00
Antoine G	e37775fe21	iff->if or if and only if (#1298 ) * has_xxx is_xxx -> if, these functions usually define equivalence xxx returns bool -> specify equivalence when appropriate * fix doc	2022-03-02 11:00:00 +09:00
Paul Masurel	5004290daa	Return an error on certain type of corruption. (#1296 )	2022-03-01 11:35:56 +09:00
Paul Masurel	2ead010c83	Tantivy quickwit (#1293 ) * Added sstable and enabling it by default, and parallel boolean query. * Added async API for FileSlice. * Added async get_doc * Reduce blocksize to 32_000 * Added debug logs Quickwit-specific features are hidden behind the quickwit feature flag.	2022-02-25 17:32:49 +09:00
Paul Masurel	d7b46d2137	Added JSON Type (#1270 ) - Removed useless copy when ingesting JSON. - Bugfix in phrase query with missing field norms. - Disabled range query on default fields Closes #1251	2022-02-24 16:25:22 +09:00
Paul Masurel	d37633e034	Minor changes in indexing. (#1285 )	2022-02-21 17:16:52 +09:00
Paul Masurel	4dc80cfa25	Removes TokenStream chain. (#1283 ) This change is mostly motivated by the introduction of json object. We need to be able to inject a position object to make the position shift.	2022-02-21 09:51:27 +09:00
Paul Masurel	e028515caf	Simplified expull code. (#1281 )	2022-02-18 18:57:10 +09:00
Paul Masurel	bdedefe07d	Adding an IndexingContext object (#1268 )	2022-02-04 15:08:01 +09:00
Paul Masurel	eca6628b3c	Minor refactoring (#1266 )	2022-01-28 15:55:55 +09:00
Paul Masurel	732f6847c0	Field type with codes (#1255 ) * Terms are now typed. This change is backward compatible: While the Term has a byte representation that is modified, a Term itself is a transient object that is not serialized as is in the index. Its .field() and .value_bytes() on the other hand are unchanged. This change offers better Debug information for terms. While not necessary, it will also help in the support for JSON types. * Renamed Hierarchical Facet -> Facet	2022-01-07 20:49:00 +09:00
Paul Masurel	3ea6800ac5	Pleasing clippy (#1253 )	2022-01-06 16:41:24 +09:00
Paul Masurel	c81b3030fa	Issue/922b (#1233 ) * Add a NORMED option on fields. Make fieldnorm indexation optional: * for all types except text => added a NORMED option * for text fields: if STRING, the field has no fieldnorm retained; if TEXT, the field has fieldnorm computed * Finalize making fieldnorm optional for all field types. - Using Option for fieldnorm readers.	2021-12-10 21:12:29 +09:00
Paul Masurel	ebdbb6bd2e	Fixing compilation warnings & clippy comments.	2021-12-10 16:47:59 +09:00
Paul Masurel	dde49ac8e2	Closes #1195 (#1222 ) Removes the indexed option for facets. Facets are now always indexed. Closes #1195	2021-12-02 14:37:19 +09:00
Kanji Yomoda	bd0f9211da	Remove unused sort for segment meta list (#1218 ) * Remove unused sort for segment meta list * Fix segment meta order dependent test	2021-12-01 11:18:17 +09:00
Paul Masurel	7234bef0eb	Issue/1198 (#1201 ) * Unit test reproducing #1198 * Fixing unit test to handle the error from add_document. * Bump project version	2021-11-11 16:42:19 +09:00
Paul Masurel	d18ac136c0	Search simplified (#1175 )	2021-10-18 12:52:43 +09:00
Paul Masurel	02cffa4dea	Code simplification. (#1169 ) Code simplification and Clippy	2021-10-07 14:11:44 +09:00
Paul Masurel	0855649986	Leaning more on the alive (vs delete) semantics. (#1164 )	2021-10-05 18:53:29 +09:00
Pascal Seitz	8d8315f8d0	prealloc vec in postinglist	2021-09-29 09:02:38 +08:00
Pascal Seitz	d7a6a409a1	renames	2021-09-23 20:33:11 +08:00
Pascal Seitz	a1f5cead96	AliveBitSet instead of DeleteBitSet	2021-09-23 20:03:57 +08:00
sigaloid	096ce7488e	Resolve some clippys, format (#1144 ) * cargo +nightly clippy --fix -Z unstable-options	2021-08-26 08:46:00 +09:00
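
The fix described in commit 96c3d54ac7 (power of two computation on 32 bit architectures) can be illustrated with a minimal sketch. This is not the code from the repository: the function name `previous_power_of_two` and the panic on zero are assumptions made for illustration. The only idea taken from the commit message is that `leading_zeros` is computed on a `u64` value rather than on `usize`, so the result is the same whatever the pointer width.

```rust
/// Returns the largest power of two that is less than or equal to `n`.
///
/// Hypothetical illustration of the approach described in the commit
/// message; the name and the zero check are assumptions, not tantivy's API.
fn previous_power_of_two(n: usize) -> usize {
    assert!(n > 0, "n must be non-zero");
    // Widen to u64 before counting leading zeros. The count is then always
    // relative to 64 bits, so `63 - leading_zeros` is the index of the
    // highest set bit on both 32 bit and 64 bit targets.
    let msb = 63u32 - (n as u64).leading_zeros();
    // `msb` never exceeds the bit width of usize, because `n` fit in a usize.
    1usize << msb
}
```

Calling `leading_zeros` directly on a `usize` would count relative to the platform width, so subtracting the result from 63 would give a shift amount that is too large on a 32 bit target.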

441 Commits