Bruce Mitchener
cb252a42af
docs: "associated to" -> "associated with" ( #1557 )
...
This reads better this way.
2022-09-26 20:23:37 +09:00
Bruce Mitchener
ea8e6d7b1d
Tidy up clippy config. ( #1547 )
...
* Checking cfg_attr is no longer necessary.
* Don't need multiple `clippy::` prefixes on a name.
2022-09-26 09:37:55 +09:00
Paul Masurel
1998111521
Minor refactoring fast fields ( #1537 )
2022-09-21 12:46:11 +09:00
Bruce Mitchener
cf02e32578
Improvements to doc linking, grammar, etc.
2022-09-19 18:10:22 +07:00
Bruce Mitchener
68b6254b09
docs: Remove mentions of stream_field method.
...
This method doesn't exist, so no need to mention it.
2022-09-18 23:13:41 +07:00
Paul Masurel
8e775b6c3d
Refactoring dyn Column ( #1502 )
2022-09-02 17:26:30 +09:00
Kian-Meng Ang
014b1adc3e
cargo +nightly fmt
2022-08-17 22:33:44 +08:00
Kian-Meng Ang
84295d5b35
cargo fmt
2022-08-15 21:07:01 +08:00
Kian-Meng Ang
625bcb4877
Fix typos and markdowns
...
Found via these commands:
codespell -L crate,ser,panting,beauti,hart,ue,atleast,childs,ond,pris,hel,mot
markdownlint *.md doc/src/*.md --disable MD013 MD025 MD033 MD001 MD024 MD036 MD041 MD003
2022-08-13 18:25:47 +08:00
Kanji Yomoda
af84e74284
Replace deprecated std package's constants on floats and integers ( #1420 )
2022-07-22 08:05:08 +09:00
Pier-Olivier Thibault
775e936f7d
FileHandle: Change from boxed to Arc. ( #1415 )
...
* FileHandle: Change from boxed to Arc.
Changing from a Box<dyn FileHandle> to an Arc<dyn FileHandle> would
allow for a user of tantivy to manage file handles outside of tantivy
and be able to manage their life cycle.
* Fix: Rust linter
2022-07-21 16:19:18 +09:00
Paul Masurel
1273f33338
Fixed comment.
2022-05-06 18:35:25 +09:00
Paul Masurel
e30449743c
Shortens blocks' last_key in the SSTable block index. ( #1361 )
...
Right now we store last key in the blocks of the SSTable index.
This PR replaces the last key by a shorter string that is greater or
equal and still lesser than the next key.
This property is sufficiently to ensure the block index
works properly.
Related to quickwit#1366
2022-05-06 16:29:06 +08:00
Antoine G
386ffab76c
Fix documentation regression ( #1359 )
...
This breaks the doc on doc.rs as the type seems to shadow the struct https://docs.rs/tantivy/latest/tantivy/termdict/type.TermDictionary.html
introduced by #1293 which may not have been up to date with what was done in #1242
2022-05-05 14:59:25 +09:00
Antoine G
e37775fe21
iff->if or if and only if ( #1298 )
...
* has_xxx is_xxx -> if, these function usualy define equivalence
xxx returns bool -> specify equivalence when appropriate
* fix doc
2022-03-02 11:00:00 +09:00
Paul Masurel
2ead010c83
Tantivy quickwit ( #1293 )
...
* Added sstable and enabling it by default, and parallel boolean query.
* Added async API for FileSlice.
* Added async get_doc
* Reduce blocksize to 32_000
* Added debug logs
Quickwit specific feature a hidden behind the quickwit feature flag.
2022-02-25 17:32:49 +09:00
Paul Masurel
eca6628b3c
Minor refactoring ( #1266 )
2022-01-28 15:55:55 +09:00
Paul Masurel
3ea6800ac5
Pleasing clippy ( #1253 )
2022-01-06 16:41:24 +09:00
Antoine G
3129d86743
doc(termdict) expose structs ( #1242 )
...
* doc(termdict) expose structs
also add merger doc + lint
refs #1232
2022-01-03 22:20:31 +09:00
PSeitz
352e0cc58d
Adde demux operation ( #1150 )
...
* add merge for DeleteBitSet, allow custom DeleteBitSet on merge
* forward delete bitsets on merge, add tests
* add demux operation and tests
2021-10-06 16:05:16 +09:00
Pascal Seitz
3265f7bec3
dissolve common module
2021-08-19 23:26:34 +01:00
Pascal Seitz
1e4df54ab3
fix clippy
2021-07-01 17:41:53 +02:00
Pascal Seitz
10f056fbb4
apply clippy fixes
2021-07-01 17:08:44 +02:00
Stéphane Campinas
41ea14840d
add benchmark of term streams merge ( #1024 )
...
* add benchmark of term streams merge
* use union based on FST for merging the term dictionaries
* Rename TermMerger benchmark
2021-05-31 23:15:01 +09:00
Pascal Seitz
c200d59d1e
add blocked bitpacker, add benches
2021-04-29 19:53:54 +02:00
Pascal Seitz
daa53522b5
move tantivy bitpacker to crate, refator bitpacker
...
remove byteorder dependency
2021-04-29 16:40:11 +02:00
Paul Masurel
2dc5403e7b
Closes #1022
2021-04-26 14:01:14 +09:00
Paul Masurel
aead5d4068
First stab
2021-04-26 12:46:06 +09:00
Paul Masurel
39dd8cfe24
Cargo clippy. Acronym should not be full uppercase apparently.
2021-04-26 11:49:18 +09:00
Paul Masurel
868f4fd174
Removing TermMerger::next().
...
Closing #933
2021-04-14 12:06:04 +09:00
Paul Masurel
31137beea6
Replacing (start, end) by Range
2021-03-10 14:06:21 +09:00
Paul Masurel
be626083a0
Reorganized and added termdict unit tests.
2020-12-07 12:50:36 +09:00
Paul Masurel
af6dfa1856
Small refactoring
2020-12-03 14:27:05 +09:00
Paul Masurel
654c400a0b
TermDictionary.finish does not flush
2020-12-03 13:36:25 +09:00
Paul Masurel
80a99539ce
Several TermDict operation now returns an io::Result
2020-12-03 13:13:11 +09:00
Paul Masurel
3491645e69
Moved the term merger
2020-12-03 10:24:04 +09:00
Paul Masurel
b4b3bc7acd
Cargo fmt
2020-12-03 10:08:38 +09:00
Paul Masurel
521c7b271b
Isolated fst impl of termdictionary in a specific module.
2020-12-02 21:18:33 +09:00
Paul Masurel
6d4b982417
Marked blockwand test as ignored.
...
- Using impl trait for iterating `matching_segments` in the termdict
merger
2020-11-16 13:44:14 +09:00
Paul Masurel
d23aee76c9
Avoid loading fieldnorms when not necessary
2020-11-09 15:50:16 +09:00
Paul Masurel
b5f3dcdc8b
TermInfo contain the end_offset of the postings.
...
We slice the ReadOnlySource tightly.
2020-11-06 15:18:51 +09:00
Paul Masurel
01b4aa9adc
Refactoring dir ( #905 )
2020-10-11 22:22:56 +09:00
Paul Masurel
c23a03ad81
Large API Change in the Directory API. ( #901 )
...
Tantivy used to assume that all files could be somehow memory mapped. After this change, Directory return a `FileSlice` that can be reduced and eventually read into an `OwnedBytes` object. Long and blocking io operation are still required by they do not span over the entire file.
2020-10-08 16:36:51 +09:00
Paul Masurel
439d6956a9
Returning Result in some of the API ( #880 )
...
* Returning Result in some of the API
* Introducing `.writer_for_test(..)`
2020-09-07 15:52:34 +09:00
Paul Masurel
1e5ebdbf3c
Format and remove useless import ( #819 )
2020-04-27 11:56:49 +09:00
Paul Masurel
186d7fc20e
Fix build
2020-04-01 09:32:45 +09:00
Paul Masurel
6227a0555a
Added unit test for empty dictionaries.
2020-01-30 10:08:27 +09:00
Audun Halland
f85d0a522a
Optimize TermDictionary::empty by precomputed data source ( #767 )
2020-01-30 10:04:58 +09:00
Halvor Fladsrud Bø
5795488ba7
Backward iteration for termdict range ( #757 )
...
* Added backwards iteration to termdict
* Ran formatter
* Updated fst dependency
* Updated dependency
* Changelog and version
* Fixed version
* Made it part of 12.0
2020-01-30 09:59:21 +09:00
Paul Masurel
5c6580eb15
fmt ( #661 )
2019-10-04 12:10:01 +09:00