boraarslan
2d6f1d43ff
Add bool and explicit types for merger
2022-06-07 10:10:33 +03:00
boraarslan
ca0973ec78
Fix tests
2022-06-07 10:10:33 +03:00
boraarslan
38ee60d792
Edit Test
2022-06-07 10:10:33 +03:00
boraarslan
f68be28284
Add bool
2022-06-07 10:09:37 +03:00
boraarslan
fc43ab9280
Add tests
2022-06-07 10:09:37 +03:00
boraarslan
38c2ea6a5d
Remove unnecessary line
2022-06-07 10:09:37 +03:00
boraarslan
26a0fd1fbe
cargo fmt
2022-06-07 10:09:37 +03:00
boraarslan
811b91ecb3
Edit and add tests
2022-06-07 10:09:37 +03:00
boraarslan
25c00ce856
Fix indexing for bool
2022-06-07 10:09:37 +03:00
boraarslan
e5debb97a7
Edit test
2022-06-07 10:09:37 +03:00
boraarslan
bc4cd9ffaa
typo fix
2022-06-07 10:09:37 +03:00
boraarslan
9a13d8709b
Explicitly write types
2022-06-07 10:09:37 +03:00
boraarslan
e6eadf1a2f
Add tests
2022-06-07 10:09:37 +03:00
boraarslan
7cca7e6a47
Fix of last commit
2022-06-07 10:09:37 +03:00
boraarslan
ef2492dba6
Broken commit
2022-06-07 10:09:37 +03:00
boraarslan
2981e6c1df
First commit
2022-06-07 10:09:37 +03:00
Ryan Russell
b33b4c0092
Fix various occurrence var names and references ( #1385 )
...
Thank you Ryan!
Signed-off-by: Ryan Russell <git@ryanrussell.org >
2022-06-07 11:08:19 +09:00
Pascal Seitz
314ae43a45
fix fmt
2022-06-02 14:54:23 +08:00
Pascal Seitz
fce91b2f3a
vec without capacity
2022-06-02 13:50:18 +08:00
Pascal Seitz
9bcd2b8104
fix read_block_async
2022-06-02 13:37:52 +08:00
Pascal Seitz
0c9c257150
move cache handling into single function
2022-06-02 13:25:29 +08:00
Pascal Seitz
1af85a2956
accept usize instead &usize
2022-06-02 11:23:36 +08:00
Pascal Seitz
bc4c3d0c6b
add peek_lru test
2022-06-02 11:13:17 +08:00
Pascal Seitz
6937c75f05
hide advanced doc store api
2022-06-02 11:13:17 +08:00
Pascal Seitz
e54429e827
expose doc store functions
...
expose doc store functions for advanced usage
refactor cache
expose cache statistics
remove unnecessary arc
unduplicate code
2022-06-02 11:13:17 +08:00
Ryan Russell
ca836b6414
Improve Docs Readability ( #1380 )
...
Signed-off-by: Ryan Russell <git@ryanrussell.org >
2022-06-02 09:32:57 +09:00
Paul Masurel
f0a2b1cc44
Bumped tantivy and subcrate versions.
2022-05-25 22:50:33 +09:00
PSeitz
89e19f14b5
Merge pull request #1374 from kryesh/main
...
Add Zstd compression support, Make block size configurable via IndexSettings
2022-05-25 07:39:46 +02:00
Paul Masurel
71c4393ec4
Clippy
2022-05-23 10:20:37 +09:00
Antoine G
9ee4772140
Fix deps for unicode regex compiling ( #1373 )
...
* lint doc warning
* fix regex build
2022-05-20 10:18:44 +09:00
Kryesh
fc045e6bf9
Cleanup imports, remove unneeded error mapping
2022-05-19 10:34:02 +10:00
Kryesh
6837a4d468
Fix bench
2022-05-18 20:35:29 +10:00
Kryesh
0759bf9448
Cleanup zstd structure and serialise to u32 in line with lz4
2022-05-18 20:31:22 +10:00
Kryesh
152e8238d7
Fix silly errors from running tests without feature flag
2022-05-18 19:49:10 +10:00
Kryesh
d4e5b48437
Apply feedback - standardise on u64 and fix correct compression bounds
2022-05-18 19:37:28 +10:00
Kryesh
03040ed81d
Add Zstd compression support
2022-05-18 14:04:43 +10:00
Kryesh
aaa22ad225
Make block size configurable to allow for better compression ratios on large documents
2022-05-18 11:13:15 +10:00
Antoine G
3223bdf254
Refactorize PhraseScorer::compute_phrase_match ( #1364 )
...
* Refactorize PhraseScorer::compute_phrase_match
* implem optim for slop
2022-05-13 09:57:21 +09:00
Paul Masurel
749395bbb8
Added rustdoc for MultiFruit extract function ( #1369 )
2022-05-11 11:41:39 +09:00
Paul Masurel
617ba1f0c0
Bugfix in the document deserialization. ( #1368 )
...
Deserializing a json field does not expect the
end of the document anymore.
This behavior is well documented in serde_json.
https://docs.serde.rs/serde_json/fn.from_reader.html
Closes #1366
2022-05-11 11:38:10 +09:00
Paul Masurel
2f1cd7e7f0
Bugfix in the document deserialization. ( #1367 )
...
Deserializing a json field does not expect the
end of the document anymore.
This behavior is well documented in serde_json.
https://docs.serde.rs/serde_json/fn.from_reader.html
Closes #1366
2022-05-11 11:27:04 +09:00
PSeitz
58c0cb5fc4
Merge pull request #1357 from saroh/1302-json-term-writer-API
...
Expose helpers to generate json field writer terms
2022-05-10 11:02:05 +08:00
PSeitz
7f45a6ac96
allow setting tokenizer manager on index ( #1362 )
...
handle json in tokenizer_for_field
2022-05-09 18:15:45 +09:00
saroh
0ade871126
rename constructor to be more explicit
2022-05-06 13:29:07 +02:00
Pascal Seitz
d77e8de36a
flip alias variable name
2022-05-06 17:52:36 +08:00
Pascal Seitz
d11a8cce26
minor docs fix
2022-05-06 17:52:36 +08:00
Pascal Seitz
bc607a921b
add alias shard_size split_size for quickwit
...
improve some docs
2022-05-06 17:52:36 +08:00
Paul Masurel
1273f33338
Fixed comment.
2022-05-06 18:35:25 +09:00
Paul Masurel
e30449743c
Shortens blocks' last_key in the SSTable block index. ( #1361 )
...
Right now we store last key in the blocks of the SSTable index.
This PR replaces the last key by a shorter string that is greater or
equal and still lesser than the next key.
This property is sufficiently to ensure the block index
works properly.
Related to quickwit#1366
2022-05-06 16:29:06 +08:00
Paul Masurel
ed26552296
Minor changes in query parsing for quickwit#1334. ( #1356 )
...
Quickwit's still heavily relies on generating field names
containing a '.' for nested object, yet allows for
user defined field names to contain a dot.
In order to reuse tantivy query parser, we will end up
using quickwit field names directly into tantivy.
Only '.' will be escaped.
This PR makes minor changes in how tantivy query parser parses
a field name and resolves it to a field.
Some of the new edge case behavior is hacky.
Closes #1355
2022-05-06 13:20:10 +09:00