Paul Masurel
bd5eea9852
Integrated columnar work.
2023-02-09 13:14:31 +01:00
Paul Masurel
08919a2900
Improvement on the scalar / random bitpacker code. (#1781)
...
* Improvement on the scalar / random bitpacker code.
Added proptesting
Added simple benchmark
Added assert and comments on the very non-trivial hidden contract
Remove the need for an extra padding.
The last point introduces a small performance regression (~10%).
* Fixing unit tests
2023-01-19 18:09:13 +09:00
PSeitz
f687b3a5aa
start migrate Field to &str (#1772)
...
start migrate Field to &str in preparation of columnar
return Result for get_field
2023-01-18 16:12:07 +09:00
Shikhar Bhushan
2650111b76
EnableScoring::Disabled - optional Searcher (#1780)
2023-01-12 09:26:50 -05:00
PSeitz
1176555eff
handle user input on get_docid_for_value_range (#1760)
...
* handle user input on get_docid_for_value_range
fixes #1757
* pass range as parameter
2023-01-12 14:20:16 +01:00
Adam Reichold
8312c882a5
More cosmetic fixes for upcoming Clippy lints. (#1771)
2023-01-10 10:32:45 +01:00
PSeitz
7c6cc818ae
enable range query on fast field for u64 compatible types (#1762)
...
* enable range query on fast field for u64 compatible types
* rename, update benches
2023-01-10 04:08:26 +01:00
PSeitz
07a51eb7c8
refactor multivalue fastfield, refactor range query (#1749)
...
Introduce MakeZero trait, remove make_zero from FastValue
Merge two multivalue fastfield implementations into one
prepare range query on fastfield for different types
2023-01-05 12:09:50 +01:00
Paul Masurel
f39165e1e7
Moving FileSlice to tantivy-common (#1729)
2022-12-21 16:35:11 +09:00
PSeitz
f9171a3981
fix clippy (#1725)
...
* fix clippy
* fix clippy fastfield codecs
* fix clippy bitpacker
* fix clippy common
* fix clippy stacker
* fix clippy sstable
* fmt
2022-12-20 07:30:06 +01:00
PSeitz
1119e59eae
prepare fastfield format for null index (#1691)
...
* prepare fastfield format for null index
* add format version for fastfield
* Update fastfield_codecs/src/compact_space/mod.rs
* switch to variable size footer
* serialize delta of end
2022-11-28 17:15:24 +09:00
PSeitz
ee1f2c1f28
add aggregation support for date type (#1693)
...
* add aggregation support for date type
fixes #1332
* serialize key_as_string as rfc3339 in date histogram
* update docs
* enable date for range aggregation
2022-11-28 09:12:08 +09:00
Pascal Seitz
8641155cbb
remove column from MultiValuedU128FastFieldReader
2022-11-14 18:49:15 +08:00
Pascal Seitz
b7d0dd154a
fmt
2022-11-14 14:49:15 +08:00
PSeitz
ce10fab20f
Apply suggestions from code review
...
Co-authored-by: Paul Masurel <paul@quickwit.io>
2022-11-14 14:21:53 +08:00
Pascal Seitz
e034328a8b
Improve position_to_docid, refactor, add tests
2022-11-14 14:21:53 +08:00
Pascal Seitz
f811d1616b
add support for ip range query on multivalue fastfields
2022-11-14 14:21:52 +08:00
Pascal Seitz
fb9f03118d
switch total_num_val to u32
2022-11-11 17:35:52 +08:00
Paul Masurel
3edf0a2724
Using the manual reload policy in IndexWriter. (#1667)
2022-11-09 11:20:41 +01:00
Pascal Seitz
c69a873dd3
fix num_vals on value index after merge
2022-11-07 21:05:21 +08:00
Pascal Seitz
83325d8f3f
move multivalue index to own file
...
start_doc parameter in positions to docids
2022-11-01 10:36:13 +08:00
PSeitz
7a80851e36
Merge pull request #1645 from quickwit-oss/ip_field_range_query
...
add ip range query benchmark, add seek behaviour
2022-10-27 16:13:52 +08:00
Adam Reichold
bbb058d976
Replace FNV by rustc-hash
...
Both constructions have similar goals, but rustc-hash is better suited for
contemporary CPUs, as it works one word at a time instead of byte by byte.
2022-10-27 00:35:09 +02:00
Pascal Seitz
6213ea476a
pass positions parameter
2022-10-25 17:44:51 +08:00
Pascal Seitz
e772d3170d
switch get_val() to u32
...
Fixes #1638
2022-10-24 19:05:57 +08:00
Pascal Seitz
6bb73a527f
add range query via ip fast field
2022-10-24 16:00:38 +08:00
Pascal Seitz
791350091c
switch num_vals() to u32
...
fixes #1630
2022-10-20 19:44:28 +08:00
Pascal Seitz
952b048341
add term aggregation clarification
2022-10-14 16:12:19 +08:00
Pascal Seitz
9cb8cfbea8
return Error instead of panic in fastfields
...
fixes #1572
2022-10-11 14:15:22 +08:00
Pascal Seitz
b2ca83a93c
switch to ipv6, add monotonic_mapping tests
2022-10-07 18:47:55 +08:00
PSeitz
534b1d33c3
use ipv6
...
Co-authored-by: Paul Masurel <paul@quickwit.io>
2022-10-07 16:56:00 +08:00
PSeitz
f465173872
Apply suggestions from code review
...
Co-authored-by: Paul Masurel <paul@quickwit.io>
2022-10-07 16:55:53 +08:00
Pascal Seitz
96315df20d
use idx part only for positions_to_docid
2022-10-07 16:54:04 +08:00
Pascal Seitz
9a1609d364
add test
2022-10-07 16:25:01 +08:00
Pascal Seitz
2864bf7123
use serializer for u128
2022-10-07 16:25:01 +08:00
Pascal Seitz
5171ff611b
serialize ip as u128, add test for positions_to_docid
2022-10-07 16:25:01 +08:00
Pascal Seitz
0b86658389
rename ip addr, use buffer
2022-10-07 16:25:01 +08:00
Pascal Seitz
4d29ff4d01
finalize ip addr rename
2022-10-07 16:25:01 +08:00
Pascal Seitz
cdc8e3a8be
group monotonic mapping and inverse
...
fix mapping inverse
remove ip indexing
add get_between_vals test
2022-10-07 16:25:01 +08:00
Pascal Seitz
787a37bacf
expect instead of unwrap
2022-10-07 16:25:01 +08:00
Pascal Seitz
f5039f1846
remove roaring
2022-10-07 16:25:01 +08:00
Pascal Seitz
eeb1f19093
rename to iter_gen
2022-10-07 16:25:01 +08:00
Pascal Seitz
087beaf328
remove null handling
2022-10-07 16:25:01 +08:00
Pascal Seitz
309449dba3
rename to IpAddr
2022-10-07 16:25:01 +08:00
Pascal Seitz
c8713a01ed
use iter api
2022-10-07 16:25:01 +08:00
Pascal Seitz
400a20b7af
add ip field
...
add u128 multivalue reader and writer
add ip to schema
add ip writers, handle merge
2022-10-07 16:25:01 +08:00
Pascal Seitz
d742275048
renames
2022-10-05 19:16:49 +08:00
PSeitz
b9f06bc287
Update src/fastfield/multivalued/mod.rs
...
Co-authored-by: Paul Masurel <paul@quickwit.io>
2022-10-05 19:09:19 +08:00
Pascal Seitz
8b42c4c126
disable linear codec for multivalue value index
...
don't materialize index column on merge
use simpler chain() variant
2022-10-05 19:09:17 +08:00
Pascal Seitz
6d9a123cf2
remove get_val in serialization
...
remove get_val in serialization and mark as unimplemented!()
replace get_val with iter in linear codec
remove MultivalueStartIndexRandomSeeker
replace MultivalueStartIndexIter with closure
Sample 100 values in linear codec
2022-10-04 12:01:25 +08:00