Paul Masurel
fb6d5acb82
Simplify code
2022-10-04 15:44:38 +09:00
PSeitz
4cf911d56a
Merge pull request #1587 from quickwit-oss/no_get_val_in_serialize
...
remove get_val in serialization
2022-10-04 12:56:48 +08:00
Pascal Seitz
0f5cff762f
move enumerate and remove computation
2022-10-04 12:30:19 +08:00
Pascal Seitz
6d9a123cf2
remove get_val in serialization
...
remove get_val in serialization and mark as unimplemented!()
replace get_val with iter in linear codec
remove MultivalueStartIndexRandomSeeker
replace MultivalueStartIndexIter with closure
Sample 100 values in linear codec
2022-10-04 12:01:25 +08:00
PSeitz
0f4a47816a
Merge pull request #1582 from quickwit-oss/faster_sorted_field_values
...
use groupby instead of vec allocation
2022-10-04 09:36:24 +08:00
Pascal Seitz
b062ab2196
use groupby instead of vec allocation
2022-10-04 09:26:26 +08:00
Bruce Mitchener
a9d2f3db23
Tantivy requires Rust 1.62 or later. ( #1583 )
...
Tantivy needs the `total_cmp` feature to compile, which was stabilized
in Rust 1.62.
2022-10-03 18:31:07 +09:00
Bruce Mitchener
44e03791f9
Fix warnings when doc'ing private items. ( #1579 )
...
This also fixes a couple of typos, but plenty remain!
2022-10-03 14:24:00 +09:00
Bruce Mitchener
2d23763e9f
Use u64::from boolean more. ( #1580 )
...
This case is inverted from the previous cases fixed.
This is from nightly clippy.
2022-10-03 14:17:50 +09:00
Bruce Mitchener
a24ae8d924
clippy: Fix needless-borrow warnings. ( #1581 )
...
These show on nightly clippy.
2022-10-03 14:15:09 +09:00
PSeitz
927dff5262
Merge pull request #1578 from quickwit-oss/dead_code
...
remove dead indexing code
2022-10-03 11:25:10 +08:00
Pascal Seitz
a695edcc95
remove dead indexing code
2022-10-03 09:44:02 +08:00
Paul Masurel
b4b4f3fa73
Removing default features for zstd ( #1574 )
2022-09-30 13:02:46 +09:00
PSeitz
b50e4b7c20
Merge pull request #1566 from quickwit-oss/fix_docstore_sorting
...
fix docstore settings for temp docstore
2022-09-30 10:10:36 +08:00
PSeitz
f8686ab1ec
improve comments
...
Co-authored-by: Paul Masurel <paul@quickwit.io >
2022-09-30 10:06:34 +08:00
PSeitz
2fe42719d8
Merge pull request #1570 from quickwit-oss/no_sort_on_multi
...
validate index settings on create
2022-09-30 09:17:03 +08:00
PSeitz
fadd784a25
log improvements ( #1564 )
2022-09-30 09:39:26 +09:00
Pascal Seitz
0e94213af0
validate index settings on create
2022-09-29 18:58:09 +08:00
PSeitz
0da2a2e70d
Merge pull request #1567 from quickwit-oss/dependabot/cargo/tantivy-fst-0.4.0
...
Update tantivy-fst requirement from 0.3.0 to 0.4.0
2022-09-29 10:00:16 +08:00
dependabot[bot]
0bcdf3cbbf
Update tantivy-fst requirement from 0.3.0 to 0.4.0
...
Updates the requirements on [tantivy-fst](https://github.com/tantivy-search/fst ) to permit the latest version.
- [Release notes](https://github.com/tantivy-search/fst/releases )
- [Commits](https://github.com/tantivy-search/fst/commits )
---
updated-dependencies:
- dependency-name: tantivy-fst
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
2022-09-28 20:50:43 +00:00
Pascal Seitz
8f647b817f
fix docstore settings for temp docstore
...
fixes #1565
2022-09-28 17:53:59 +08:00
trinity-1686a
a86b0df6f4
Add query matching terms in a set ( #1539 )
2022-09-28 09:43:18 +02:00
Bruce Mitchener
f842da758c
Move ArcBytes,WeakArcBytes to mmap_directory. ( #1555 )
...
When building without default features (so without mmap, etc),
there are some warnings about unused things. This fixes the
ones related to `ArcBytes` and `WeakArcBytes`, which are only
used with the `mmap_directory` code.
2022-09-27 09:57:28 +09:00
Bruce Mitchener
97ccd6d712
Avoid slicing a string in DocParsingError. ( #1559 )
...
Fixes #1339 .
2022-09-26 20:27:15 +09:00
Bruce Mitchener
cb252a42af
docs: "associated to" -> "associated with" ( #1557 )
...
This reads better this way.
2022-09-26 20:23:37 +09:00
Bruce Mitchener
d9609dd6b6
POLLING_INTERVAL needn't be pub. ( #1556 )
...
This is only used within the file watcher and is const, so it
can't be configured.
2022-09-26 20:22:55 +09:00
Bruce Mitchener
f03667d967
Remove references to /cpp directory. ( #1560 )
...
This was removed in 2018, so these should be fine to remove now.
2022-09-26 20:22:28 +09:00
PSeitz
10f10a322f
Merge pull request #1554 from quickwit-oss/prepare_ip_field
...
prepare for ip field
2022-09-26 16:34:24 +08:00
Pascal Seitz
f757471077
prepare for ip field
2022-09-26 16:27:35 +08:00
PSeitz
21e0adefda
use binary search instead of linear for get_val in merge ( #1548 )
...
* use binary search instead of linear for get_val in merge
* use partition_point
2022-09-26 09:42:33 +09:00
Bruce Mitchener
ea8e6d7b1d
Tidy up clippy config. ( #1547 )
...
* Checking cfg_attr is no longer necessary.
* Don't need multiple `clippy::` prefixes on a name.
2022-09-26 09:37:55 +09:00
PSeitz
dac7da780e
Merge pull request #1545 from waywardmonkeys/remove-some-refs
...
clippy: Remove borrows that the compiler will do.
2022-09-23 15:33:23 +08:00
PSeitz
20c87903b2
fix multivalue ff index creation regression ( #1543 )
...
fixes multivalue ff regression by avoiding using `get_val`. Line::train calls repeatedly get_val, but get_val implementation on Column for multivalues is very slow. The fix is to use the iterator instead. Longterm fix should be to remove get_val access in serialization.
Old Code
test fastfield::bench::bench_multi_value_ff_merge_few_segments ... bench: 46,103,960 ns/iter (+/- 2,066,083)
test fastfield::bench::bench_multi_value_ff_merge_many_segments ... bench: 83,073,036 ns/iter (+/- 4,373,615)
est fastfield::bench::bench_multi_value_ff_merge_many_segments_log_merge ... bench: 64,178,576 ns/iter (+/- 1,466,700)
Current
running 3 tests
test fastfield::multivalued::bench::bench_multi_value_ff_merge_few_segments ... bench: 57,379,523 ns/iter (+/- 3,220,787)
test fastfield::multivalued::bench::bench_multi_value_ff_merge_many_segments ... bench: 90,831,688 ns/iter (+/- 1,445,486)
test fastfield::multivalued::bench::bench_multi_value_ff_merge_many_segments_log_merge ... bench: 158,313,264 ns/iter (+/- 28,823,250)
With Fix
running 3 tests
test fastfield::multivalued::bench::bench_multi_value_ff_merge_few_segments ... bench: 57,635,671 ns/iter (+/- 2,707,361)
test fastfield::multivalued::bench::bench_multi_value_ff_merge_many_segments ... bench: 91,468,712 ns/iter (+/- 11,393,581)
test fastfield::multivalued::bench::bench_multi_value_ff_merge_many_segments_log_merge ... bench: 73,909,138 ns/iter (+/- 15,846,097)
2022-09-23 15:36:29 +09:00
PSeitz
f9c3947803
Merge pull request #1546 from waywardmonkeys/use-ux-from-bool
...
Use u8::from(bool), u64::from(bool).
2022-09-23 09:06:24 +08:00
Bruce Mitchener
e9a384bb15
Use u8::from(bool), u64::from(bool).
2022-09-22 22:44:53 +07:00
Bruce Mitchener
d231671fe2
clippy: Remove borrows that the compiler will do.
...
This started showing up with clippy in rust 1.64.
2022-09-22 22:38:23 +07:00
trinity-1686a
fa3d786a2f
Add support for deleting all documents matching query ( #1535 )
...
* add support for deleting all documents matching query
#1494
2022-09-22 21:26:09 +09:00
Paul Masurel
75aafeeb9b
Added a function to deep clone RamDirectory. ( #1544 )
2022-09-22 12:04:02 +02:00
PSeitz
6f066c7f65
Merge pull request #1541 from quickwit-oss/add_bench
...
add benchmarks for multivalued fastfield merge
2022-09-22 15:28:00 +08:00
Pascal Seitz
22e56aaee3
add benchmarks for multivalued fastfield merge
2022-09-22 11:25:41 +08:00
Paul Masurel
d641979127
Minor refactor of fast fields ( #1538 )
2022-09-21 12:55:03 +09:00
Paul Masurel
1998111521
Minor refactoring fast fields ( #1537 )
2022-09-21 12:46:11 +09:00
PSeitz
acb2e2e282
Merge pull request #1532 from quickwit-oss/refactor_ff
...
remove fast_field_cardinality from FastValue
2022-09-21 04:00:35 +02:00
Pascal Seitz
1ff5da5eb4
remove fast_field_cardinality from FastValue
...
unused and at the wrong placed
2022-09-21 09:38:46 +08:00
Bruce Mitchener
c3b25710ad
doc: Improve directory::Lock docs. ( #1534 )
...
Update the docs to reflect the lack of LockParams, correct an error,
and improve cross-linking.
2022-09-20 18:03:35 +09:00
PSeitz
8492010d43
Merge pull request #1531 from waywardmonkeys/improve-docs-more
...
Improvements to doc linking, grammar, etc.
2022-09-20 15:37:07 +08:00
Bruce Mitchener
cf02e32578
Improvements to doc linking, grammar, etc.
2022-09-19 18:10:22 +07:00
PSeitz
8cca1014c9
Merge pull request #1527 from waywardmonkeys/remove-stream_field-reference
...
docs: Remove mentions of stream_field method.
2022-09-19 17:16:46 +08:00
PSeitz
938f884e32
Merge pull request #1525 from waywardmonkeys/fix-etsy-logo-alt-text-readme
...
README: Fix Etsy logo and alt text.
2022-09-19 16:55:08 +08:00
PSeitz
ed68afb698
Merge pull request #1528 from quickwit-oss/ff_refact
...
fix benches
2022-09-19 11:37:08 +08:00