PSeitz
514d23a20c
move tokenizer API to seperate crate ( #1767 )
...
closes #1766
Finding tantivy tokenizers is a frustrating experience currently, since
they need be updated for each tantivy version. That's unnecessary since
the API is rather stable anyway.
2023-01-09 06:37:38 +01:00
Paul Masurel
4f9efe654c
Support for columnar ( #1734 )
...
* Added support for dynamic fast field.
See README for more information.
* Apply suggestions from code review
Co-authored-by: PSeitz <PSeitz@users.noreply.github.com >
2023-01-07 17:37:00 +09:00
Adam Reichold
1afa5bf3db
Make construction of LevenshteinAutomatonBuilder for FuzzyTermQuery instances lazy. ( #1756 )
2023-01-06 12:44:49 +09:00
PSeitz
07a51eb7c8
refactor multivalue fastfield, refactor range query ( #1749 )
...
Introduce MakeZero trait, remove make_zero from FastValue
Merge two multivalue fastfield implementations into one
prepare range query on fastfield for different types
2023-01-05 12:09:50 +01:00
Adam Reichold
2080c370c2
Enable usage of FuzzyTermQuery for specific fields via QueryParser ( #1750 )
...
* Make nightly Clippy mostly happy.
* Document how to produce TermSetQuery queries using QueryParser.
* Enable construction of queries using FuzzyTermQuery via the QueryParser
* Use FxHashMap instead of HashMap in the QueryParser as these hash tables are not exposed to DoS attacks.
* Use a struct instead of a tuple to improve readability.
2023-01-04 18:11:27 +09:00
Daw-Chih Liou
b22f96624e
doc: update comments in the faceted search example ( #1737 )
...
* doc: update comments in the faceted search example
* chore: format
2023-01-02 11:07:30 +01:00
pinkforest(she/her)
b78dc5e313
Bump prettytables ( #1746 )
2022-12-31 15:01:39 +01:00
Paul Masurel
3f915925af
Fixing unit tests
2022-12-27 12:02:16 +09:00
Paul Masurel
9c5fef5af7
Fixing sstable proptest ( #1743 )
2022-12-26 16:29:33 +09:00
Paul Masurel
9948a84ebe
Simplifies the count_ones definition. ( #1742 )
2022-12-26 16:08:01 +09:00
PSeitz
45156fd869
use group_by in translate_codec_idx_to_original_id ( #1736 )
2022-12-26 06:13:29 +01:00
Paul Masurel
bc959006fa
Ooops. Removing ordered_floats.
2022-12-22 19:50:34 +09:00
Paul Masurel
7385a8f80c
Supporting PartialCmp in VectorColumn. ( #1735 )
...
* Supporting PartialCmp in VectorColumn.
* Apply suggestions from code review
Co-authored-by: PSeitz <PSeitz@users.noreply.github.com >
2022-12-22 17:47:25 +09:00
Paul Masurel
13b89cba17
Adding inlines.
2022-12-22 14:29:41 +09:00
Hasnain Lakhani
f4804ce2f5
Adjust spelling of "returns" in docs for DisjunctionMaxQuery ( #1733 )
2022-12-22 14:04:07 +09:00
Paul Masurel
2a6d1eaf78
Added missing license.
2022-12-22 12:47:43 +09:00
Paul Masurel
540a9972bd
Support for NotNaN in fast fields
2022-12-22 12:28:25 +09:00
Paul Masurel
bb48c3e488
Refactoring to prepare for the addition of dynamic fast field ( #1730 )
...
* Refactoring to prepare for the addition of dynamic fast field
- Exposing insert_key / insert_value
- Renamed SSTable::{Reader/Writer}-> SSTable::{ValueReader/ValueWriter}
- Added a generic Dictionary object in the sstable crate
- Removing the TermDictionary wrapper from tantivy, relying directly on
an alias of the generic Dictionary object.
- dropped the use of byteorder in sstable.
- Stopped scanning / reading the entire dictionary when streaming a range.
* Added a benchmark for streaming sstable ranges.
* CR comments.
Rename deserialize_u64 -> deserialize_vint_u64
* Removed needless allocation, split serialize into serialize and clear.
2022-12-22 12:25:46 +09:00
Paul Masurel
3339a3ec05
Removed feature(quickwit) in tantivy-common.
2022-12-22 10:19:57 +09:00
Paul Masurel
f39165e1e7
Moving FileSlice to tantivy-common ( #1729 )
2022-12-21 16:35:11 +09:00
Paul Masurel
32cb1d22da
Removed AsyncIoResult. ( #1728 )
2022-12-21 16:01:17 +09:00
Paul Masurel
4a6bf50e78
Clippy
2022-12-21 15:43:34 +09:00
PSeitz
2ac1cc2fc0
add sparse codec ( #1723 )
...
* add sparse codec
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
* add the -1 u16 fix for metadata num_vals
* add dense block encoding to sparse codec
* add comment, refactor u16 reading
Co-authored-by: Paul Masurel <paul@quickwit.io >
2022-12-20 15:30:33 +01:00
PSeitz
f9171a3981
fix clippy ( #1725 )
...
* fix clippy
* fix clippy fastfield codecs
* fix clippy bitpacker
* fix clippy common
* fix clippy stacker
* fix clippy sstable
* fmt
2022-12-20 07:30:06 +01:00
PSeitz
a2cf6a79b4
Sparse dense index ( #1716 )
...
* add dense codec
* benchmark fix and important optimisation
* move code to DenseIndexBlock
improve benchmark
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
* extend benchmarks
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
Co-authored-by: Paul Masurel <paul@quickwit.io >
2022-12-13 07:50:09 +01:00
Paul Masurel
f6e87a5319
Cargo fmt
2022-12-13 12:30:40 +09:00
Paul Masurel
f9971e15fe
Fixing unit test with sstable test.
2022-12-13 12:22:44 +09:00
PSeitz
3cdc8e7472
pass index info to serialize ( #1719 )
2022-12-13 04:20:31 +01:00
dependabot[bot]
fbb0f8b55d
Update base64 requirement from 0.13.0 to 0.20.0 ( #1720 )
...
Updates the requirements on [base64](https://github.com/marshallpierce/rust-base64 ) to permit the latest version.
- [Release notes](https://github.com/marshallpierce/rust-base64/releases )
- [Changelog](https://github.com/marshallpierce/rust-base64/blob/master/RELEASE-NOTES.md )
- [Commits](https://github.com/marshallpierce/rust-base64/compare/v0.13.0...v0.20.0 )
---
updated-dependencies:
- dependency-name: base64
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-13 11:46:23 +09:00
Paul Masurel
136a8f4124
Isolating sstable and stacker in independant crates. ( #1718 )
...
Both crate will be used in the new (optional + dynamic) fastfield work.
2022-12-13 11:44:17 +09:00
PSeitz
5d4535de83
Changelog fix ( #1717 )
2022-12-12 14:28:42 +09:00
PSeitz
2c50b02eb3
Fix max bucket limit in histogram ( #1703 )
...
* Fix max bucket limit in histogram
The max bucket limit in histogram was broken, since some code introduced temporary filtering of buckets, which then resulted into an incorrect increment on the bucket count.
The provided solution covers more scenarios, but there are still some scenarios unhandled (See #1702 ).
* Apply suggestions from code review
Co-authored-by: Paul Masurel <paul@quickwit.io >
Co-authored-by: Paul Masurel <paul@quickwit.io >
0.19
2022-12-12 04:40:15 +01:00
PSeitz
509adab79d
Bump version ( #1715 )
...
* group workspace deps
* update cargo.toml
* revert tant version
* chore: Release
2022-12-12 04:39:43 +01:00
PSeitz
96c93a6ba3
Merge pull request #1700 from quickwit-oss/PSeitz-patch-1
...
Update CHANGELOG.md
2022-12-02 16:31:11 +01:00
boraarslan
495824361a
Move split_full_path to Schema ( #1692 )
2022-11-29 20:56:13 +09:00
PSeitz
485a8f507e
Update CHANGELOG.md
2022-11-28 15:41:31 +01:00
PSeitz
1119e59eae
prepare fastfield format for null index ( #1691 )
...
* prepare fastfield format for null index
* add format version for fastfield
* Update fastfield_codecs/src/compact_space/mod.rs
* switch to variable size footer
* serialize delta of end
2022-11-28 17:15:24 +09:00
PSeitz
ee1f2c1f28
add aggregation support for date type ( #1693 )
...
* add aggregation support for date type
fixes #1332
* serialize key_as_string as rfc3339 in date histogram
* update docs
* enable date for range aggregation
2022-11-28 09:12:08 +09:00
PSeitz
600548fd26
Merge pull request #1694 from quickwit-oss/dependabot/cargo/zstd-0.12
...
Update zstd requirement from 0.11 to 0.12
2022-11-25 05:48:59 +01:00
PSeitz
9929c0c221
Merge pull request #1696 from quickwit-oss/dependabot/cargo/env_logger-0.10.0
...
Update env_logger requirement from 0.9.0 to 0.10.0
2022-11-25 03:28:10 +01:00
dependabot[bot]
f53e65648b
Update env_logger requirement from 0.9.0 to 0.10.0
...
Updates the requirements on [env_logger](https://github.com/rust-cli/env_logger ) to permit the latest version.
- [Release notes](https://github.com/rust-cli/env_logger/releases )
- [Changelog](https://github.com/rust-cli/env_logger/blob/main/CHANGELOG.md )
- [Commits](https://github.com/rust-cli/env_logger/compare/v0.9.0...v0.10.0 )
---
updated-dependencies:
- dependency-name: env_logger
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
2022-11-24 20:07:52 +00:00
PSeitz
0281b22b77
update create_in_ram docs ( #1695 )
2022-11-24 17:30:09 +01:00
dependabot[bot]
a05c184830
Update zstd requirement from 0.11 to 0.12
...
Updates the requirements on [zstd](https://github.com/gyscos/zstd-rs ) to permit the latest version.
- [Release notes](https://github.com/gyscos/zstd-rs/releases )
- [Commits](https://github.com/gyscos/zstd-rs/commits )
---
updated-dependencies:
- dependency-name: zstd
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
2022-11-23 20:15:32 +00:00
Paul Masurel
0b40a7fe43
Added a expand_dots JsonObjectOptions. ( #1687 )
...
Related with quickwit#2345.
2022-11-21 23:03:00 +09:00
trinity-1686a
e758080465
add support for TermSetQuery in query parser ( #1683 )
2022-11-17 16:49:49 +01:00
Paul Masurel
2a39289a1b
Handle escaped dot in json path in the QueryParser. ( #1682 )
2022-11-16 07:18:34 +09:00
Adam Reichold
ca6231170e
Make the built-in stop word lists selectable via the Language enum already used by the Stemmer filter. ( #1671 )
2022-11-15 17:40:25 +09:00
PSeitz
eda6e5a10a
Merge pull request #1681 from quickwit-oss/ip_range_query_multi
...
remove Column from MultiValuedU128FastFieldReader
2022-11-15 09:27:46 +08:00
Pascal Seitz
8641155cbb
remove column from MultiValuedU128FastFieldReader
2022-11-14 18:49:15 +08:00
PSeitz
9a090ed994
Merge pull request #1659 from quickwit-oss/ip_range_query_multi
...
add support for ip range query on multivalue fastfields
2022-11-14 15:17:41 +08:00