tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-05-22 11:10:40 +00:00

Author	SHA1	Message	Date
Paul Masurel	7559bad5fc	Suffix-sum pruning for multi-term intersection candidates After scoring each secondary in Phase 2, check whether remaining secondaries' block_max scores can still beat the threshold. Skip to the next candidate early if impossible, avoiding expensive seeks into later secondaries. Improves three-term intersection by ~8% on the balanced benchmark while keeping two-term performance neutral. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-25 22:58:52 +02:00
Paul Masurel	8a7aeed030	Optimizing top K using Adrien Grand's ideas https://jpountz.github.io/2025/08/28/compiled-vs-vectorized-search-engine-edition.html	2026-04-25 22:30:55 +02:00
Paul Masurel	d27ca164a9	block_wand: use single-scorer path when there is only one scorer	2026-04-25 16:35:00 +02:00
James Sewell	322286ee16	Tighen Block-Max in single-scorer (#2897 ) In the Block-Max WAND single-scorer, it uses block_max_score() < threshold, whereas the multi-term one uses block_max_score_upperbound <= threshold. As both of these are guarded later on with if score > threshold we can use the more efficent form in single-scorer. Single-scorer block skip (<, should be <=): https://github.com/quickwit-oss/tantivy/blob/main/src/query/boolean_query/block_wand.rs#L231 Multi-scorer block skip (already <=): https://github.com/quickwit-oss/tantivy/blob/main/src/query/boolean_query/block_wand.rs#L179 Single-scorer per-doc guard (>): https://github.com/quickwit-oss/tantivy/blob/main/src/query/boolean_query/block_wand.rs#L246 Multi-scorer per-doc guard (>): https://github.com/quickwit-oss/tantivy/blob/main/src/query/boolean_query/block_wand.rs#L206 This will improve performance when there are many identical scores.	2026-04-25 14:13:07 +02:00
Paul Masurel	d2c1b8bc2c	Optimized intersection count using a bitset when the first leg is dense	2026-04-06 12:01:52 -04:00
PSeitz	129c40f8ec	Improve Union Performance for non-score unions (#2863 ) * enhance and_or_queries bench * optimize unions for count/non-score, bitset fix for ARM Benchmarks run on M4 Max ``` single_field_only_union_5%_OR_1% count Avg: 0.1100ms (-17.46%) Median: 0.1079ms (-14.08%) [0.1045ms .. 0.1410ms] Output: 54_110 top10_inv_idx Avg: 0.1663ms (+0.79%) Median: 0.1660ms (+0.75%) [0.1634ms .. 0.1702ms] Output: 10 count+top10 Avg: 0.2639ms (-1.24%) Median: 0.2634ms (-0.31%) [0.2512ms .. 0.2813ms] Output: 54_110 top10_by_ff Avg: 0.2875ms (-8.67%) Median: 0.2852ms (-8.80%) [0.2737ms .. 0.3083ms] Output: 10 top10_by_2ff Avg: 0.3137ms (-5.79%) Median: 0.3128ms (-0.35%) [0.3044ms .. 0.3313ms] Output: 10 single_field_only_union_5%_OR_1%_OR_15% count Avg: 0.4122ms (-33.05%) Median: 0.4140ms (-32.20%) [0.3940ms .. 0.4341ms] Output: 181_663 top10_inv_idx Avg: 0.3999ms (+2.39%) Median: 0.3987ms (+2.02%) [0.3939ms .. 0.4160ms] Output: 10 count+top10 Avg: 0.8520ms (-8.63%) Median: 0.8516ms (-8.65%) [0.8413ms .. 0.8676ms] Output: 181_663 top10_by_ff Avg: 0.9694ms (-13.06%) Median: 0.9645ms (-13.77%) [0.9403ms .. 1.0122ms] Output: 10 top10_by_2ff Avg: 0.9880ms (-13.01%) Median: 0.9838ms (-13.59%) [0.9781ms .. 1.0306ms] Output: 10 single_field_only_union_5%_OR_30% count Avg: 0.7364ms (-33.11%) Median: 0.7347ms (-33.19%) [0.7233ms .. 0.7547ms] Output: 303_337 top10_inv_idx Avg: 0.8932ms (-0.89%) Median: 0.8919ms (-0.75%) [0.8861ms .. 0.9249ms] Output: 10 count+top10 Avg: 1.3611ms (-9.23%) Median: 1.3598ms (-9.39%) [1.3426ms .. 1.3891ms] Output: 303_337 top10_by_ff Avg: 1.6575ms (-18.64%) Median: 1.6224ms (-20.81%) [1.6051ms .. 1.7560ms] Output: 10 top10_by_2ff Avg: 1.6800ms (-16.24%) Median: 1.6769ms (-15.72%) [1.6661ms .. 1.7229ms] Output: 10 single_field_only_union_30%_OR_0.01% count Avg: 0.6471ms (-33.73%) Median: 0.6464ms (-33.46%) [0.6375ms .. 0.6604ms] Output: 270_268 top10_inv_idx Avg: 0.0338ms (-0.27%) Median: 0.0338ms (+0.11%) [0.0331ms .. 0.0351ms] Output: 10 count+top10 Avg: 1.2209ms (-9.27%) Median: 1.2207ms (-9.25%) [1.2158ms .. 1.2351ms] Output: 270_268 top10_by_ff Avg: 1.4808ms (-17.20%) Median: 1.4690ms (-17.91%) [1.4384ms .. 1.5553ms] Output: 10 top10_by_2ff Avg: 1.5011ms (-14.30%) Median: 1.4992ms (-13.88%) [1.4891ms .. 1.5320ms] Output: 10 multi_field_only_union_5%_OR_1% count Avg: 0.1196ms (-17.67%) Median: 0.1166ms (-14.83%) [0.1123ms .. 0.1462ms] Output: 60_183 top10_inv_idx Avg: 0.2356ms (-0.21%) Median: 0.2355ms (+0.23%) [0.2330ms .. 0.2406ms] Output: 10 count+top10 Avg: 0.2985ms (-5.06%) Median: 0.2957ms (-5.79%) [0.2875ms .. 0.3186ms] Output: 60_183 top10_by_ff Avg: 0.3102ms (-9.44%) Median: 0.3031ms (-11.09%) [0.2994ms .. 0.3324ms] Output: 10 top10_by_2ff Avg: 0.3435ms (-0.91%) Median: 0.3447ms (-0.62%) [0.3342ms .. 0.3530ms] Output: 10 multi_field_only_union_5%_OR_1%_OR_15% count Avg: 0.4465ms (-35.41%) Median: 0.4456ms (-36.25%) [0.4250ms .. 0.4936ms] Output: 201_114 top10_inv_idx Avg: 1.1542ms (+2.38%) Median: 1.1560ms (+2.96%) [1.1193ms .. 1.1912ms] Output: 10 count+top10 Avg: 0.9334ms (-8.89%) Median: 0.9330ms (-8.95%) [0.9191ms .. 0.9542ms] Output: 201_114 top10_by_ff Avg: 1.0590ms (-14.10%) Median: 1.0424ms (-15.08%) [1.0304ms .. 1.1174ms] Output: 10 top10_by_2ff Avg: 1.0779ms (-17.06%) Median: 1.0754ms (-17.40%) [1.0650ms .. 1.1155ms] Output: 10 multi_field_only_union_5%_OR_30% count Avg: 0.8137ms (-33.48%) Median: 0.7976ms (-34.84%) [0.7734ms .. 1.0855ms] Output: 335_682 top10_inv_idx Avg: 1.5108ms (+0.36%) Median: 1.4943ms (-0.72%) [1.4805ms .. 1.5865ms] Output: 10 count+top10 Avg: 1.4985ms (-9.75%) Median: 1.4936ms (-9.63%) [1.4784ms .. 1.5472ms] Output: 335_682 top10_by_ff Avg: 1.8531ms (-15.70%) Median: 1.8583ms (-16.30%) [1.7467ms .. 2.2297ms] Output: 10 top10_by_2ff Avg: 1.8735ms (-16.67%) Median: 1.8421ms (-18.05%) [1.8146ms .. 2.3650ms] Output: 10 multi_field_only_union_30%_OR_0.01% count Avg: 0.7020ms (-34.40%) Median: 0.7004ms (-34.05%) [0.6943ms .. 0.7156ms] Output: 300_315 top10_inv_idx Avg: 0.1445ms (-1.57%) Median: 0.1442ms (-1.35%) [0.1426ms .. 0.1478ms] Output: 10 count+top10 Avg: 1.3309ms (-9.84%) Median: 1.3284ms (-9.71%) [1.3234ms .. 1.3549ms] Output: 300_315 top10_by_ff Avg: 1.6152ms (-17.39%) Median: 1.6037ms (-18.72%) [1.5778ms .. 1.7227ms] Output: 10 top10_by_2ff Avg: 1.6479ms (-17.10%) Median: 1.6444ms (-15.46%) [1.6307ms .. 1.6901ms] Output: 10 ``` * add comment * fix comment * remove inline(never), bounds check	2026-03-27 08:00:26 +01:00
trinity-1686a	5562ce6037	Merge pull request #2818 from Darkheir/fix/query_grammar_regex_between_parentheses	2026-02-11 11:39:58 +01:00
Moe	8018016e46	feat: add fast field support for Bytes type (#100 ) (#2830 ) ## What Enable range queries and TopN sorting on `Bytes` fast fields, bringing them to parity with `Str` fields. ## Why `BytesColumn` uses the same dictionary encoding as `StrColumn` internally, but range queries and TopN sorting were explicitly disabled for `Bytes`. This prevented use cases like storing lexicographically sortable binary data (e.g., arbitrary-precision decimals) that need efficient range filtering. ## How 1. Enable range queries for Bytes - Changed `is_type_valid_for_fastfield_range_query()` to return `true` for `Type::Bytes` 2. Add BytesColumn handling in scorer - Added a branch in `FastFieldRangeWeight::scorer()` to handle bytes fields using dictionary ordinal lookup (mirrors the existing `StrColumn` logic) 3. Add SortByBytes - New sort key computer for TopN queries on bytes columns ## Tests - `test_bytes_field_ff_range_query` - Tests inclusive/exclusive bounds and unbounded ranges - `test_sort_by_bytes_asc` / `test_sort_by_bytes_desc` - Tests lexicographic ordering in both directions	2026-02-11 11:26:18 +01:00
PSeitz	98ebbf922d	faster exclude queries (#2825 ) * faster exclude queries Faster exclude queries with multiple terms. Changes `Exclude` to be able to exclude multiple DocSets, instead of putting the docsets into a union. Use `seek_danger` in `Exclude`. closes #2822 * replace unwrap with match	2026-01-30 17:06:41 +01:00
Darkheir	a55e4069e4	feat(query-grammar): Apply PR review suggestions Signed-off-by: Darkheir <raphael.cohen@sekoia.io>	2026-01-28 14:13:55 +01:00
Paul Masurel	3f448ecf79	Bugfix on intersection. (#2812 ) The intersection algorithm made it possible for .seek(..) with values lower than the current doc id, breaking the DocSet contract. The fix removes the optimization that caused left.seek(..) to be replaced by a simpler left.advance(..). Simply doing so lead to a performance regression. I therefore integrated that idea within SegmentPostings.seek. We now attempt to check the next doc systematically on seek, PROVIDED the block is already loaded. Closes #2811 Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2026-01-27 09:21:09 +01:00
Paul Masurel	b86caeefe2	Major bugfix in intersection A bug was added with the `seek_into_the_danger_zone()` optimization (Spotted and fixed by Stu) The contract says seek_into_the_danger_zone returns true if do is part of the docset. The blanket implementation goes like this. ``` let current_doc = self.doc(); if current_doc < target { self.seek(target); } self.doc() == target ``` So it will return true if target is TERMINATED, where really TERMINATED does not belong to the docset. The fix tries to clarify the contracts and fixes the intersection algorithm. We observe a small but all over the board improvement in intersection performance. --------- Co-authored-by: Stu Hood <stuhood@gmail.com> Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2026-01-23 18:44:10 +01:00
trinity-1686a	12977bc7c4	upgrade some dependancies (#2802 ) including rand, which had a few breaking changes	2026-01-14 10:19:09 +01:00
PSeitz	735c588f4f	fix union performance regression (#2790 ) * add inlines * fix union performance regression Remove unwrap from hotpath generates better assembly. closes #2788	2026-01-02 12:06:51 +01:00
Paul Masurel	b11605f045	Addressing clippy comments (#2789 ) Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2025-12-31 18:02:00 +01:00
PSeitz	923f0508f2	seek_exact + cost based intersection (#2538 ) * seek_exact + cost based intersection Adds `seek_exact` and `cost` to `DocSet` for a more efficient intersection. Unlike `seek`, `seek_exact` does not require the DocSet to advance to the next hit, if the target does not exist. `cost` allows to address the different DocSet types and their cost model and is used to determine the DocSet that drives the intersection. E.g. fast field range queries may do a full scan. Phrase queries load the positions to check if a we have a hit. They both have a higher cost than their size_hint would suggest. Improves `size_hint` estimation for intersection and union, by having a estimation based on random distribution with a co-location factor. Refactor range query benchmark. Closes #2531 Future Work Implement `seek_exact` for BufferedUnionScorer and RangeDocSet (fast field range queries) Evaluate replacing `seek` with `seek_exact` to reduce code complexity * Apply suggestions from code review Co-authored-by: Paul Masurel <paul@quickwit.io> * add API contract verfication * impl seek_exact on union * rename seek_exact * add mixed AND OR test, fix buffered_union * Add a proptest of BooleanQuery. (#2690) * fix build * Increase the document count. * fix merge conflict * fix debug assert * Fix compilation errors after rebase - Remove duplicate proptest_boolean_query module - Remove duplicate cost() method implementations - Fix TopDocs API usage (add .order_by_score()) - Remove duplicate imports - Remove unused variable assignments --------- Co-authored-by: Paul Masurel <paul@quickwit.io> Co-authored-by: Pascal Seitz <pascal.seitz@datadoghq.com> Co-authored-by: Stu Hood <stuhood@gmail.com>	2025-12-30 14:43:25 +01:00
ChangRui-Ryan	e0b62e00ac	optimize RangeDocSet for non-overlapping query ranges (#2783 )	2025-12-29 16:55:28 +01:00
Moe	e3c9be1f92	fix: boolean query incorrectly dropping documents when AllScorer is present (#2760 ) * Fixed the range issue. * Fixed the second all scorer issue * Improved docs + tests * Improved code. * Fixed lint issues. * Improved tests + logic based on PR comments. * Fixed lint issues. * Increase the document count. * Improved the prop-tests * Expand the index size, and remove unused parameter. --------- Co-authored-by: Stu Hood <stuhood@gmail.com>	2025-12-16 22:52:02 +01:00
trinity-1686a	d0e1600135	fix bug with minimum_should_match and AllScorer (#2774 )	2025-12-14 10:10:45 +01:00
Paul Masurel	63c66005db	Lazy scorers (#2726 ) * Refactoring of the score tweaker into `SortKeyComputer`s to unlock two features. - Allow lazy evaluation of score. As soon as we identified that a doc won't reach the topK threshold, we can stop the evaluation. - Allow for a different segment level score, segment level score and their conversion. This PR breaks public API, but fixing code is straightforward. * Bumping tantivy version --------- Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2025-12-01 15:38:57 +01:00
Paul Masurel	25d44fcec8	Revert "remove unused columnar api (#2742 )" (#2748 ) * Revert "remove unused columnar api (#2742)" This reverts commit `8725594d47`. * Clippy comment + removing fill_vals --------- Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2025-11-26 17:44:02 +01:00
Paul Masurel	f88b7200b2	Optimization when posting list are saturated. (#2745 ) * Optimization when posting list are saturated. If a posting list doc freq is the segment reader's max_doc, and if scoring does not matter, we can replace it by a AllScorer. In turn, in a boolean query, we can dismiss all scorers and empty scorers, to accelerate the request. * Added range query optimization * CR comment * CR comments * CR comment --------- Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2025-11-26 15:50:57 +01:00
Paul Masurel	c363bbd23d	Optimize term aggregation with low cardinality + some refactoring (#2740 ) This introduce an optimization of top level term aggregation on field with a low cardinality. We then use a Vec as the underlying map. In addition, we buffer subaggregations. --------- Co-authored-by: Pascal Seitz <pascal.seitz@datadoghq.com> Co-authored-by: Paul Masurel <paul@quickwit.io>	2025-11-21 14:46:29 +01:00
PSeitz	d410a3b0c0	Add Filtering for Term Aggregations (#2717 ) * Add Filtering for Term Aggregations Closes #2702 * add AggregationsSegmentCtx memory consumption --------- Co-authored-by: Pascal Seitz <pascal.seitz@datadoghq.com>	2025-10-15 17:39:53 +02:00
PSeitz	33835b6a01	Add DocSet::cost() (#2707 ) * query: add DocSet cost hint and use it for intersection ordering - Add DocSet::cost() - Use cost() instead of size_hint() to order scorers in intersect_scorers This isolates cost-related changes without the new seek APIs from PR #2538 * add comments --------- Co-authored-by: Pascal Seitz <pascal.seitz@datadoghq.com>	2025-10-13 16:25:49 +02:00
PSeitz	270ca5123c	refactor postings (#2709 ) rename shallow_seek to seek_block remove full_block from public postings API This is as preparation to optionally handle Bitsets in the postings	2025-10-08 16:55:25 +02:00
PSeitz-dd	40659d4d07	improve naming in buffered_union (#2705 )	2025-09-24 10:58:46 +02:00
PSeitz-dd	70da310b2d	perf: deduplicate queries (#2698 ) * deduplicate queries Deduplicate queries in the UserInputAst after parsing queries * add return type	2025-09-22 12:16:58 +02:00
PSeitz-dd	2340dca628	fix compiler warnings (#2699 ) * fix compiler warnings * fix import	2025-09-19 15:55:04 +02:00
PSeitz-dd	203751f2fe	Optimize ExistsQuery for a high number of dynamic columns (#2694 ) * Optimize ExistsQuery for a high number of dynamic columns The previous algorithm checked _each_ doc in _each_ column for existence. This causes huge cost on JSON fields with e.g. 100k columns. Compute a bitset instead if we have more than one column. add `iter_docs` to the multivalued_index * add benchmark subfields=1 exists_json_union Memory: 89.3 KB (+2.01%) Avg: 0.4865ms (-26.03%) Median: 0.4865ms (-26.03%) [0.4865ms .. 0.4865ms] subfields=2 exists_json_union Memory: 68.1 KB Avg: 1.7048ms (-0.46%) Median: 1.7048ms (-0.46%) [1.7048ms .. 1.7048ms] subfields=3 exists_json_union Memory: 61.8 KB Avg: 2.0742ms (-2.22%) Median: 2.0742ms (-2.22%) [2.0742ms .. 2.0742ms] subfields=4 exists_json_union Memory: 119.8 KB (+103.44%) Avg: 3.9500ms (+42.62%) Median: 3.9500ms (+42.62%) [3.9500ms .. 3.9500ms] subfields=5 exists_json_union Memory: 120.4 KB (+107.65%) Avg: 3.9610ms (+20.65%) Median: 3.9610ms (+20.65%) [3.9610ms .. 3.9610ms] subfields=6 exists_json_union Memory: 120.6 KB (+107.49%) Avg: 3.8903ms (+3.11%) Median: 3.8903ms (+3.11%) [3.8903ms .. 3.8903ms] subfields=7 exists_json_union Memory: 120.9 KB (+106.93%) Avg: 3.6220ms (-16.22%) Median: 3.6220ms (-16.22%) [3.6220ms .. 3.6220ms] subfields=8 exists_json_union Memory: 121.3 KB (+106.23%) Avg: 4.0981ms (-15.97%) Median: 4.0981ms (-15.97%) [4.0981ms .. 4.0981ms] subfields=16 exists_json_union Memory: 123.1 KB (+103.09%) Avg: 4.3483ms (-92.26%) Median: 4.3483ms (-92.26%) [4.3483ms .. 4.3483ms] subfields=256 exists_json_union Memory: 204.6 KB (+19.85%) Avg: 3.8874ms (-99.01%) Median: 3.8874ms (-99.01%) [3.8874ms .. 3.8874ms] subfields=4096 exists_json_union Memory: 2.0 MB Avg: 3.5571ms (-99.90%) Median: 3.5571ms (-99.90%) [3.5571ms .. 3.5571ms] subfields=65536 exists_json_union Memory: 28.3 MB Avg: 14.4417ms (-99.97%) Median: 14.4417ms (-99.97%) [14.4417ms .. 14.4417ms] subfields=262144 exists_json_union Memory: 113.3 MB Avg: 66.2860ms (-99.95%) Median: 66.2860ms (-99.95%) [66.2860ms .. 66.2860ms] * rename methods	2025-09-16 18:21:03 +02:00
PSeitz-dd	7963b0b4aa	Add fast field fallback for term query if not indexed (#2693 ) * Add fast field fallback for term query if not indexed * only fallback without scores	2025-09-12 14:58:21 +02:00
Paul Masurel	5d6c8de23e	Align search float search logic to the columnar coercion rules It applies the same logic on floats as for u64 or i64. In all case, the idea is (for the inverted index) to coerce number to their canonical representation, before indexing and before searching. That way a document with the float 1.0 will be searchable when the user searches for 1. Note that contrary to the columnar, we do not attempt to coerce all of the terms associated to a given json path to a single numerical type. We simply rely on this "point-wise" canonicalization.	2025-09-09 19:28:17 +02:00
Raphaël Cohen	f4b374110f	feat: Regex query grammar (#2677 ) * feat: Regex query grammar * feat: Disable regexes by default * chore: Apply formatting	2025-09-03 10:07:04 +02:00
Darkheir	d4b090124c	feat: Support spaces between field name and value	2025-07-23 11:12:13 +02:00
trinity-1686a	bc1c789897	Merge pull request #2676 from quickwit-oss/trinity.pointard/allow-partial-default-field-success ignore failure to parse query when other default field suceeded	2025-07-18 14:20:41 +02:00
trinity Pointard	e7c8c331bd	ignore failure to parse query when other default field suceeded	2025-07-17 14:47:28 +02:00
PSeitz	945af922d1	clippy (#2661 ) * clippy * use readable version --------- Co-authored-by: Pascal Seitz <pascal.seitz@datadoghq.com>	2025-07-02 11:25:03 +02:00
PSeitz-dd	295d07e55c	fix union performance regression (#2663 ) closes https://github.com/quickwit-oss/tantivy/issues/2656	2025-07-01 20:32:25 +02:00
Remi Dettai	71cf19870b	Exist queries match subpath fields (#2558 ) * Exist queries match subpath fields * Make subpath check optional * Add async subpath listing	2025-01-06 10:17:39 +01:00
PSeitz	21d057059e	clippy (#2527 ) * clippy * clippy * clippy * clippy * convert allow to expect and remove unused * cargo fmt * cleanup * export sample * clippy	2024-10-22 09:26:54 +08:00
PSeitz	dca508b4ca	remove read_postings_no_deletes (#2526 ) closes #2525	2024-10-22 09:52:43 +09:00
PSeitz	aebae9965d	add RegexPhraseQuery (#2516 ) * add RegexPhraseQuery RegexPhraseQuery supports phrase queries with regex. It supports regex and wildcards. E.g. a query with wildcards: "b* b* wolf" matches "big bad wolf" Slop is supported as well: "b* wolf"~2 matches "big bad wolf" Regex queries may match a lot of terms where we still need to keep track which term hit to load the positions. The phrase query algorithm groups terms by their frequency together in the union to prefilter groups early. This PR comes with some new datastructures: SimpleUnion - A union docset for a list of docsets. It doesn't do any caching and is therefore well suited for datasets with lots of skipping. (phrase search, but intersections in general) LoadedPostings - Like SegmentPostings, but all docs and positions are loaded in memory. SegmentPostings uses 1840 bytes per instance with its caches, which is equivalent to 460 docids. LoadedPostings is used for terms which have less than 100 docs. LoadedPostings is only used to reduce memory consumption. BitSetPostingUnion - Creates a `Posting` that uses the bitset for docid hits and the docsets for positions. The BitSet is the precalculated union of the docsets In the RegexPhraseQuery there is a size limit of 512 docsets per PreAggregatedUnion, before creating a new one. Renamed Union to BufferedUnionScorer Added proptests to test different union types. * cleanup * use Box instead of Vec * use RefCell instead of term_freq(&mut) * remove wildcard mode * move RefCell to outer * clippy	2024-10-21 18:29:17 +08:00
Bruce Mitchener	c17e513377	Reduce typo count. (#2510 )	2024-10-10 09:55:37 +08:00
trinity-1686a	85395d942a	fix clippy lints from 1.80-1.81 (#2488 ) * fix some clippy lints * fix clippy::doc_lazy_continuation * fix some lints for 1.82	2024-09-05 14:33:05 +02:00
PSeitz	a206c3ccd3	add compat tests (#2485 )	2024-09-04 18:26:57 +08:00
Chaya	dc5d31c116	grammar and misspellings (#2483 ) * grammar * grammar * misspelling	2024-09-04 12:45:31 +08:00
PSeitz	c71ec8086d	add FastFieldRangeQuery, rename (#2477 ) * add FastFieldRangeQuery, rename * remove Query impl	2024-08-19 09:02:00 +02:00
PSeitz	27be6aed91	lift clauses in LogicalAst (#2449 ) (a OR b) OR (c OR d) can be simplified to (a OR b OR c OR d) (a AND b) AND (c AND d) can be simplified to (a AND b AND c AND d) This directly affects how queries are executed remove unused SumWithCoordsCombiner the number of fields is unused and private	2024-08-14 19:21:26 +02:00
PSeitz	3d1c4b313a	support ff range queries on json fields (#2456 ) * support ff range queries on json fields * fix term date truncation * use inverted index range query for phrase prefix queries * rename to InvertedIndexRangeQuery * fix column filter, add mixed column test	2024-08-02 00:06:50 +08:00
PSeitz	d8843c608c	make FastFieldRangeWeight::new pub (#2460 )	2024-07-29 10:39:27 +08:00

1 2 3 4 5 ...

602 Commits