RT_Enzyme
ff3d3313c4
fix BooleanQuery document ( #1999 )
...
* fix BooleanQuery document
* Update src/query/boolean_query/boolean_query.rs
---------
Co-authored-by: Paul Masurel <paul@quickwit.io>
2023-04-20 11:37:20 +02:00
Paul Masurel
4b01cc4c49
Made BooleanWeight and BoostWeight public ( #1991 )
2023-04-12 10:26:30 +09:00
PSeitz
6a7a1106d6
work in batches of docs ( #1937 )
...
* work in batches of docs
* add fill_buffer test
2023-03-21 06:57:44 +01:00
Alex Cole
f2f38c43ce
Make BM25 scoring more flexible ( #1855 )
...
* Introduce Bm25StatisticsProvider to inject statistics
* fix formatting I accidentally changed
2023-02-16 19:14:12 +09:00
Shikhar Bhushan
2650111b76
EnableScoring::Disabled - optional Searcher ( #1780 )
2023-01-12 09:26:50 -05:00
Adam Reichold
2080c370c2
Enable usage of FuzzyTermQuery for specific fields via QueryParser ( #1750 )
...
* Make nightly Clippy mostly happy.
* Document how to produce TermSetQuery queries using QueryParser.
* Enable construction of queries using FuzzyTermQuery via the QueryParser
* Use FxHashMap instead of HashMap in the QueryParser as these hash tables are not exposed to DoS attacks.
* Use a struct instead of a tuple to improve readability.
2023-01-04 18:11:27 +09:00
Paul Masurel
3edf0a2724
Using the manual reload policy in IndexWriter. ( #1667 )
2022-11-09 11:20:41 +01:00
PSeitz
2af6b01c17
Update src/query/boolean_query/boolean_weight.rs
...
Co-authored-by: Paul Masurel <paul@quickwit.io>
2022-11-01 16:13:00 +08:00
Pascal Seitz
dfab201191
for_each_docset to iterate without score
2022-10-26 17:25:05 +08:00
Pascal Seitz
af839753e0
No score calls if score is not requested
2022-10-26 12:18:35 +08:00
Pascal Seitz
6800fdec9d
add indexing for ip field
...
Closes #1595
2022-10-18 10:07:48 +08:00
Bruce Mitchener
44e03791f9
Fix warnings when doc'ing private items. ( #1579 )
...
This also fixes a couple of typos, but plenty remain!
2022-10-03 14:24:00 +09:00
Bruce Mitchener
a24ae8d924
clippy: Fix needless-borrow warnings. ( #1581 )
...
These show on nightly clippy.
2022-10-03 14:15:09 +09:00
Bruce Mitchener
cf02e32578
Improvements to doc linking, grammar, etc.
2022-09-19 18:10:22 +07:00
Adam Reichold
71ab482720
RFC: Use a more general but still object-safe signature for Query::query_terms. ( #1468 )
...
* Use a more general but still object-safe signature for Query::query_terms.
* Further constrain the generalized Query::query_terms signature to allow extracting references to terms.
2022-08-24 06:34:07 +09:00
PSeitz
8edcd6f958
Merge pull request #1428 from izihawa/feature/dismax
...
[feat] Implement `DisjunctionMaxQuery` and refactor `ScoreCombiner`
2022-08-22 06:15:30 -07:00
Kian-Meng Ang
014b1adc3e
cargo +nightly fmt
2022-08-17 22:33:44 +08:00
Kian-Meng Ang
84295d5b35
cargo fmt
2022-08-15 21:07:01 +08:00
Kian-Meng Ang
625bcb4877
Fix typos and markdowns
...
Found via these commands:
codespell -L crate,ser,panting,beauti,hart,ue,atleast,childs,ond,pris,hel,mot
markdownlint *.md doc/src/*.md --disable MD013 MD025 MD033 MD001 MD024 MD036 MD041 MD003
2022-08-13 18:25:47 +08:00
Pasha Podolsky
09aae134e6
[feat] Implement DisjunctionMaxQuery and refactor ScoreCombiner
2022-07-28 20:47:20 +03:00
Ryan Russell
b33b4c0092
Fix various occurrence var names and references ( #1385 )
...
Thank you Ryan!
Signed-off-by: Ryan Russell <git@ryanrussell.org>
2022-06-07 11:08:19 +09:00
PSeitz
4b62f7907d
Merge pull request #1297 from PSeitz/fix_clippy
...
fix clippy issues
2022-03-02 10:11:56 +01:00
Antoine G
e37775fe21
iff->if or if and only if ( #1298 )
...
* has_xxx is_xxx -> if, these functions usually define equivalence
xxx returns bool -> specify equivalence when appropriate
* fix doc
2022-03-02 11:00:00 +09:00
Pascal Seitz
091b668624
fix clippy issues
2022-03-01 08:58:51 +01:00
Paul Masurel
d7b46d2137
Added JSON Type ( #1270 )
...
- Removed useless copy when ingesting JSON.
- Bugfix in phrase query with a missing field norms.
- Disabled range query on default fields
Closes #1251
2022-02-24 16:25:22 +09:00
Paul Masurel
2069e3e52b
Fixing clippy comments
2022-02-01 10:24:05 +09:00
Paul Masurel
eca6628b3c
Minor refactoring ( #1266 )
2022-01-28 15:55:55 +09:00
Paul Masurel
7234bef0eb
Issue/1198 ( #1201 )
...
* Unit test reproducing #1198
* Fixing unit test to handle the error from add_document.
* Bump project version
2021-11-11 16:42:19 +09:00
François Massot
0462754673
Optimize block wand for one and several TermScorer. ( #1190 )
...
* Added optimisation using block wand for single TermScorer.
A proptest was also added.
* Fix block wand algorithm by taking the last doc id of scores until the pivot scorer (included).
* In block wand, when block max score is lower than the threshold, advance the scorer with best score.
* Fix wrong condition in block_wand_single_scorer and add debug_assert to have an equality check on doc to break the loop.
2021-11-01 09:18:05 +09:00
sigaloid
096ce7488e
Resolve some clippys, format ( #1144 )
...
* cargo +nightly clippy --fix -Z unstable-options
2021-08-26 08:46:00 +09:00
Pascal Seitz
1e4df54ab3
fix clippy
2021-07-01 17:41:53 +02:00
Paul Masurel
6e4b61154f
Issue/1070 ( #1071 )
...
Add a boolean flag in the Query::query_terms informing on whether
position information is required.
Closes #1070
2021-06-03 22:33:20 +09:00
Paul Masurel
39dd8cfe24
Cargo clippy. Acronym should not be full uppercase apparently.
2021-04-26 11:49:18 +09:00
Stéphane Campinas
a0ec6e1e9d
Expand the DocAddress struct with named fields
2021-03-28 19:00:23 +02:00
Paul Masurel
7f0e61b173
Refactoring of the skip index.
...
The skip index now identifies both the start and the end offset
of blocks. Checkpoints are compressed in blocks, reaching better
compression.
2020-11-17 16:05:11 +09:00
Paul Masurel
6d4b982417
Marked blockwand test as ignored.
...
- Using impl trait for iterating `matching_segments` in the termdict
merger
2020-11-16 13:44:14 +09:00
Paul Masurel
a49e59053c
Making block wand test more robust
2020-11-10 18:01:38 +09:00
Paul Masurel
36a0520a48
Added failing proptest and fixed it.
2020-11-05 15:40:00 +09:00
Paul Masurel
730ccefffb
Fixes a bug in TermQuery::explain.
...
Closes #915
2020-10-28 22:29:15 +09:00
Paul Masurel
9e27da8b4e
Added CR comments.
...
Added Unit tests.
2020-10-28 17:35:34 +09:00
Adrien Guillo
7f373f232a
Add helper methods for BooleanQuery
2020-10-28 17:35:34 +09:00
Paul Masurel
439d6956a9
Returning Result in some of the API ( #880 )
...
* Returning Result in some of the API
* Introducing `.writer_for_test(..)`
2020-09-07 15:52:34 +09:00
Paul Masurel
4914076e8f
Fixing release build
2020-08-21 08:57:27 +09:00
Paul Masurel
f355695581
Code clean up
2020-08-20 15:42:50 +09:00
Paul Masurel
2481c87be8
Block wand ( #856 )
2020-08-19 22:36:36 +09:00
Paul Masurel
6db8bb49d6
Assert nearly equals macro ( #853 )
...
* Assert nearly equals macro
* Renamed specialized_scorer in TermScorer
2020-07-17 16:40:41 +09:00
Paul Masurel
f71b04acb0
Bugfix. ( #849 )
...
go_to_first_doc was typically calling seek with a target smaller than
doc.
Since SegmentPostings typically do a linear search on the full block,
regardless of the current position, it could have our segment postings
go backward.
2020-07-16 10:57:51 +09:00
Ype Kingma
7d773abc92
Boolean query: do not combine excluded scores. ( #840 )
...
* Do nothing when combining score values of excluded scores.
* Add test case for two excluded.
* Test score for two excluded terms.
* Use TopDocs in test_boolean_query_two_excluded
2020-06-08 20:01:19 +09:00
Paul Masurel
c0f5645cd9
Move for_each functions from Scorer to Weight. ( #836 )
...
* Move for_each functions from Scorer to Weight.
* Specialized foreach / foreach_pruning for union of termscorer.
2020-06-01 11:31:18 +09:00
Paul Masurel
e25284bafe
Major change in the DocSet/Scorer API ( #824 )
...
- Change in the DocSet and Scorer API. (@fulmicoton).
A freshly created DocSet points directly to its first doc. A sentinel value called TERMINATED marks the end of a DocSet.
`.advance()` returns the new DocId. `Scorer::skip(target)` has been replaced by `Scorer::seek(target)`, which returns the resulting DocId.
As a result, iterating through a DocSet now looks as follows:
```rust
let mut doc = docset.doc();
while doc != TERMINATED {
    // ...
    doc = docset.advance();
}
```
The change made it possible to greatly simplify a lot of the docset's code.
- Misc internal optimization and introduction of the `Scorer::for_each_pruning` function. (@fulmicoton)
2020-05-16 16:33:36 +09:00
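
The iteration convention described in the commit above (#824) can be tried out without the library. What follows is a minimal sketch, not tantivy's actual code: the `DocSet` trait is a hypothetical, trimmed-down stand-in, `VecDocSet` and `collect_docs` are illustrative names, and the value chosen for `TERMINATED` is an assumption made for this example. It shows a docset that points at its first doc on creation, `advance()` and `seek(target)` returning the resulting DocId, and the `while doc != TERMINATED` loop.

```rust
/// Document identifier (assumed to be a 32-bit unsigned integer in this sketch).
type DocId = u32;

/// Sentinel marking the end of a DocSet. The concrete value is an assumption.
const TERMINATED: DocId = i32::MAX as u32;

/// Hypothetical, trimmed-down trait following the convention in the commit message.
trait DocSet {
    /// Returns the current doc, or TERMINATED once the set is exhausted.
    fn doc(&self) -> DocId;

    /// Moves to the next doc and returns it (TERMINATED at the end).
    fn advance(&mut self) -> DocId;

    /// Moves to the first doc >= target and returns it.
    /// Naive linear default; a real implementation would skip faster.
    fn seek(&mut self, target: DocId) -> DocId {
        let mut doc = self.doc();
        while doc != TERMINATED && doc < target {
            doc = self.advance();
        }
        doc
    }
}

/// Toy DocSet backed by a sorted vector of doc ids.
struct VecDocSet {
    docs: Vec<DocId>,
    cursor: usize,
}

impl VecDocSet {
    /// A freshly created docset already points to its first doc.
    fn new(docs: Vec<DocId>) -> Self {
        VecDocSet { docs, cursor: 0 }
    }
}

impl DocSet for VecDocSet {
    fn doc(&self) -> DocId {
        self.docs.get(self.cursor).copied().unwrap_or(TERMINATED)
    }

    fn advance(&mut self) -> DocId {
        if self.cursor < self.docs.len() {
            self.cursor += 1;
        }
        self.doc()
    }
}

/// Drains a docset with the loop shown in the commit message.
fn collect_docs(docset: &mut dyn DocSet) -> Vec<DocId> {
    let mut out = Vec::new();
    let mut doc = docset.doc();
    while doc != TERMINATED {
        out.push(doc);
        doc = docset.advance();
    }
    out
}
```

Because `doc()` is valid immediately after construction, the loop needs no priming call to `advance()`, and an empty docset simply reports `TERMINATED` from the start.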