Paul Masurel
2069e3e52b
Fixing clippy comments
2022-02-01 10:24:05 +09:00
Paul Masurel
eca6628b3c
Minor refactoring ( #1266 )
2022-01-28 15:55:55 +09:00
Paul Masurel
732f6847c0
Field type with codes ( #1255 )
...
* Term are now typed.
This change is backward compatible:
While the Term has a byte representation that is modified, a Term itself
is a transient object that is not serialized as is in the index.
Its .field() and .value_bytes() on the other hand are unchanged.
This change offers better Debug information for terms.
While not necessary it also will help in the support for JSON types.
* Renamed Hierarchical Facet -> Facet
2022-01-07 20:49:00 +09:00
Paul Masurel
c81b3030fa
Issue/922b ( #1233 )
...
* Add a NORMED options on field
Make fieldnorm indexation optional:
* for all types except text => added a NORMED options
* for text field
** if STRING, field has not fieldnorm retained
** if TEXT, field has fieldnorm computed
* Finalize making fieldnorm optional for all field types.
- Using Option for fieldnorm readers.
2021-12-10 21:12:29 +09:00
Paul Masurel
dde49ac8e2
Closes #1195 ( #1222 )
...
Removes the indexed option for facets.
Facets are now always indexed.
Closes #1195
2021-12-02 14:37:19 +09:00
Paul Masurel
7234bef0eb
Issue/1198 ( #1201 )
...
* Unit test reproducing #1198
* Fixing unit test to handle the error from add_document.
* Bump project version
2021-11-11 16:42:19 +09:00
François Massot
0462754673
Optimize block wand for one and several TermScorer. ( #1190 )
...
* Added optimisation using block wand for single TermScorer.
A proptest was also added.
* Fix block wand algorithm by taking the last doc id of scores until the pivot scorer (included).
* In block wand, when block max score is lower than the threshold, advance the scorer with best score.
* Fix wrong condition in block_wand_single_scorer and add debug_assert to have an equality check on doc to break the loop.
2021-11-01 09:18:05 +09:00
Paul Masurel
27acfa4dea
Removing dead file ( #1170 )
2021-10-07 14:15:21 +09:00
Paul Masurel
02cffa4dea
Code simplification. ( #1169 )
...
Code simplification and Clippy
2021-10-07 14:11:44 +09:00
Paul Masurel
b52abbc771
Bugfix transposition_cost_one in FuzzyQuery ( #1167 )
2021-10-07 09:38:39 +09:00
Pascal Seitz
5ee5037934
create and use ReadSerializedBitSet
2021-09-24 12:53:33 +08:00
Pascal Seitz
c27ccd3e24
improve naming
2021-09-23 21:02:09 +08:00
Pascal Seitz
d7a6a409a1
renames
2021-09-23 20:33:11 +08:00
Pascal Seitz
a1f5cead96
AliveBitSet instead of DeleteBitSet
2021-09-23 20:03:57 +08:00
sigaloid
096ce7488e
Resolve some clippys, format ( #1144 )
...
* cargo +nightly clippy --fix -Z unstable-options
2021-08-26 08:46:00 +09:00
Pascal Seitz
e0b83eb291
cargo fmt
2021-08-21 18:52:10 +01:00
PSeitz
13401f46ea
add wildcard mention
2021-08-21 18:10:33 +01:00
Pascal Seitz
3265f7bec3
dissolve common module
2021-08-19 23:26:34 +01:00
Pascal Seitz
ee0881712a
move bitset to common crate, move composite file to directory
2021-08-19 17:45:09 +01:00
Pascal Seitz
0062fe705d
cargo fmt
2021-07-01 18:17:08 +02:00
Pascal Seitz
9b3e508753
fix clippy
2021-07-01 18:06:09 +02:00
Pascal Seitz
a1ac63ee1c
fix clippy
2021-07-01 18:06:03 +02:00
Pascal Seitz
1e4df54ab3
fix clippy
2021-07-01 17:41:53 +02:00
Pascal Seitz
10f056fbb4
apply clippy fixes
2021-07-01 17:08:44 +02:00
Paul Masurel
6e4b61154f
Issue/1070 ( #1071 )
...
Add a boolean flag in the Query::query_terms informing on whether
position information is required.
Closes #1070
2021-06-03 22:33:20 +09:00
Moriyoshi Koizumi
4afba005f9
Provide a means to deal with malformed facet text representation for the query parser ( #1056 )
...
* Provide a means to deal with malformed facet text representation for the query parser.
* Specific error enum for the facet parse error.
2021-05-27 12:16:49 +09:00
Paul Masurel
fd8e5bdf57
Rename more like this
2021-05-21 16:32:39 +09:00
Evance Souamoro
372d12766a
fix cargo fmt
2021-05-03 10:26:56 +00:00
Evance Souamoro
d71aa57077
reusing idf from bm25 module as it was the same logic
2021-05-03 10:05:40 +00:00
Evance Souamoro
712c01aa93
fixed term sorting & moved it to a better place
2021-05-01 05:40:59 +00:00
Evance Souamoro
cde324d4b4
fixed issues based on comment, still need to check BM25 suggestion
2021-04-30 21:14:19 +00:00
Evance Souamoro
2c0f6e3319
add builder to the public for documentation
2021-04-29 12:38:16 +00:00
Evance Souamoro
27f587aa13
applied cargo fmt
2021-04-29 12:15:34 +00:00
Evance Souamoro
cfc27c9665
add support for more like this query
2021-04-29 11:49:27 +00:00
Paul Masurel
075c23eb8c
Disabling fetching fieldnorm in phrasequery if scoring is disabled.
2021-04-27 14:06:41 +09:00
Paul Masurel
39dd8cfe24
Cargo clippy. Acronym should not be full uppercase apparently.
2021-04-26 11:49:18 +09:00
Paul Masurel
18bfe131fe
Removed all inline(always) like a baboon.
2021-04-23 10:47:13 +09:00
Evance Souamoro
d69aace9ec
fixed potential typo
2021-04-20 07:21:47 +00:00
Rihards Krišlauks
f58345f0f0
Add a date range query example to QueryParser documentation
2021-04-18 22:13:02 +03:00
Stéphane Campinas
a0ec6e1e9d
Expand the DocAddress struct with named fields
2021-03-28 19:00:23 +02:00
Laurent Pouget
4b34231f28
Make facet indexation and storage optional
...
Added a FacetOptions for HierarchicalFacet which add indexed and stored flags to it.
Propagate change and update tests accordingly
Added a test to ensure that a not indexed flag was taken care of.
Added on Value implem the `path()` function to return the stored facet.
2021-03-24 14:56:27 +01:00
bstrie
5f740d9ab4
Replace deprecated collections::Bound with ops::Bound
2021-03-18 17:20:36 -04:00
Paul Masurel
9c3cabce40
Updated version of the rand crate.
2021-01-06 18:09:00 +09:00
Paul Masurel
af6dfa1856
Small refactoring
2020-12-03 14:27:05 +09:00
Paul Masurel
80a99539ce
Several TermDict operation now returns an io::Result
2020-12-03 13:13:11 +09:00
Paul Masurel
1741619c7f
DocSet is send
2020-12-01 19:11:21 +09:00
Paul Masurel
e9aa27dace
Avoid computing the BM25 weight if scoring is disabled
2020-11-25 14:35:49 +09:00
Paul Masurel
7f0e61b173
Refactoring of the skip index.
...
The skip index now identifies both the start and the end offset
of blocks. Checkpoints are compressed in blocks, reaching better
compression.
2020-11-17 16:05:11 +09:00
Paul Masurel
6d4b982417
Marked blockwand test as ignored.
...
- Using impl trait for iterating `matching_segments` in the termdict
merger
2020-11-16 13:44:14 +09:00
Paul Masurel
a49e59053c
Making block wand test more robusts
2020-11-10 18:01:38 +09:00