Files
tantivy/src/indexer
Paul Masurel 6b59ec6fd5 Fix bug occuring when merging JSON object indexed with positions.
In JSON Object field the presence of term frequencies depend on the
field.
Typically, a string with postiions indexed will have positions
while numbers won't.

The presence or absence of term freqs for a given term is unfortunately
encoded in a very passive way.

It is given by the presence of extra information in the skip info, or
the lack of term freqs after decoding vint blocks.

Before, after writing a segment, we would encode the segment correctly
(without any term freq for number in json object field).
However during merge, we would get the default term freq=1 value.
(this is default in the absence of encoded term freqs)

The merger would then proceed and attempt to decode 1 position when
there are in fact none.

This PR requires to explictly tell the posting serialize whether
term frequencies should be serialized for each new term.

Closes #2251
2023-11-14 22:41:48 +09:00
..
2022-09-26 09:37:55 +09:00
2023-10-25 09:17:31 +02:00
2022-08-13 18:25:47 +08:00
2023-10-24 10:05:22 +02:00
2023-11-01 11:25:17 +01:00
2022-01-28 15:55:55 +09:00
2022-01-28 15:55:55 +09:00
2023-09-04 11:06:56 +09:00
2023-11-06 11:18:38 +00:00