mirror of
https://github.com/quickwit-oss/tantivy.git
synced 2026-01-07 17:42:55 +00:00
* Adding a write schema to columnar's merge operations. * Added unit test checking min/max when columns are empty. * CR comment * Rename to value_type_to_column_type
1.4 KiB
1.4 KiB
zero to one
- revisit line codec
- add columns from schema on merge
- Plugging JSON
- replug examples
- move datetime to quickwit common
- switch to nanos
- reintroduce the gcd map.
Perf and Size
- remove alloc in
ord_to_term
- multivaued range queries restrat frm the beginning all of the time.
- re-add ZSTD compression for dictionaries no systematic monotonic mapping consider removing multilinear f32? adhoc solution for bool? add metrics helper for aggregate. sum(row_id) review inline absence/presence improv perf of select using PDEP compare with roaring bitmap/elias fano etc etc. SIMD range? (see blog post) Add alignment? Consider another codec to bridge the gap between few and 5k elements
Cleanup and rationalization
in benchmark, unify percent vs ratio, f32 vs f64. investigate if should have better errors? io::Error is overused at the moment. rename rank/select in unit tests Review the public API via cargo doc go through TODOs remove all doc_id occurences -> row_id use the rank & select naming in unit tests branch. multi-linear -> blockwise linear codec -> simply a multiplication for the index column rename columnar to something more explicit, like column_dictionary or columnar_table rename fastfield -> column document changes rationalization FastFieldValue, HasColumnType isolate u128_based and uniform naming
Other
fix enhance column-cli
Santa claus
autodetect datetime ipaddr, plug customizable tokenizer.