Bump codecov/codecov-action from 3 to 5

Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 3 to 5.
- [Release notes](https://github.com/codecov/codecov-action/releases)
- [Changelog](https://github.com/codecov/codecov-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/codecov/codecov-action/compare/v3...v5)

---
updated-dependencies:
- dependency-name: codecov/codecov-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-02-26 01:30:37 +00:00 · 2024-11-14 20:12:50 +00:00
220 changed files with 3475 additions and 10900 deletions
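
Note on the `bitpacker/src/bitpacker.rs` hunks in this diff: they change the bit-address arithmetic in `BitUnpacker` between `usize` and `u32`, and swap `div_ceil(8)` for `(n + 7) / 8`. The following is a minimal standalone sketch of why the integer width matters. It is not tantivy code: the function names are hypothetical, and it widens to `u64` rather than `usize` so that it behaves the same on every target.

```rust
/// Hypothetical helper: byte address of value `idx` when every value
/// occupies `num_bits` bits, computed in 32-bit arithmetic.
/// Returns `None` when the bit address does not fit in a `u32`.
fn byte_addr_u32(idx: u32, num_bits: u32) -> Option<usize> {
    idx.checked_mul(num_bits).map(|bits| (bits >> 3) as usize)
}

/// Same computation, widened to 64 bits before multiplying,
/// so the product of two `u32` values cannot wrap.
fn byte_addr_wide(idx: u32, num_bits: u32) -> u64 {
    (idx as u64 * num_bits as u64) >> 3
}

/// Number of bytes needed to hold `n` bits.
/// `n.div_ceil(8)` and `(n + 7) / 8` agree whenever `n + 7` does not overflow.
fn bytes_for_bits(n: usize) -> usize {
    n.div_ceil(8)
}

fn main() {
    // 2^27 values of 32 bits each: the bit address is 2^32, one past u32::MAX.
    let idx = 1u32 << 27;
    assert_eq!(byte_addr_u32(idx, 32), None);
    assert_eq!(byte_addr_wide(idx, 32), 1u64 << 29);

    // Both rounding idioms give the same byte count for small inputs.
    assert_eq!(bytes_for_bits(9), (9 + 7) / 8);
    println!("ok");
}
```

In this sketch the 32-bit product overflows once the packed data reaches 2^32 bits (512 MiB). The unchecked `idx * self.num_bits` on the `u32` side of the diff would panic at that point in a debug build and wrap in a release build.
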
--- a/.github/workflows/coverage.yml
+++ b/.github/workflows/coverage.yml
@@ -21,7 +21,7 @@ jobs:
      - name: Generate code coverage
        run: cargo +nightly-2024-07-01 llvm-cov --all-features --workspace --doctests --lcov --output-path lcov.info
      - name: Upload coverage to Codecov
-        uses: codecov/codecov-action@v3
+        uses: codecov/codecov-action@v5
        continue-on-error: true
        with:
          token: ${{ secrets.CODECOV_TOKEN }} # not required for public repos
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,42 +1,11 @@
-Tantivy 0.25
+Tantivy 0.23 - Unreleased
 ================================
-
-## Bugfixes
- fix union performance regression in tantivy 0.24 [#2663](https://github.com/quickwit-oss/tantivy/pull/2663)(@PSeitz)
- make zstd optional in sstable [#2633](https://github.com/quickwit-oss/tantivy/pull/2633)(@Parth)
- Fix TopDocs::order_by_string_fast_field for asc order [#2672](https://github.com/quickwit-oss/tantivy/pull/2672)(@stuhood @PSeitz)
-
-## Features/Improvements
- add docs/example and Vec<u32> values to sstable [#2660](https://github.com/quickwit-oss/tantivy/pull/2660)(@PSeitz)
- Add string fast field support to `TopDocs`. [#2642](https://github.com/quickwit-oss/tantivy/pull/2642)(@stuhood)
- update edition to 2024 [#2620](https://github.com/quickwit-oss/tantivy/pull/2620)(@PSeitz)
- Allow optional spaces between the field name and the value in the query parser [#2678](https://github.com/quickwit-oss/tantivy/pull/2678)(@Darkheir)
- Support mixed field types in query parser [#2676](https://github.com/quickwit-oss/tantivy/pull/2676)(@trinity-1686a)
- Add per-field size details [#2679](https://github.com/quickwit-oss/tantivy/pull/2679)(@fulmicoton)
-
-Tantivy 0.24.2
-================================
- Fix TopNComputer for reverse order. [#2672](https://github.com/quickwit-oss/tantivy/pull/2672)(@stuhood @PSeitz) 
-
-Affected queries are [order_by_fast_field](https://docs.rs/tantivy/latest/tantivy/collector/struct.TopDocs.html#method.order_by_fast_field) and
-[order_by_u64_field](https://docs.rs/tantivy/latest/tantivy/collector/struct.TopDocs.html#method.order_by_u64_field)
-for `Order::Asc`
-
-Tantivy 0.24.1
-================================
- Fix: bump required rust version to 1.81
-  
-Tantivy 0.24
-================================
-Tantivy 0.24 will be backwards compatible with indices created with v0.22 and v0.21. The new minimum rust version will be 1.75. Tantivy 0.23 will be skipped.
+Tantivy 0.23 will be backwards compatible with indices created with v0.22 and v0.21.

 #### Bugfixes
 - fix potential endless loop in merge [#2457](https://github.com/quickwit-oss/tantivy/pull/2457)(@PSeitz)
 - fix bug that causes out-of-order sstable key. [#2445](https://github.com/quickwit-oss/tantivy/pull/2445)(@fulmicoton)
 - fix ReferenceValue API flaw [#2372](https://github.com/quickwit-oss/tantivy/pull/2372)(@PSeitz)
- fix `OwnedBytes` debug panic [#2512](https://github.com/quickwit-oss/tantivy/pull/2512)(@b41sh)
- catch panics during merges [#2582](https://github.com/quickwit-oss/tantivy/pull/2582)(@rdettai)
- switch from u32 to usize in bitpacker. This enables multivalued columns larger than 4GB, which crashed during merge before. [#2581](https://github.com/quickwit-oss/tantivy/pull/2581) [#2586](https://github.com/quickwit-oss/tantivy/pull/2586)(@fulmicoton-dd @PSeitz)

 #### Breaking API Changes
 - remove index sorting [#2434](https://github.com/quickwit-oss/tantivy/pull/2434)(@PSeitz)
@@ -54,7 +23,6 @@ Tantivy 0.24 will be backwards compatible with indices created with v0.22 and v0
    - reduce top hits memory consumption [#2426](https://github.com/quickwit-oss/tantivy/pull/2426)(@PSeitz)
    - check unsupported parameters top_hits [#2351](https://github.com/quickwit-oss/tantivy/pull/2351)(@PSeitz)
    - Change AggregationLimits to AggregationLimitsGuard [#2495](https://github.com/quickwit-oss/tantivy/pull/2495)(@PSeitz)
-    - add support for counting non integer in aggregation [#2547](https://github.com/quickwit-oss/tantivy/pull/2547)(@trinity-1686a)
 - **Range Queries**
    - Support fast field range queries on json fields [#2456](https://github.com/quickwit-oss/tantivy/pull/2456)(@PSeitz)
    - Add support for str fast field range query [#2460](https://github.com/quickwit-oss/tantivy/pull/2460) [#2452](https://github.com/quickwit-oss/tantivy/pull/2452) [#2453](https://github.com/quickwit-oss/tantivy/pull/2453)(@PSeitz)
@@ -65,18 +33,9 @@ Tantivy 0.24 will be backwards compatible with indices created with v0.22 and v0
 - add columnar format compatibility tests [#2433](https://github.com/quickwit-oss/tantivy/pull/2433)(@PSeitz)
 - Improved snippet ranges algorithm [#2474](https://github.com/quickwit-oss/tantivy/pull/2474)(@gezihuzi)
 - make find_field_with_default return json fields without path [#2476](https://github.com/quickwit-oss/tantivy/pull/2476)(@trinity-1686a)
- Make `BooleanQuery` support `minimum_number_should_match` [#2405](https://github.com/quickwit-oss/tantivy/pull/2405)(@LebranceBW)
- Make `NUM_MERGE_THREADS` configurable [#2535](https://github.com/quickwit-oss/tantivy/pull/2535)(@Barre)
+- feat(query): Make `BooleanQuery` support `minimum_number_should_match` [#2405](https://github.com/quickwit-oss/tantivy/pull/2405)(@LebranceBW)

- **RegexPhraseQuery** 
-`RegexPhraseQuery` supports phrase queries with regex. E.g. query "b.* b.* wolf" matches "big bad wolf". Slop is supported as well: "b.* wolf"~2 matches "big bad wolf" [#2516](https://github.com/quickwit-oss/tantivy/pull/2516)(@PSeitz)
-
- **Optional Index in Multivalue Columnar Index** 
-For mostly empty multivalued indices there was a large overhead during creation when iterating all docids (merge case). 
-This is alleviated by placing an optional index in the multivalued index to mark documents that have values. 
-This will slightly increase space and access time. [#2439](https://github.com/quickwit-oss/tantivy/pull/2439)(@PSeitz)
-
- **Store DateTime as nanoseconds in doc store** DateTime in the doc store was truncated to microseconds previously. This removes this truncation, while still keeping backwards compatibility. [#2486](https://github.com/quickwit-oss/tantivy/pull/2486)(@PSeitz)
+- **Optional Index in Multivalue Columnar Index** For mostly empty multivalued indices there was a large overhead during creation when iterating all docids (merge case). This is alleviated by placing an optional index in the multivalued index to mark documents that have values. This will slightly increase space and access time. [#2439](https://github.com/quickwit-oss/tantivy/pull/2439)(@PSeitz)

 - **Performace/Memory**
    - lift clauses in LogicalAst for optimized ast during execution [#2449](https://github.com/quickwit-oss/tantivy/pull/2449)(@PSeitz)
@@ -92,29 +51,18 @@ This will slightly increase space and access time. [#2439](https://github.com/qu
    - fix de-escaping too much in query parser [#2427](https://github.com/quickwit-oss/tantivy/pull/2427)(@trinity-1686a)
    - improve query parser [#2416](https://github.com/quickwit-oss/tantivy/pull/2416)(@trinity-1686a)
    - Support field grouping `title:(return AND "pink panther")` [#2333](https://github.com/quickwit-oss/tantivy/pull/2333)(@trinity-1686a)
-    - allow term starting with wildcard [#2568](https://github.com/quickwit-oss/tantivy/pull/2568)(@trinity-1686a)

- Exist queries match subpath fields [#2558](https://github.com/quickwit-oss/tantivy/pull/2558)(@rdettai)
 - add access benchmark for columnar [#2432](https://github.com/quickwit-oss/tantivy/pull/2432)(@PSeitz)
 - extend indexwriter proptests [#2342](https://github.com/quickwit-oss/tantivy/pull/2342)(@PSeitz)
 - add bench & test for columnar merging [#2428](https://github.com/quickwit-oss/tantivy/pull/2428)(@PSeitz)
 - Change in Executor API [#2391](https://github.com/quickwit-oss/tantivy/pull/2391)(@fulmicoton)
 - Removed usage of num_cpus [#2387](https://github.com/quickwit-oss/tantivy/pull/2387)(@fulmicoton)
- use bingang for agg and stacker benchmark [#2378](https://github.com/quickwit-oss/tantivy/pull/2378)[#2492](https://github.com/quickwit-oss/tantivy/pull/2492)(@PSeitz) 
+- use bingang for agg benchmark [#2378](https://github.com/quickwit-oss/tantivy/pull/2378)(@PSeitz)
 - cleanup top level exports [#2382](https://github.com/quickwit-oss/tantivy/pull/2382)(@PSeitz)
 - make convert_to_fast_value_and_append_to_json_term pub [#2370](https://github.com/quickwit-oss/tantivy/pull/2370)(@PSeitz)
 - remove JsonTermWriter [#2238](https://github.com/quickwit-oss/tantivy/pull/2238)(@PSeitz)
 - validate sort by field type [#2336](https://github.com/quickwit-oss/tantivy/pull/2336)(@PSeitz)
 - Fix trait bound of StoreReader::iter [#2360](https://github.com/quickwit-oss/tantivy/pull/2360)(@adamreichold)
- remove read_postings_no_deletes [#2526](https://github.com/quickwit-oss/tantivy/pull/2526)(@PSeitz)
-
-Tantivy 0.22.1
-================================
- Fix TopNComputer for reverse order. [#2672](https://github.com/quickwit-oss/tantivy/pull/2672)(@stuhood @PSeitz) 
-
-Affected queries are [order_by_fast_field](https://docs.rs/tantivy/latest/tantivy/collector/struct.TopDocs.html#method.order_by_fast_field) and
-[order_by_u64_field](https://docs.rs/tantivy/latest/tantivy/collector/struct.TopDocs.html#method.order_by_u64_field)
-for `Order::Asc`

 Tantivy 0.22
 ================================
@@ -769,7 +717,7 @@ Tantivy 0.4.0
 - Raise the limit of number of fields (previously 256 fields) (@fulmicoton)
 - Removed u32 fields. They are replaced by u64 and i64 fields (#65) (@fulmicoton)
 - Optimized skip in SegmentPostings (#130) (@lnicola)
- Replacing rustc_serialize by serde. Kudos to  benchmark@KodrAus and @lnicola
+- Replacing rustc_serialize by serde. Kudos to @KodrAus and @lnicola
 - Using error-chain (@KodrAus)
 - QueryParser: (@fulmicoton)
  - Explicit error returned when searched for a term that is not indexed
--- a/Cargo.toml
+++ b/Cargo.toml
@@ -1,6 +1,6 @@
 [package]
 name = "tantivy"
-version = "0.25.0"
+version = "0.23.0"
 authors = ["Paul Masurel <paul.masurel@gmail.com>"]
 license = "MIT"
 categories = ["database-implementations", "data-structures"]
@@ -11,7 +11,7 @@ repository = "https://github.com/quickwit-oss/tantivy"
 readme = "README.md"
 keywords = ["search", "information", "retrieval"]
 edition = "2021"
-rust-version = "1.85"
+rust-version = "1.75"
 exclude = ["benches/*.json", "benches/*.txt"]

 [dependencies]
@@ -31,14 +31,14 @@ lz4_flex = { version = "0.11", default-features = false, optional = true }
 zstd = { version = "0.13", optional = true, default-features = false }
 tempfile = { version = "3.12.0", optional = true }
 log = "0.4.16"
-serde = { version = "1.0.219", features = ["derive"] }
-serde_json = "1.0.140"
-fs4 = { version = "0.13.1", optional = true }
+serde = { version = "1.0.136", features = ["derive"] }
+serde_json = "1.0.79"
+fs4 = { version = "0.8.0", optional = true }
 levenshtein_automata = "0.2.1"
 uuid = { version = "1.0.0", features = ["v4", "serde"] }
 crossbeam-channel = "0.5.4"
 rust-stemmers = "1.2.0"
-downcast-rs = "2.0.1"
+downcast-rs = "1.2.1"
 bitpacking = { version = "0.9.2", default-features = false, features = [
    "bitpacker4x",
 ] }
@@ -52,24 +52,21 @@ smallvec = "1.8.0"
 rayon = "1.5.2"
 lru = "0.12.0"
 fastdivide = "0.4.0"
-itertools = "0.14.0"
-measure_time = "0.9.0"
+itertools = "0.13.0"
+measure_time = "0.8.2"
 arc-swap = "1.5.0"
-bon = "3.3.1"

-columnar = { version = "0.6", path = "./columnar", package = "tantivy-columnar" }
-sstable = { version = "0.6", path = "./sstable", package = "tantivy-sstable", optional = true }
-stacker = { version = "0.6", path = "./stacker", package = "tantivy-stacker" }
-query-grammar = { version = "0.25.0", path = "./query-grammar", package = "tantivy-query-grammar" }
-tantivy-bitpacker = { version = "0.9", path = "./bitpacker" }
-common = { version = "0.10", path = "./common/", package = "tantivy-common" }
-tokenizer-api = { version = "0.6", path = "./tokenizer-api", package = "tantivy-tokenizer-api" }
+columnar = { version = "0.3", path = "./columnar", package = "tantivy-columnar" }
+sstable = { version = "0.3", path = "./sstable", package = "tantivy-sstable", optional = true }
+stacker = { version = "0.3", path = "./stacker", package = "tantivy-stacker" }
+query-grammar = { version = "0.22.0", path = "./query-grammar", package = "tantivy-query-grammar" }
+tantivy-bitpacker = { version = "0.6", path = "./bitpacker" }
+common = { version = "0.7", path = "./common/", package = "tantivy-common" }
+tokenizer-api = { version = "0.3", path = "./tokenizer-api", package = "tantivy-tokenizer-api" }
 sketches-ddsketch = { version = "0.3.0", features = ["use_serde"] }
 hyperloglogplus = { version = "0.4.1", features = ["const-loop"] }
 futures-util = { version = "0.3.28", optional = true }
-futures-channel = { version = "0.3.28", optional = true }
 fnv = "1.0.7"
-typetag = "0.2.21"

 [target.'cfg(windows)'.dependencies]
 winapi = "0.3.9"
@@ -88,7 +85,7 @@ more-asserts = "0.3.1"
 rand_distr = "0.4.3"
 time = { version = "0.3.10", features = ["serde-well-known", "macros"] }
 postcard = { version = "1.0.4", features = [
-    "use-std",
+  "use-std",
 ], default-features = false }

 [target.'cfg(not(windows))'.dev-dependencies]
@@ -113,20 +110,17 @@ debug-assertions = true
 overflow-checks = true

 [features]
-default = ["mmap", "stopwords", "lz4-compression", "columnar-zstd-compression"]
+default = ["mmap", "stopwords", "lz4-compression"]
 mmap = ["fs4", "tempfile", "memmap2"]
 stopwords = []

 lz4-compression = ["lz4_flex"]
 zstd-compression = ["zstd"]

-# enable zstd-compression in columnar (and sstable)
-columnar-zstd-compression = ["columnar/zstd-compression"]
-
 failpoints = ["fail", "fail/failpoints"]
 unstable = []                            # useful for benches.

-quickwit = ["sstable", "futures-util", "futures-channel"]
+quickwit = ["sstable", "futures-util"]

 # Compares only the hash of a string when indexing data.
 # Increases indexing speed, but may lead to extremely rare missing terms, when there's a hash collision.
@@ -168,11 +162,3 @@ harness = false
 [[bench]]
 name = "agg_bench"
 harness = false
-
-[[bench]]
-name = "exists_json"
-harness = false
-
-[[bench]]
-name = "and_or_queries"
-harness = false
--- a/README.md
+++ b/README.md
@@ -23,6 +23,8 @@ performance for different types of queries/collections.

 Your mileage WILL vary depending on the nature of queries and their load.

+<img src="doc/assets/images/searchbenchmark.png">
+
 Details about the benchmark can be found at this [repository](https://github.com/quickwit-oss/search-benchmark-game).

 ## Features
--- a/RELEASE.md
+++ b/RELEASE.md
@@ -1,4 +1,4 @@
-# Releasing a new Tantivy Version
+# Release a new Tantivy Version

 ## Steps

@@ -10,29 +10,12 @@
 6. Set git tag with new version


-[`cargo-release`](https://github.com/crate-ci/cargo-release) will help us with steps 1-5:
+In conjucation with `cargo-release` Steps 1-4 (I'm not sure if the change detection works):
+Set new packages to version 0.0.0

 Replace prev-tag-name
 ```bash
-cargo release --workspace --no-publish -v --prev-tag-name 0.24 --push-remote origin minor --no-tag
+cargo release --workspace --no-publish -v --prev-tag-name 0.19 --push-remote origin minor --no-tag --execute
 ```

-`no-tag` or it will create tags for all the subpackages
-
-cargo release will _not_ ignore unchanged packages, but it will print warnings for them.
-e.g. "warning: updating ownedbytes to 0.10.0 despite no changes made since tag 0.24"
-
-We need to manually ignore these unchanged packages
-```bash
-cargo release --workspace --no-publish -v --prev-tag-name 0.24 --push-remote origin minor --no-tag --exclude tokenizer-api
-```
-
-Add `--execute` to actually publish the packages, otherwise it will only print the commands that would be run.
-
-### Tag Version
-```bash
-git tag 0.25.0
-git push upstream tag 0.25.0
-```
-
-
+no-tag or it will create tags for all the subpackages
--- a/benches/agg_bench.rs
+++ b/benches/agg_bench.rs
@@ -71,15 +71,8 @@ fn bench_agg(mut group: InputGroup<Index>) {
    register!(group, histogram);
    register!(group, histogram_hard_bounds);
    register!(group, histogram_with_avg_sub_agg);
-    register!(group, histogram_with_term_agg_few);
    register!(group, avg_and_range_with_avg_sub_agg);

-    // Filter aggregation benchmarks
-    register!(group, filter_agg_all_query_count_agg);
-    register!(group, filter_agg_term_query_count_agg);
-    register!(group, filter_agg_all_query_with_sub_aggs);
-    register!(group, filter_agg_term_query_with_sub_aggs);
-
    group.run();
 }

@@ -346,17 +339,6 @@ fn histogram_with_avg_sub_agg(index: &Index) {
    });
    execute_agg(index, agg_req);
 }
-fn histogram_with_term_agg_few(index: &Index) {
-    let agg_req = json!({
-        "rangef64": {
-            "histogram": { "field": "score_f64", "interval": 10 },
-            "aggs": {
-                "my_texts": { "terms": { "field": "text_few_terms" } }
-            }
-        }
-    });
-    execute_agg(index, agg_req);
-}
 fn avg_and_range_with_avg_sub_agg(index: &Index) {
    let agg_req = json!({
        "rangef64": {
@@ -478,61 +460,3 @@ fn get_test_index_bench(cardinality: Cardinality) -> tantivy::Result<Index> {

    Ok(index)
 }
-
-// Filter aggregation benchmarks
-
-fn filter_agg_all_query_count_agg(index: &Index) {
-    let agg_req = json!({
-        "filtered": {
-            "filter": "*",
-            "aggs": {
-                "count": { "value_count": { "field": "score" } }
-            }
-        }
-    });
-    execute_agg(index, agg_req);
-}
-
-fn filter_agg_term_query_count_agg(index: &Index) {
-    let agg_req = json!({
-        "filtered": {
-            "filter": "text:cool",
-            "aggs": {
-                "count": { "value_count": { "field": "score" } }
-            }
-        }
-    });
-    execute_agg(index, agg_req);
-}
-
-fn filter_agg_all_query_with_sub_aggs(index: &Index) {
-    let agg_req = json!({
-        "filtered": {
-            "filter": "*",
-            "aggs": {
-                "avg_score": { "avg": { "field": "score" } },
-                "stats_score": { "stats": { "field": "score_f64" } },
-                "terms_text": {
-                    "terms": { "field": "text_few_terms" }
-                }
-            }
-        }
-    });
-    execute_agg(index, agg_req);
-}
-
-fn filter_agg_term_query_with_sub_aggs(index: &Index) {
-    let agg_req = json!({
-        "filtered": {
-            "filter": "text:cool",
-            "aggs": {
-                "avg_score": { "avg": { "field": "score" } },
-                "stats_score": { "stats": { "field": "score_f64" } },
-                "terms_text": {
-                    "terms": { "field": "text_few_terms" }
-                }
-            }
-        }
-    });
-    execute_agg(index, agg_req);
-}
--- a/benches/and_or_queries.rs
+++ b/benches/and_or_queries.rs
@@ -1,224 +0,0 @@
-// Benchmarks boolean conjunction queries using binggan.
-//
-// What’s measured:
-// - Or and And queries with varying selectivity (only `Term` queries for now on leafs)
-// - Nested AND/OR combinations (on multiple fields)
-// - No-scoring path using the Count collector (focus on iterator/skip performance)
-// - Top-K retrieval (k=10) using the TopDocs collector
-//
-// Corpus model:
-// - Synthetic docs; each token a/b/c is independently included per doc
-// - If none of a/b/c are included, emit a neutral filler token to keep doc length similar
-//
-// Notes:
-// - After optimization, when scoring is disabled Tantivy reads doc-only postings
-//   (IndexRecordOption::Basic), avoiding frequency decoding overhead.
-// - This bench isolates boolean iteration speed and intersection/union cost.
-// - Use `cargo bench --bench boolean_conjunction` to run.
-
-use binggan::{black_box, BenchRunner};
-use rand::prelude::*;
-use rand::rngs::StdRng;
-use rand::SeedableRng;
-use tantivy::collector::{Count, TopDocs};
-use tantivy::query::QueryParser;
-use tantivy::schema::{Schema, TEXT};
-use tantivy::{doc, Index, ReloadPolicy, Searcher};
-
-#[derive(Clone)]
-struct BenchIndex {
-    #[allow(dead_code)]
-    index: Index,
-    searcher: Searcher,
-    query_parser: QueryParser,
-}
-
-impl BenchIndex {
-    #[inline(always)]
-    fn count_query(&self, query_str: &str) -> usize {
-        let query = self.query_parser.parse_query(query_str).unwrap();
-        self.searcher.search(&query, &Count).unwrap()
-    }
-
-    #[inline(always)]
-    fn topk_len(&self, query_str: &str, k: usize) -> usize {
-        let query = self.query_parser.parse_query(query_str).unwrap();
-        self.searcher
-            .search(&query, &TopDocs::with_limit(k))
-            .unwrap()
-            .len()
-    }
-}
-
-/// Build a single index containing both fields (title, body) and
-/// return two BenchIndex views:
-/// - single_field: QueryParser defaults to only "body"
-/// - multi_field:  QueryParser defaults to ["title", "body"]
-fn build_shared_indices(num_docs: usize, p_a: f32, p_b: f32, p_c: f32) -> (BenchIndex, BenchIndex) {
-    // Unified schema (two text fields)
-    let mut schema_builder = Schema::builder();
-    let f_title = schema_builder.add_text_field("title", TEXT);
-    let f_body = schema_builder.add_text_field("body", TEXT);
-    let schema = schema_builder.build();
-    let index = Index::create_in_ram(schema.clone());
-
-    // Populate index with stable RNG for reproducibility.
-    let mut rng = StdRng::from_seed([7u8; 32]);
-
-    // Populate: spread each present token 90/10 to body/title
-    {
-        let mut writer = index.writer(500_000_000).unwrap();
-        for _ in 0..num_docs {
-            let has_a = rng.gen_bool(p_a as f64);
-            let has_b = rng.gen_bool(p_b as f64);
-            let has_c = rng.gen_bool(p_c as f64);
-            let mut title_tokens: Vec<&str> = Vec::new();
-            let mut body_tokens: Vec<&str> = Vec::new();
-            if has_a {
-                if rng.gen_bool(0.1) {
-                    title_tokens.push("a");
-                } else {
-                    body_tokens.push("a");
-                }
-            }
-            if has_b {
-                if rng.gen_bool(0.1) {
-                    title_tokens.push("b");
-                } else {
-                    body_tokens.push("b");
-                }
-            }
-            if has_c {
-                if rng.gen_bool(0.1) {
-                    title_tokens.push("c");
-                } else {
-                    body_tokens.push("c");
-                }
-            }
-            if title_tokens.is_empty() && body_tokens.is_empty() {
-                body_tokens.push("z");
-            }
-            writer
-                .add_document(doc!(
-                    f_title=>title_tokens.join(" "),
-                    f_body=>body_tokens.join(" ")
-                ))
-                .unwrap();
-        }
-        writer.commit().unwrap();
-    }
-
-    // Prepare reader/searcher once.
-    let reader = index
-        .reader_builder()
-        .reload_policy(ReloadPolicy::Manual)
-        .try_into()
-        .unwrap();
-    let searcher = reader.searcher();
-
-    // Build two query parsers with different default fields.
-    let qp_single = QueryParser::for_index(&index, vec![f_body]);
-    let qp_multi = QueryParser::for_index(&index, vec![f_title, f_body]);
-
-    let single_view = BenchIndex {
-        index: index.clone(),
-        searcher: searcher.clone(),
-        query_parser: qp_single,
-    };
-    let multi_view = BenchIndex {
-        index,
-        searcher,
-        query_parser: qp_multi,
-    };
-    (single_view, multi_view)
-}
-
-fn main() {
-    // Prepare corpora with varying selectivity. Build one index per corpus
-    // and derive two views (single-field vs multi-field) from it.
-    let scenarios = vec![
-        (
-            "N=1M, p(a)=5%, p(b)=1%, p(c)=15%".to_string(),
-            1_000_000,
-            0.05,
-            0.01,
-            0.15,
-        ),
-        (
-            "N=1M, p(a)=1%, p(b)=1%, p(c)=15%".to_string(),
-            1_000_000,
-            0.01,
-            0.01,
-            0.15,
-        ),
-    ];
-
-    let mut runner = BenchRunner::new();
-    for (label, n, pa, pb, pc) in scenarios {
-        let (single_view, multi_view) = build_shared_indices(n, pa, pb, pc);
-
-        // Single-field group: default field is body only
-        {
-            let mut group = runner.new_group();
-            group.set_name(format!("single_field — {}", label));
-            group.register_with_input("+a_+b_count", &single_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("+a +b"))
-            });
-            group.register_with_input("+a_+b_+c_count", &single_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("+a +b +c"))
-            });
-            group.register_with_input("+a_+b_top10", &single_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("+a +b", 10))
-            });
-            group.register_with_input("+a_+b_+c_top10", &single_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("+a +b +c", 10))
-            });
-            // OR queries
-            group.register_with_input("a_OR_b_count", &single_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("a OR b"))
-            });
-            group.register_with_input("a_OR_b_OR_c_count", &single_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("a OR b OR c"))
-            });
-            group.register_with_input("a_OR_b_top10", &single_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("a OR b", 10))
-            });
-            group.register_with_input("a_OR_b_OR_c_top10", &single_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("a OR b OR c", 10))
-            });
-            group.run();
-        }
-
-        // Multi-field group: default fields are [title, body]
-        {
-            let mut group = runner.new_group();
-            group.set_name(format!("multi_field — {}", label));
-            group.register_with_input("+a_+b_count", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("+a +b"))
-            });
-            group.register_with_input("+a_+b_+c_count", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("+a +b +c"))
-            });
-            group.register_with_input("+a_+b_top10", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("+a +b", 10))
-            });
-            group.register_with_input("+a_+b_+c_top10", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("+a +b +c", 10))
-            });
-            // OR queries
-            group.register_with_input("a_OR_b_count", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("a OR b"))
-            });
-            group.register_with_input("a_OR_b_OR_c_count", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.count_query("a OR b OR c"))
-            });
-            group.register_with_input("a_OR_b_top10", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("a OR b", 10))
-            });
-            group.register_with_input("a_OR_b_OR_c_top10", &multi_view, |benv: &BenchIndex| {
-                black_box(benv.topk_len("a OR b OR c", 10))
-            });
-            group.run();
-        }
-    }
-}
--- a/benches/exists_json.rs
+++ b/benches/exists_json.rs
@@ -1,69 +0,0 @@
-use binggan::plugins::PeakMemAllocPlugin;
-use binggan::{black_box, InputGroup, PeakMemAlloc, INSTRUMENTED_SYSTEM};
-use serde_json::json;
-use tantivy::collector::Count;
-use tantivy::query::ExistsQuery;
-use tantivy::schema::{Schema, FAST, TEXT};
-use tantivy::{doc, Index};
-
-#[global_allocator]
-pub static GLOBAL: &PeakMemAlloc<std::alloc::System> = &INSTRUMENTED_SYSTEM;
-
-fn main() {
-    let doc_count: usize = 500_000;
-    let subfield_counts: &[usize] = &[1, 2, 3, 4, 5, 6, 7, 8, 16, 256, 4096, 65536, 262144];
-
-    let indices: Vec<(String, Index)> = subfield_counts
-        .iter()
-        .map(|&sub_fields| {
-            (
-                format!("subfields={sub_fields}"),
-                build_index_with_json_subfields(doc_count, sub_fields),
-            )
-        })
-        .collect();
-
-    let mut group = InputGroup::new_with_inputs(indices);
-    group.add_plugin(PeakMemAllocPlugin::new(GLOBAL));
-
-    group.config().num_iter_group = Some(1);
-    group.config().num_iter_bench = Some(1);
-    group.register("exists_json", exists_json_union);
-
-    group.run();
-}
-
-fn exists_json_union(index: &Index) {
-    let reader = index.reader().expect("reader");
-    let searcher = reader.searcher();
-    let query = ExistsQuery::new("json".to_string(), true);
-    let count = searcher.search(&query, &Count).expect("exists search");
-    // Prevents optimizer from eliding the search
-    black_box(count);
-}
-
-fn build_index_with_json_subfields(num_docs: usize, num_subfields: usize) -> Index {
-    // Schema: single JSON field stored as FAST to support ExistsQuery.
-    let mut schema_builder = Schema::builder();
-    let json_field = schema_builder.add_json_field("json", TEXT | FAST);
-    let schema = schema_builder.build();
-
-    let index = Index::create_from_tempdir(schema).expect("create index");
-    {
-        let mut index_writer = index
-            .writer_with_num_threads(1, 200_000_000)
-            .expect("writer");
-        for i in 0..num_docs {
-            let sub = i % num_subfields;
-            // Only one subpath set per document; rotate subpaths so that
-            // no single subpath is full, but the union covers all docs.
-            let v = json!({ format!("field_{sub}"): i as u64 });
-            index_writer
-                .add_document(doc!(json_field => v))
-                .expect("add_document");
-        }
-        index_writer.commit().expect("commit");
-    }
-
-    index
-}
--- a/bitpacker/Cargo.toml
+++ b/bitpacker/Cargo.toml
@@ -1,7 +1,7 @@
 [package]
 name = "tantivy-bitpacker"
-version = "0.9.0"
-edition = "2024"
+version = "0.6.0"
+edition = "2021"
 authors = ["Paul Masurel <paul.masurel@gmail.com>"]
 license = "MIT"
 categories = []
--- a/bitpacker/src/bitpacker.rs
+++ b/bitpacker/src/bitpacker.rs
@@ -48,7 +48,7 @@ impl BitPacker {

    pub fn flush<TWrite: io::Write + ?Sized>(&mut self, output: &mut TWrite) -> io::Result<()> {
        if self.mini_buffer_written > 0 {
-            let num_bytes = self.mini_buffer_written.div_ceil(8);
+            let num_bytes = (self.mini_buffer_written + 7) / 8;
            let bytes = self.mini_buffer.to_le_bytes();
            output.write_all(&bytes[..num_bytes])?;
            self.mini_buffer_written = 0;
@@ -65,7 +65,7 @@ impl BitPacker {

 #[derive(Clone, Debug, Default, Copy)]
 pub struct BitUnpacker {
-    num_bits: usize,
+    num_bits: u32,
    mask: u64,
 }

@@ -83,7 +83,7 @@ impl BitUnpacker {
            (1u64 << num_bits) - 1u64
        };
        BitUnpacker {
-            num_bits: usize::from(num_bits),
+            num_bits: u32::from(num_bits),
            mask,
        }
    }
@@ -94,14 +94,14 @@ impl BitUnpacker {

    #[inline]
    pub fn get(&self, idx: u32, data: &[u8]) -> u64 {
-        let addr_in_bits = idx as usize * self.num_bits;
-        let addr = addr_in_bits >> 3;
+        let addr_in_bits = idx * self.num_bits;
+        let addr = (addr_in_bits >> 3) as usize;
        if addr + 8 > data.len() {
            if self.num_bits == 0 {
                return 0;
            }
            let bit_shift = addr_in_bits & 7;
-            return self.get_slow_path(addr, bit_shift as u32, data);
+            return self.get_slow_path(addr, bit_shift, data);
        }
        let bit_shift = addr_in_bits & 7;
        let bytes: [u8; 8] = (&data[addr..addr + 8]).try_into().unwrap();
@@ -134,13 +134,12 @@ impl BitUnpacker {
            "Bitwidth must be <= 32 to use this method."
        );

-        let end_idx: u32 = start_idx + output.len() as u32;
+        let end_idx = start_idx + output.len() as u32;

-        // We use `usize` here to avoid overflow issues.
-        let end_bit_read = (end_idx as usize) * self.num_bits;
-        let end_byte_read = end_bit_read.div_ceil(8);
+        let end_bit_read = end_idx * self.num_bits;
+        let end_byte_read = (end_bit_read + 7) / 8;
        assert!(
-            end_byte_read <= data.len(),
+            end_byte_read as usize <= data.len(),
            "Requested index is out of bounds."
        );

@@ -160,24 +159,24 @@ impl BitUnpacker {
        // We want the start of the fast track to start align with bytes.
        // A sufficient condition is to start with an idx that is a multiple of 8,
        // so highway start is the closest multiple of 8 that is >= start_idx.
-        let entrance_ramp_len: u32 = 8 - (start_idx % 8) % 8;
+        let entrance_ramp_len = 8 - (start_idx % 8) % 8;

        let highway_start: u32 = start_idx + entrance_ramp_len;

-        if highway_start + (BitPacker1x::BLOCK_LEN as u32) > end_idx {
+        if highway_start + BitPacker1x::BLOCK_LEN as u32 > end_idx {
            // We don't have enough values to have even a single block of highway.
            // Let's just supply the values the simple way.
            get_batch_ramp(start_idx, output);
            return;
        }

-        let num_blocks: usize = (end_idx - highway_start) as usize / BitPacker1x::BLOCK_LEN;
+        let num_blocks: u32 = (end_idx - highway_start) / BitPacker1x::BLOCK_LEN as u32;

        // Entrance ramp
        get_batch_ramp(start_idx, &mut output[..entrance_ramp_len as usize]);

        // Highway
-        let mut offset = (highway_start as usize * self.num_bits) / 8;
+        let mut offset = (highway_start * self.num_bits) as usize / 8;
        let mut output_cursor = (highway_start - start_idx) as usize;
        for _ in 0..num_blocks {
            offset += BitPacker1x.decompress(
@@ -189,7 +188,7 @@ impl BitUnpacker {
        }

        // Exit ramp
-        let highway_end: u32 = highway_start + (num_blocks * BitPacker1x::BLOCK_LEN) as u32;
+        let highway_end = highway_start + num_blocks * BitPacker1x::BLOCK_LEN as u32;
        get_batch_ramp(highway_end, &mut output[output_cursor..]);
    }

--- a/bitpacker/src/blocked_bitpacker.rs
+++ b/bitpacker/src/blocked_bitpacker.rs
@@ -1,6 +1,6 @@
 use super::bitpacker::BitPacker;
 use super::compute_num_bits;
-use crate::{BitUnpacker, minmax};
+use crate::{minmax, BitUnpacker};

 const BLOCK_SIZE: usize = 128;

@@ -34,7 +34,7 @@ struct BlockedBitpackerEntryMetaData {

 impl BlockedBitpackerEntryMetaData {
    fn new(offset: u64, num_bits: u8, base_value: u64) -> Self {
-        let encoded = offset | (u64::from(num_bits) << (64 - 8));
+        let encoded = offset | (num_bits as u64) << (64 - 8);
        Self {
            encoded,
            base_value,
@@ -140,10 +140,10 @@ impl BlockedBitpacker {
    pub fn iter(&self) -> impl Iterator<Item = u64> + '_ {
        // todo performance: we could decompress a whole block and cache it instead
        let bitpacked_elems = self.offset_and_bits.len() * BLOCK_SIZE;
-
-        (0..bitpacked_elems)
+        let iter = (0..bitpacked_elems)
            .map(move |idx| self.get(idx))
-            .chain(self.buffer.iter().cloned())
+            .chain(self.buffer.iter().cloned());
+        iter
    }
 }

--- a/bitpacker/src/lib.rs
+++ b/bitpacker/src/lib.rs
@@ -33,7 +33,11 @@ pub use crate::blocked_bitpacker::BlockedBitpacker;
 /// number of bits.
 pub fn compute_num_bits(n: u64) -> u8 {
    let amplitude = (64u32 - n.leading_zeros()) as u8;
-    if amplitude <= 64 - 8 { amplitude } else { 64 }
+    if amplitude <= 64 - 8 {
+        amplitude
+    } else {
+        64
+    }
 }

 /// Computes the (min, max) of an iterator of `PartialOrd` values.
--- a/cliff.toml
+++ b/cliff.toml
@@ -16,14 +16,14 @@ body = """

 {%- if version %} in {{ version }}{%- endif -%}
 {% for commit in commits %}
-  {% if commit.remote.pr_title -%}
-    {%- set commit_message = commit.remote.pr_title -%}
+  {% if commit.github.pr_title -%}
+    {%- set commit_message = commit.github.pr_title -%}
  {%- else -%}
    {%- set commit_message = commit.message -%}
  {%- endif -%}
  - {{ commit_message | split(pat="\n") | first | trim }}\
-    {% if commit.remote.pr_number %} \
-      [#{{ commit.remote.pr_number }}]({{ self::remote_url() }}/pull/{{ commit.remote.pr_number }}){% if commit.remote.username %}(@{{ commit.remote.username }}){%- endif -%} \
+    {% if commit.github.pr_number %} \
+      [#{{ commit.github.pr_number }}]({{ self::remote_url() }}/pull/{{ commit.github.pr_number }}){% if commit.github.username %}(@{{ commit.github.username }}){%- endif -%} \
    {%- endif %}
 {%- endfor -%}

--- a/columnar/Cargo.toml
+++ b/columnar/Cargo.toml
@@ -1,7 +1,7 @@
 [package]
 name = "tantivy-columnar"
-version = "0.6.0"
-edition = "2024"
+version = "0.3.0"
+edition = "2021"
 license = "MIT"
 homepage = "https://github.com/quickwit-oss/tantivy"
 repository = "https://github.com/quickwit-oss/tantivy"
@@ -9,15 +9,15 @@ description = "column oriented storage for tantivy"
 categories = ["database-implementations", "data-structures", "compression"]

 [dependencies]
-itertools = "0.14.0"
+itertools = "0.13.0"
 fastdivide = "0.4.0"

-stacker = { version= "0.6", path = "../stacker", package="tantivy-stacker"}
-sstable = { version= "0.6", path = "../sstable", package = "tantivy-sstable" }
-common = { version= "0.10", path = "../common", package = "tantivy-common" }
-tantivy-bitpacker = { version= "0.9", path = "../bitpacker/" }
+stacker = { version= "0.3", path = "../stacker", package="tantivy-stacker"}
+sstable = { version= "0.3", path = "../sstable", package = "tantivy-sstable" }
+common = { version= "0.7", path = "../common", package = "tantivy-common" }
+tantivy-bitpacker = { version= "0.6", path = "../bitpacker/" }
 serde = "1.0.152"
-downcast-rs = "2.0.1"
+downcast-rs = "1.2.0"

 [dev-dependencies]
 proptest = "1"
@@ -33,29 +33,6 @@ harness = false
 name = "bench_access"
 harness = false

-[[bench]]
-name = "bench_first_vals"
-harness = false
-
-[[bench]]
-name = "bench_values_u64"
-harness = false
-
-[[bench]]
-name = "bench_values_u128"
-harness = false
-
-[[bench]]
-name = "bench_create_column_values"
-harness = false
-
-[[bench]]
-name = "bench_column_values_get"
-harness = false
-
-[[bench]]
-name = "bench_optional_index"
-harness = false

 [features]
-zstd-compression = ["sstable/zstd-compression"]
+unstable = []
--- a/columnar/benches/bench_access.rs
+++ b/columnar/benches/bench_access.rs
@@ -1,4 +1,4 @@
-use binggan::{InputGroup, black_box};
+use binggan::{black_box, InputGroup};
 use common::*;
 use tantivy_columnar::Column;

@@ -19,7 +19,7 @@ fn main() {

    let mut add_card = |card1: Card| {
        inputs.push((
-            card1.to_string(),
+            format!("{card1}"),
            generate_columnar_and_open(card1, NUM_DOCS),
        ));
    };
@@ -50,7 +50,6 @@ fn bench_group(mut runner: InputGroup<Column>) {
        let mut buffer = vec![None; BLOCK_SIZE];
        for i in (0..NUM_DOCS).step_by(BLOCK_SIZE) {
            // fill docs
-            #[allow(clippy::needless_range_loop)]
            for idx in 0..BLOCK_SIZE {
                docs[idx] = idx as u32 + i;
            }
--- a/columnar/benches/bench_column_values_get.rs
+++ b/columnar/benches/bench_column_values_get.rs
@@ -1,61 +0,0 @@
-use std::sync::Arc;
-
-use binggan::{InputGroup, black_box};
-use rand::rngs::StdRng;
-use rand::{Rng, SeedableRng};
-use tantivy_columnar::ColumnValues;
-use tantivy_columnar::column_values::{CodecType, serialize_and_load_u64_based_column_values};
-
-fn get_data() -> Vec<u64> {
-    let mut rng = StdRng::seed_from_u64(2u64);
-    let mut data: Vec<_> = (100..55_000_u64)
-        .map(|num| num + rng.r#gen::<u8>() as u64)
-        .collect();
-    data.push(99_000);
-    data.insert(1000, 2000);
-    data.insert(2000, 100);
-    data.insert(3000, 4100);
-    data.insert(4000, 100);
-    data.insert(5000, 800);
-    data
-}
-
-#[inline(never)]
-fn value_iter() -> impl Iterator<Item = u64> {
-    0..20_000
-}
-
-type Col = Arc<dyn ColumnValues<u64>>;
-
-fn main() {
-    let data = get_data();
-    let inputs: Vec<(String, Col)> = vec![
-        (
-            "bitpacked".to_string(),
-            serialize_and_load_u64_based_column_values(&data.as_slice(), &[CodecType::Bitpacked]),
-        ),
-        (
-            "linear".to_string(),
-            serialize_and_load_u64_based_column_values(&data.as_slice(), &[CodecType::Linear]),
-        ),
-        (
-            "blockwise_linear".to_string(),
-            serialize_and_load_u64_based_column_values(
-                &data.as_slice(),
-                &[CodecType::BlockwiseLinear],
-            ),
-        ),
-    ];
-
-    let mut group: InputGroup<Col> = InputGroup::new_with_inputs(inputs);
-
-    group.register("fastfield_get", |col: &Col| {
-        let mut sum = 0u64;
-        for pos in value_iter() {
-            sum = sum.wrapping_add(col.get_val(pos as u32));
-        }
-        black_box(sum);
-    });
-
-    group.run();
-}
--- a/columnar/benches/bench_create_column_values.rs
+++ b/columnar/benches/bench_create_column_values.rs
@@ -1,44 +0,0 @@
-use binggan::{InputGroup, black_box};
-use rand::rngs::StdRng;
-use rand::{Rng, SeedableRng};
-use tantivy_columnar::column_values::{CodecType, serialize_u64_based_column_values};
-
-fn get_data() -> Vec<u64> {
-    let mut rng = StdRng::seed_from_u64(2u64);
-    let mut data: Vec<_> = (100..55_000_u64)
-        .map(|num| num + rng.r#gen::<u8>() as u64)
-        .collect();
-    data.push(99_000);
-    data.insert(1000, 2000);
-    data.insert(2000, 100);
-    data.insert(3000, 4100);
-    data.insert(4000, 100);
-    data.insert(5000, 800);
-    data
-}
-
-fn main() {
-    let data = get_data();
-    let mut group: InputGroup<(CodecType, Vec<u64>)> = InputGroup::new_with_inputs(vec![
-        (
-            "bitpacked codec".to_string(),
-            (CodecType::Bitpacked, data.clone()),
-        ),
-        (
-            "linear codec".to_string(),
-            (CodecType::Linear, data.clone()),
-        ),
-        (
-            "blockwise linear codec".to_string(),
-            (CodecType::BlockwiseLinear, data.clone()),
-        ),
-    ]);
-
-    group.register("serialize column_values", |data| {
-        let mut buffer = Vec::new();
-        serialize_u64_based_column_values(&data.1.as_slice(), &[data.0], &mut buffer).unwrap();
-        black_box(buffer.len());
-    });
-
-    group.run();
-}
--- a/columnar/benches/bench_first_vals.rs
+++ b/columnar/benches/bench_first_vals.rs
@@ -1,9 +1,12 @@
+#![feature(test)]
+extern crate test;
+
 use std::sync::Arc;

-use binggan::{InputGroup, black_box};
 use rand::prelude::*;
-use tantivy_columnar::column_values::{CodecType, serialize_and_load_u64_based_column_values};
+use tantivy_columnar::column_values::{serialize_and_load_u64_based_column_values, CodecType};
 use tantivy_columnar::*;
+use test::{black_box, Bencher};

 struct Columns {
    pub optional: Column,
@@ -65,45 +68,88 @@ pub fn serialize_and_load(column: &[u64], codec_type: CodecType) -> Arc<dyn Colu
    serialize_and_load_u64_based_column_values(&column, &[codec_type])
 }

-fn main() {
-    let Columns {
-        optional,
-        full,
-        multi,
-    } = get_test_columns();
-
-    let inputs = vec![
-        ("full".to_string(), full),
-        ("optional".to_string(), optional),
-        ("multi".to_string(), multi),
-    ];
-
-    let mut group = InputGroup::new_with_inputs(inputs);
-
-    group.register("first_full_scan", |column| {
+fn run_bench_on_column_full_scan(b: &mut Bencher, column: Column) {
+    let num_iter = black_box(NUM_VALUES);
+    b.iter(|| {
        let mut sum = 0u64;
-        for i in 0..NUM_VALUES as u32 {
+        for i in 0..num_iter as u32 {
            let val = column.first(i);
            sum += val.unwrap_or(0);
        }
-        black_box(sum);
+        sum
    });
-
-    group.register("first_block_fetch", |column| {
-        let mut block: Vec<Option<u64>> = vec![None; 64];
-        let fetch_docids = (0..64).collect::<Vec<_>>();
+}
+fn run_bench_on_column_block_fetch(b: &mut Bencher, column: Column) {
+    let mut block: Vec<Option<u64>> = vec![None; 64];
+    let fetch_docids = (0..64).collect::<Vec<_>>();
+    b.iter(move || {
        column.first_vals(&fetch_docids, &mut block);
-        black_box(block[0]);
+        block[0]
    });
-
-    group.register("first_block_single_calls", |column| {
-        let mut block: Vec<Option<u64>> = vec![None; 64];
-        let fetch_docids = (0..64).collect::<Vec<_>>();
+}
+fn run_bench_on_column_block_single_calls(b: &mut Bencher, column: Column) {
+    let mut block: Vec<Option<u64>> = vec![None; 64];
+    let fetch_docids = (0..64).collect::<Vec<_>>();
+    b.iter(move || {
        for i in 0..fetch_docids.len() {
            block[i] = column.first(fetch_docids[i]);
        }
-        black_box(block[0]);
+        block[0]
    });
-
-    group.run();
+}
+
+/// Column first method
+#[bench]
+fn bench_get_first_on_full_column_full_scan(b: &mut Bencher) {
+    let column = get_test_columns().full;
+    run_bench_on_column_full_scan(b, column);
+}
+
+#[bench]
+fn bench_get_first_on_optional_column_full_scan(b: &mut Bencher) {
+    let column = get_test_columns().optional;
+    run_bench_on_column_full_scan(b, column);
+}
+
+#[bench]
+fn bench_get_first_on_multi_column_full_scan(b: &mut Bencher) {
+    let column = get_test_columns().multi;
+    run_bench_on_column_full_scan(b, column);
+}
+
+/// Block fetch column accessor
+#[bench]
+fn bench_get_block_first_on_optional_column(b: &mut Bencher) {
+    let column = get_test_columns().optional;
+    run_bench_on_column_block_fetch(b, column);
+}
+
+#[bench]
+fn bench_get_block_first_on_multi_column(b: &mut Bencher) {
+    let column = get_test_columns().multi;
+    run_bench_on_column_block_fetch(b, column);
+}
+
+#[bench]
+fn bench_get_block_first_on_full_column(b: &mut Bencher) {
+    let column = get_test_columns().full;
+    run_bench_on_column_block_fetch(b, column);
+}
+
+#[bench]
+fn bench_get_block_first_on_optional_column_single_calls(b: &mut Bencher) {
+    let column = get_test_columns().optional;
+    run_bench_on_column_block_single_calls(b, column);
+}
+
+#[bench]
+fn bench_get_block_first_on_multi_column_single_calls(b: &mut Bencher) {
+    let column = get_test_columns().multi;
+    run_bench_on_column_block_single_calls(b, column);
+}
+
+#[bench]
+fn bench_get_block_first_on_full_column_single_calls(b: &mut Bencher) {
+    let column = get_test_columns().full;
+    run_bench_on_column_block_single_calls(b, column);
 }
--- a/columnar/benches/bench_merge.rs
+++ b/columnar/benches/bench_merge.rs
@@ -1,7 +1,7 @@
 pub mod common;

 use binggan::BenchRunner;
-use common::{Card, generate_columnar_with_name};
+use common::{generate_columnar_with_name, Card};
 use tantivy_columnar::*;

 const NUM_DOCS: u32 = 100_000;
--- a/columnar/benches/bench_optional_index.rs
+++ b/columnar/benches/bench_optional_index.rs
@@ -1,106 +0,0 @@
-use binggan::{InputGroup, black_box};
-use rand::rngs::StdRng;
-use rand::{Rng, SeedableRng};
-use tantivy_columnar::column_index::{OptionalIndex, Set};
-
-const TOTAL_NUM_VALUES: u32 = 1_000_000;
-
-fn gen_optional_index(fill_ratio: f64) -> OptionalIndex {
-    let mut rng: StdRng = StdRng::from_seed([1u8; 32]);
-    let vals: Vec<u32> = (0..TOTAL_NUM_VALUES)
-        .map(|_| rng.gen_bool(fill_ratio))
-        .enumerate()
-        .filter(|(_pos, val)| *val)
-        .map(|(pos, _)| pos as u32)
-        .collect();
-    OptionalIndex::for_test(TOTAL_NUM_VALUES, &vals)
-}
-
-fn random_range_iterator(
-    start: u32,
-    end: u32,
-    avg_step_size: u32,
-    avg_deviation: u32,
-) -> impl Iterator<Item = u32> {
-    let mut rng: StdRng = StdRng::from_seed([1u8; 32]);
-    let mut current = start;
-    std::iter::from_fn(move || {
-        current += rng.gen_range(avg_step_size - avg_deviation..=avg_step_size + avg_deviation);
-        if current >= end { None } else { Some(current) }
-    })
-}
-
-fn n_percent_step_iterator(percent: f32, num_values: u32) -> impl Iterator<Item = u32> {
-    let ratio = percent / 100.0;
-    let step_size = (1f32 / ratio) as u32;
-    let deviation = step_size - 1;
-    random_range_iterator(0, num_values, step_size, deviation)
-}
-
-fn walk_over_data(codec: &OptionalIndex, avg_step_size: u32) -> Option<u32> {
-    walk_over_data_from_positions(
-        codec,
-        random_range_iterator(0, TOTAL_NUM_VALUES, avg_step_size, 0),
-    )
-}
-
-fn walk_over_data_from_positions(
-    codec: &OptionalIndex,
-    positions: impl Iterator<Item = u32>,
-) -> Option<u32> {
-    let mut dense_idx: Option<u32> = None;
-    for idx in positions {
-        dense_idx = dense_idx.or(codec.rank_if_exists(idx));
-    }
-    dense_idx
-}
-
-fn main() {
-    // Build separate inputs for each fill ratio.
-    let inputs: Vec<(String, OptionalIndex)> = vec![
-        ("fill=1%".to_string(), gen_optional_index(0.01)),
-        ("fill=5%".to_string(), gen_optional_index(0.05)),
-        ("fill=10%".to_string(), gen_optional_index(0.10)),
-        ("fill=50%".to_string(), gen_optional_index(0.50)),
-        ("fill=90%".to_string(), gen_optional_index(0.90)),
-    ];
-
-    let mut group: InputGroup<OptionalIndex> = InputGroup::new_with_inputs(inputs);
-
-    // Translate orig->codec (rank_if_exists) with sampling
-    group.register("orig_to_codec_10pct_hit", |codec: &OptionalIndex| {
-        black_box(walk_over_data(codec, 100));
-    });
-    group.register("orig_to_codec_1pct_hit", |codec: &OptionalIndex| {
-        black_box(walk_over_data(codec, 1000));
-    });
-    group.register("orig_to_codec_full_scan", |codec: &OptionalIndex| {
-        black_box(walk_over_data_from_positions(codec, 0..TOTAL_NUM_VALUES));
-    });
-
-    // Translate codec->orig (select/select_batch) on sampled ranks
-    fn bench_translate_codec_to_orig_util(codec: &OptionalIndex, percent_hit: f32) {
-        let num_non_nulls = codec.num_non_nulls();
-        let idxs: Vec<u32> = if percent_hit == 100.0f32 {
-            (0..num_non_nulls).collect()
-        } else {
-            n_percent_step_iterator(percent_hit, num_non_nulls).collect()
-        };
-        let mut output = vec![0u32; idxs.len()];
-        output.copy_from_slice(&idxs[..]);
-        codec.select_batch(&mut output);
-        black_box(output);
-    }
-
-    group.register("codec_to_orig_0.005pct_hit", |codec: &OptionalIndex| {
-        bench_translate_codec_to_orig_util(codec, 0.005);
-    });
-    group.register("codec_to_orig_10pct_hit", |codec: &OptionalIndex| {
-        bench_translate_codec_to_orig_util(codec, 10.0);
-    });
-    group.register("codec_to_orig_full_scan", |codec: &OptionalIndex| {
-        bench_translate_codec_to_orig_util(codec, 100.0);
-    });
-
-    group.run();
-}
--- a/columnar/benches/bench_values_u128.rs
+++ b/columnar/benches/bench_values_u128.rs
@@ -1,12 +1,15 @@
+#![feature(test)]
+
 use std::ops::RangeInclusive;
 use std::sync::Arc;

-use binggan::{InputGroup, black_box};
 use common::OwnedBytes;
 use rand::rngs::StdRng;
 use rand::seq::SliceRandom;
-use rand::{Rng, SeedableRng, random};
+use rand::{random, Rng, SeedableRng};
 use tantivy_columnar::ColumnValues;
+use test::Bencher;
+extern crate test;

 // TODO does this make sense for IPv6 ?
 fn generate_random() -> Vec<u64> {
@@ -44,77 +47,78 @@ fn get_data_50percent_item() -> Vec<u128> {
    }
    data.push(SINGLE_ITEM);
    data.shuffle(&mut rng);
-    data.iter().map(|el| *el as u128).collect::<Vec<_>>()
+    let data = data.iter().map(|el| *el as u128).collect::<Vec<_>>();
+    data
 }

-fn main() {
+#[bench]
+fn bench_intfastfield_getrange_u128_50percent_hit(b: &mut Bencher) {
    let data = get_data_50percent_item();
-    let column_range = get_u128_column_from_data(&data);
-    let column_random = get_u128_column_random();
+    let column = get_u128_column_from_data(&data);

-    struct Inputs {
-        data: Vec<u128>,
-        column_range: Arc<dyn ColumnValues<u128>>,
-        column_random: Arc<dyn ColumnValues<u128>>,
-    }
-
-    let inputs = Inputs {
-        data,
-        column_range,
-        column_random,
-    };
-    let mut group: InputGroup<Inputs> =
-        InputGroup::new_with_inputs(vec![("u128 benches".to_string(), inputs)]);
-
-    group.register(
-        "intfastfield_getrange_u128_50percent_hit",
-        |inp: &Inputs| {
-            let mut positions = Vec::new();
-            inp.column_range.get_row_ids_for_value_range(
-                *FIFTY_PERCENT_RANGE.start() as u128..=*FIFTY_PERCENT_RANGE.end() as u128,
-                0..inp.data.len() as u32,
-                &mut positions,
-            );
-            black_box(positions.len());
-        },
-    );
-
-    group.register("intfastfield_getrange_u128_single_hit", |inp: &Inputs| {
+    b.iter(|| {
        let mut positions = Vec::new();
-        inp.column_range.get_row_ids_for_value_range(
+        column.get_row_ids_for_value_range(
+            *FIFTY_PERCENT_RANGE.start() as u128..=*FIFTY_PERCENT_RANGE.end() as u128,
+            0..data.len() as u32,
+            &mut positions,
+        );
+        positions
+    });
+}
+
+#[bench]
+fn bench_intfastfield_getrange_u128_single_hit(b: &mut Bencher) {
+    let data = get_data_50percent_item();
+    let column = get_u128_column_from_data(&data);
+
+    b.iter(|| {
+        let mut positions = Vec::new();
+        column.get_row_ids_for_value_range(
            *SINGLE_ITEM_RANGE.start() as u128..=*SINGLE_ITEM_RANGE.end() as u128,
-            0..inp.data.len() as u32,
+            0..data.len() as u32,
            &mut positions,
        );
-        black_box(positions.len());
+        positions
    });
+}

-    group.register("intfastfield_getrange_u128_hit_all", |inp: &Inputs| {
+#[bench]
+fn bench_intfastfield_getrange_u128_hit_all(b: &mut Bencher) {
+    let data = get_data_50percent_item();
+    let column = get_u128_column_from_data(&data);
+
+    b.iter(|| {
        let mut positions = Vec::new();
-        inp.column_range.get_row_ids_for_value_range(
-            0..=u128::MAX,
-            0..inp.data.len() as u32,
-            &mut positions,
-        );
-        black_box(positions.len());
+        column.get_row_ids_for_value_range(0..=u128::MAX, 0..data.len() as u32, &mut positions);
+        positions
    });
+}
+// U128 RANGE END

-    group.register("intfastfield_scan_all_fflookup_u128", |inp: &Inputs| {
+#[bench]
+fn bench_intfastfield_scan_all_fflookup_u128(b: &mut Bencher) {
+    let column = get_u128_column_random();
+
+    b.iter(|| {
        let mut a = 0u128;
-        for i in 0u64..inp.column_random.num_vals() as u64 {
-            a += inp.column_random.get_val(i as u32);
+        for i in 0u64..column.num_vals() as u64 {
+            a += column.get_val(i as u32);
        }
-        black_box(a);
+        a
    });
+}

-    group.register("intfastfield_jumpy_stride5_u128", |inp: &Inputs| {
-        let n = inp.column_random.num_vals();
+#[bench]
+fn bench_intfastfield_jumpy_stride5_u128(b: &mut Bencher) {
+    let column = get_u128_column_random();
+
+    b.iter(|| {
+        let n = column.num_vals();
        let mut a = 0u128;
        for i in (0..n / 5).map(|val| val * 5) {
-            a += inp.column_random.get_val(i);
+            a += column.get_val(i);
        }
-        black_box(a);
+        a
    });
-
-    group.run();
 }
--- a/columnar/benches/bench_values_u64.rs
+++ b/columnar/benches/bench_values_u64.rs
@@ -1,10 +1,13 @@
+#![feature(test)]
+extern crate test;
+
 use std::ops::RangeInclusive;
 use std::sync::Arc;

-use binggan::{InputGroup, black_box};
 use rand::prelude::*;
-use tantivy_columnar::column_values::{CodecType, serialize_and_load_u64_based_column_values};
+use tantivy_columnar::column_values::{serialize_and_load_u64_based_column_values, CodecType};
 use tantivy_columnar::*;
+use test::Bencher;

 // Warning: this generates the same permutation at each call
 fn generate_permutation() -> Vec<u64> {
@@ -24,11 +27,37 @@ pub fn serialize_and_load(column: &[u64], codec_type: CodecType) -> Arc<dyn Colu
    serialize_and_load_u64_based_column_values(&column, &[codec_type])
 }

+#[bench]
+fn bench_intfastfield_jumpy_veclookup(b: &mut Bencher) {
+    let permutation = generate_permutation();
+    let n = permutation.len();
+    b.iter(|| {
+        let mut a = 0u64;
+        for _ in 0..n {
+            a = permutation[a as usize];
+        }
+        a
+    });
+}
+
+#[bench]
+fn bench_intfastfield_jumpy_fflookup_bitpacked(b: &mut Bencher) {
+    let permutation = generate_permutation();
+    let n = permutation.len();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&permutation, CodecType::Bitpacked);
+    b.iter(|| {
+        let mut a = 0u64;
+        for _ in 0..n {
+            a = column.get_val(a as u32);
+        }
+        a
+    });
+}
+
 const FIFTY_PERCENT_RANGE: RangeInclusive<u64> = 1..=50;
 const SINGLE_ITEM: u64 = 90;
 const SINGLE_ITEM_RANGE: RangeInclusive<u64> = 90..=90;
 const ONE_PERCENT_ITEM_RANGE: RangeInclusive<u64> = 49..=49;
-
 fn get_data_50percent_item() -> Vec<u128> {
    let mut rng = StdRng::from_seed([1u8; 32]);

@@ -40,122 +69,135 @@ fn get_data_50percent_item() -> Vec<u128> {
    data.push(SINGLE_ITEM);

    data.shuffle(&mut rng);
-    data.iter().map(|el| *el as u128).collect::<Vec<_>>()
+    let data = data.iter().map(|el| *el as u128).collect::<Vec<_>>();
+    data
 }

-type VecCol = (Vec<u64>, Arc<dyn ColumnValues<u64>>);
+// U64 RANGE START
+#[bench]
+fn bench_intfastfield_getrange_u64_50percent_hit(b: &mut Bencher) {
+    let data = get_data_50percent_item();
+    let data = data.iter().map(|el| *el as u64).collect::<Vec<_>>();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&data, CodecType::Bitpacked);
+    b.iter(|| {
+        let mut positions = Vec::new();
+        column.get_row_ids_for_value_range(
+            FIFTY_PERCENT_RANGE,
+            0..data.len() as u32,
+            &mut positions,
+        );
+        positions
+    });
+}

-fn bench_access() {
+#[bench]
+fn bench_intfastfield_getrange_u64_1percent_hit(b: &mut Bencher) {
+    let data = get_data_50percent_item();
+    let data = data.iter().map(|el| *el as u64).collect::<Vec<_>>();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&data, CodecType::Bitpacked);
+
+    b.iter(|| {
+        let mut positions = Vec::new();
+        column.get_row_ids_for_value_range(
+            ONE_PERCENT_ITEM_RANGE,
+            0..data.len() as u32,
+            &mut positions,
+        );
+        positions
+    });
+}
+
+#[bench]
+fn bench_intfastfield_getrange_u64_single_hit(b: &mut Bencher) {
+    let data = get_data_50percent_item();
+    let data = data.iter().map(|el| *el as u64).collect::<Vec<_>>();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&data, CodecType::Bitpacked);
+
+    b.iter(|| {
+        let mut positions = Vec::new();
+        column.get_row_ids_for_value_range(SINGLE_ITEM_RANGE, 0..data.len() as u32, &mut positions);
+        positions
+    });
+}
+
+#[bench]
+fn bench_intfastfield_getrange_u64_hit_all(b: &mut Bencher) {
+    let data = get_data_50percent_item();
+    let data = data.iter().map(|el| *el as u64).collect::<Vec<_>>();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&data, CodecType::Bitpacked);
+
+    b.iter(|| {
+        let mut positions = Vec::new();
+        column.get_row_ids_for_value_range(0..=u64::MAX, 0..data.len() as u32, &mut positions);
+        positions
+    });
+}
+// U64 RANGE END
+
+#[bench]
+fn bench_intfastfield_stride7_vec(b: &mut Bencher) {
    let permutation = generate_permutation();
-    let column_perm: Arc<dyn ColumnValues<u64>> =
-        serialize_and_load(&permutation, CodecType::Bitpacked);
-
-    let permutation_gcd = generate_permutation_gcd();
-    let column_perm_gcd: Arc<dyn ColumnValues<u64>> =
-        serialize_and_load(&permutation_gcd, CodecType::Bitpacked);
-
-    let mut group: InputGroup<VecCol> = InputGroup::new_with_inputs(vec![
-        (
-            "access".to_string(),
-            (permutation.clone(), column_perm.clone()),
-        ),
-        (
-            "access_gcd".to_string(),
-            (permutation_gcd.clone(), column_perm_gcd.clone()),
-        ),
-    ]);
-
-    group.register("stride7_vec", |inp: &VecCol| {
-        let n = inp.0.len();
+    let n = permutation.len();
+    b.iter(|| {
        let mut a = 0u64;
        for i in (0..n / 7).map(|val| val * 7) {
-            a += inp.0[i];
+            a += permutation[i as usize];
        }
-        black_box(a);
+        a
    });
+}

-    group.register("fullscan_vec", |inp: &VecCol| {
-        let mut a = 0u64;
-        for i in 0..inp.0.len() {
-            a += inp.0[i];
-        }
-        black_box(a);
-    });
-
-    group.register("stride7_column_values", |inp: &VecCol| {
-        let n = inp.1.num_vals() as usize;
-        let mut a = 0u64;
+#[bench]
+fn bench_intfastfield_stride7_fflookup(b: &mut Bencher) {
+    let permutation = generate_permutation();
+    let n = permutation.len();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&permutation, CodecType::Bitpacked);
+    b.iter(|| {
+        let mut a = 0;
        for i in (0..n / 7).map(|val| val * 7) {
-            a += inp.1.get_val(i as u32);
+            a += column.get_val(i as u32);
        }
-        black_box(a);
+        a
    });
+}

-    group.register("fullscan_column_values", |inp: &VecCol| {
+#[bench]
+fn bench_intfastfield_scan_all_fflookup(b: &mut Bencher) {
+    let permutation = generate_permutation();
+    let n = permutation.len();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&permutation, CodecType::Bitpacked);
+    let column_ref = column.as_ref();
+    b.iter(|| {
+        let mut a = 0u64;
+        for i in 0u32..n as u32 {
+            a += column_ref.get_val(i);
+        }
+        a
+    });
+}
+
+#[bench]
+fn bench_intfastfield_scan_all_fflookup_gcd(b: &mut Bencher) {
+    let permutation = generate_permutation_gcd();
+    let n = permutation.len();
+    let column: Arc<dyn ColumnValues<u64>> = serialize_and_load(&permutation, CodecType::Bitpacked);
+    b.iter(|| {
        let mut a = 0u64;
-        let n = inp.1.num_vals() as usize;
        for i in 0..n {
-            a += inp.1.get_val(i as u32);
+            a += column.get_val(i as u32);
        }
-        black_box(a);
+        a
    });
-
-    group.run();
 }

-fn bench_range() {
-    let data_50 = get_data_50percent_item();
-    let data_u64 = data_50.iter().map(|el| *el as u64).collect::<Vec<_>>();
-    let column_data: Arc<dyn ColumnValues<u64>> =
-        serialize_and_load(&data_u64, CodecType::Bitpacked);
-
-    let mut group: InputGroup<Arc<dyn ColumnValues<u64>>> =
-        InputGroup::new_with_inputs(vec![("dist_50pct_item".to_string(), column_data.clone())]);
-
-    group.register(
-        "fastfield_getrange_u64_50percent_hit",
-        |col: &Arc<dyn ColumnValues<u64>>| {
-            let mut positions = Vec::new();
-            col.get_row_ids_for_value_range(FIFTY_PERCENT_RANGE, 0..col.num_vals(), &mut positions);
-            black_box(positions.len());
-        },
-    );
-
-    group.register(
-        "fastfield_getrange_u64_1percent_hit",
-        |col: &Arc<dyn ColumnValues<u64>>| {
-            let mut positions = Vec::new();
-            col.get_row_ids_for_value_range(
-                ONE_PERCENT_ITEM_RANGE,
-                0..col.num_vals(),
-                &mut positions,
-            );
-            black_box(positions.len());
-        },
-    );
-
-    group.register(
-        "fastfield_getrange_u64_single_hit",
-        |col: &Arc<dyn ColumnValues<u64>>| {
-            let mut positions = Vec::new();
-            col.get_row_ids_for_value_range(SINGLE_ITEM_RANGE, 0..col.num_vals(), &mut positions);
-            black_box(positions.len());
-        },
-    );
-
-    group.register(
-        "fastfield_getrange_u64_hit_all",
-        |col: &Arc<dyn ColumnValues<u64>>| {
-            let mut positions = Vec::new();
-            col.get_row_ids_for_value_range(0..=u64::MAX, 0..col.num_vals(), &mut positions);
-            black_box(positions.len());
-        },
-    );
-
-    group.run();
-}
-
-fn main() {
-    bench_access();
-    bench_range();
+#[bench]
+fn bench_intfastfield_scan_all_vec(b: &mut Bencher) {
+    let permutation = generate_permutation();
+    b.iter(|| {
+        let mut a = 0u64;
+        for i in 0..permutation.len() {
+            a += permutation[i as usize] as u64;
+        }
+        a
+    });
 }
--- a/columnar/columnar-cli-inspect/Cargo.toml
+++ b/columnar/columnar-cli-inspect/Cargo.toml
@@ -1,18 +0,0 @@
-[package]
-name = "tantivy-columnar-inspect"
-version = "0.1.0"
-edition = "2021"
-license = "MIT"
-
-[dependencies]
-tantivy = {path="../..", package="tantivy"}
-columnar = {path="../", package="tantivy-columnar"}
-common = {path="../../common", package="tantivy-common"}
-
-[workspace]
-members = []
-
-[profile.release]
-debug = true
-#debug-assertions = true
-#overflow-checks = true
--- a/columnar/columnar-cli-inspect/src/main.rs
+++ b/columnar/columnar-cli-inspect/src/main.rs
@@ -1,54 +0,0 @@
-use columnar::ColumnarReader;
-use common::file_slice::{FileSlice, WrapFile};
-use std::io;
-use std::path::Path;
-use tantivy::directory::footer::Footer;
-
-fn main() -> io::Result<()> {
-    println!("Opens a columnar file written by tantivy and validates it.");
-    let path = std::env::args().nth(1).unwrap();
-
-    let path = Path::new(&path);
-    println!("Reading {:?}", path);
-    let _reader = open_and_validate_columnar(path.to_str().unwrap())?;
-
-    Ok(())
-}
-
-pub fn validate_columnar_reader(reader: &ColumnarReader) {
-    let num_rows = reader.num_rows();
-    println!("num_rows: {}", num_rows);
-    let columns = reader.list_columns().unwrap();
-    println!("num columns: {:?}", columns.len());
-    for (col_name, dynamic_column_handle) in columns {
-        let col = dynamic_column_handle.open().unwrap();
-        match col {
-            columnar::DynamicColumn::Bool(_)
-            | columnar::DynamicColumn::I64(_)
-            | columnar::DynamicColumn::U64(_)
-            | columnar::DynamicColumn::F64(_)
-            | columnar::DynamicColumn::IpAddr(_)
-            | columnar::DynamicColumn::DateTime(_)
-            | columnar::DynamicColumn::Bytes(_) => {}
-            columnar::DynamicColumn::Str(str_column) => {
-                let num_vals = str_column.ords().values.num_vals();
-                let num_terms_dict = str_column.num_terms() as u64;
-                let max_ord = str_column.ords().values.iter().max().unwrap_or_default();
-                println!("{col_name:35}  num_vals {num_vals:10} \t num_terms_dict {num_terms_dict:8} max_ord: {max_ord:8}",);
-                for ord in str_column.ords().values.iter() {
-                    assert!(ord < num_terms_dict);
-                }
-            }
-        }
-    }
-}
-
-/// Opens a columnar file that was written by tantivy and validates it.
-pub fn open_and_validate_columnar(path: &str) -> io::Result<ColumnarReader> {
-    let wrap_file = WrapFile::new(std::fs::File::open(path)?)?;
-    let slice = FileSlice::new(std::sync::Arc::new(wrap_file));
-    let (_footer, slice) = Footer::extract_footer(slice.clone()).unwrap();
-    let reader = ColumnarReader::open(slice).unwrap();
-    validate_columnar_reader(&reader);
-    Ok(reader)
-}
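Reviewer note: the core check in the removed `validate_columnar_reader` is that every ordinal of a string column is strictly smaller than the number of terms in its dictionary. A minimal sketch of that invariant on plain slices, with no tantivy types; the function name `first_invalid_ord` is hypothetical and not part of the crate:

```rust
/// Returns the first ordinal that does not point into the dictionary, if any.
/// `ords` plays the role of the column's term ordinals and `num_terms`
/// the dictionary size (both are illustrative stand-ins).
fn first_invalid_ord(ords: &[u64], num_terms: u64) -> Option<u64> {
    ords.iter().copied().find(|&ord| ord >= num_terms)
}
```

Unlike the `assert!` in the removed tool, this variant reports the offending ordinal instead of panicking, which may be more convenient in a test helper.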
--- a/columnar/src/block_accessor.rs
+++ b/columnar/src/block_accessor.rs
@@ -66,7 +66,7 @@ impl<T: PartialOrd + Copy + std::fmt::Debug + Send + Sync + 'static + Default>
        &'a self,
        docs: &'a [u32],
        accessor: &Column<T>,
-    ) -> impl Iterator<Item = (DocId, T)> + 'a + use<'a, T> {
+    ) -> impl Iterator<Item = (DocId, T)> + 'a {
        if accessor.index.get_cardinality().is_full() {
            docs.iter().cloned().zip(self.val_cache.iter().cloned())
        } else {
@@ -139,7 +139,7 @@ mod tests {
            missing_docs.push(missing_doc);
        });

-        assert_eq!(missing_docs, Vec::<u32>::new());
+        assert_eq!(missing_docs, vec![]);
    }

    #[test]
--- a/columnar/src/column/dictionary_encoded.rs
+++ b/columnar/src/column/dictionary_encoded.rs
@@ -4,8 +4,8 @@ use std::{fmt, io};

 use sstable::{Dictionary, VoidSSTable};

-use crate::RowId;
 use crate::column::Column;
+use crate::RowId;

 /// Dictionary encoded column.
 ///
--- a/columnar/src/column/mod.rs
+++ b/columnar/src/column/mod.rs
@@ -9,14 +9,13 @@ use std::sync::Arc;
 use common::BinarySerializable;
 pub use dictionary_encoded::{BytesColumn, StrColumn};
 pub use serialize::{
-    open_column_bytes, open_column_str, open_column_u64, open_column_u128,
-    open_column_u128_as_compact_u64, serialize_column_mappable_to_u64,
-    serialize_column_mappable_to_u128,
+    open_column_bytes, open_column_str, open_column_u128, open_column_u128_as_compact_u64,
+    open_column_u64, serialize_column_mappable_to_u128, serialize_column_mappable_to_u64,
 };

 use crate::column_index::{ColumnIndex, Set};
 use crate::column_values::monotonic_mapping::StrictlyMonotonicMappingToInternal;
-use crate::column_values::{ColumnValues, monotonic_map_column};
+use crate::column_values::{monotonic_map_column, ColumnValues};
 use crate::{Cardinality, DocId, EmptyColumnValues, MonotonicallyMappableToU64, RowId};

 #[derive(Clone)]
@@ -114,7 +113,7 @@ impl<T: PartialOrd + Copy + Debug + Send + Sync + 'static> Column<T> {
        }
    }

-    /// Translates a block of docids to row_ids.
+    /// Translates a block of docis to row_ids.
    ///
    /// returns the row_ids and the matching docids on the same index
    /// e.g.
--- a/columnar/src/column/serialize.rs
+++ b/columnar/src/column/serialize.rs
@@ -6,10 +6,10 @@ use common::OwnedBytes;
 use sstable::Dictionary;

 use crate::column::{BytesColumn, Column};
-use crate::column_index::{SerializableColumnIndex, serialize_column_index};
+use crate::column_index::{serialize_column_index, SerializableColumnIndex};
 use crate::column_values::{
-    CodecType, MonotonicallyMappableToU64, MonotonicallyMappableToU128,
    load_u64_based_column_values, serialize_column_values_u128, serialize_u64_based_column_values,
+    CodecType, MonotonicallyMappableToU128, MonotonicallyMappableToU64,
 };
 use crate::iterable::Iterable;
 use crate::{StrColumn, Version};
--- a/columnar/src/column_index/merge/mod.rs
+++ b/columnar/src/column_index/merge/mod.rs
@@ -99,9 +99,9 @@ mod tests {

    use crate::column_index::merge::detect_cardinality;
    use crate::column_index::multivalued_index::{
-        MultiValueIndex, open_multivalued_index, serialize_multivalued_index,
+        open_multivalued_index, serialize_multivalued_index, MultiValueIndex,
    };
-    use crate::column_index::{OptionalIndex, SerializableColumnIndex, merge_column_index};
+    use crate::column_index::{merge_column_index, OptionalIndex, SerializableColumnIndex};
    use crate::{
        Cardinality, ColumnIndex, MergeRowOrder, RowAddr, RowId, ShuffleMergeOrder, StackMergeOrder,
    };
--- a/columnar/src/column_index/merge/shuffled.rs
+++ b/columnar/src/column_index/merge/shuffled.rs
@@ -58,7 +58,7 @@ struct ShuffledIndex<'a> {
    merge_order: &'a ShuffleMergeOrder,
 }

-impl Iterable<u32> for ShuffledIndex<'_> {
+impl<'a> Iterable<u32> for ShuffledIndex<'a> {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = u32> + '_> {
        Box::new(
            self.merge_order
@@ -127,7 +127,7 @@ fn integrate_num_vals(num_vals: impl Iterator<Item = u32>) -> impl Iterator<Item
    )
 }

-impl Iterable<u32> for ShuffledMultivaluedIndex<'_> {
+impl<'a> Iterable<u32> for ShuffledMultivaluedIndex<'a> {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = u32> + '_> {
        let num_vals_per_row = iter_num_values(self.column_indexes, self.merge_order);
        Box::new(integrate_num_vals(num_vals_per_row))
@@ -137,8 +137,8 @@ impl Iterable<u32> for ShuffledMultivaluedIndex<'_> {
 #[cfg(test)]
 mod tests {
    use super::*;
-    use crate::RowAddr;
    use crate::column_index::OptionalIndex;
+    use crate::RowAddr;

    #[test]
    fn test_integrate_num_vals_empty() {
--- a/columnar/src/column_index/merge/stacked.rs
+++ b/columnar/src/column_index/merge/stacked.rs
@@ -1,8 +1,8 @@
 use std::ops::Range;

-use crate::column_index::SerializableColumnIndex;
 use crate::column_index::multivalued_index::{MultiValueIndex, SerializableMultivalueIndex};
 use crate::column_index::serialize::SerializableOptionalIndex;
+use crate::column_index::SerializableColumnIndex;
 use crate::iterable::Iterable;
 use crate::{Cardinality, ColumnIndex, RowId, StackMergeOrder};

@@ -56,7 +56,7 @@ fn get_doc_ids_with_values<'a>(
        ColumnIndex::Full => Box::new(doc_range),
        ColumnIndex::Optional(optional_index) => Box::new(
            optional_index
-                .iter_non_null_docs()
+                .iter_rows()
                .map(move |row| row + doc_range.start),
        ),
        ColumnIndex::Multivalued(multivalued_index) => match multivalued_index {
@@ -73,7 +73,7 @@ fn get_doc_ids_with_values<'a>(
            MultiValueIndex::MultiValueIndexV2(multivalued_index) => Box::new(
                multivalued_index
                    .optional_index
-                    .iter_non_null_docs()
+                    .iter_rows()
                    .map(move |row| row + doc_range.start),
            ),
        },
@@ -105,11 +105,10 @@ fn get_num_values_iterator<'a>(
 ) -> Box<dyn Iterator<Item = u32> + 'a> {
    match column_index {
        ColumnIndex::Empty { .. } => Box::new(std::iter::empty()),
-        ColumnIndex::Full => Box::new(std::iter::repeat_n(1u32, num_docs as usize)),
-        ColumnIndex::Optional(optional_index) => Box::new(std::iter::repeat_n(
-            1u32,
-            optional_index.num_non_nulls() as usize,
-        )),
+        ColumnIndex::Full => Box::new(std::iter::repeat(1u32).take(num_docs as usize)),
+        ColumnIndex::Optional(optional_index) => {
+            Box::new(std::iter::repeat(1u32).take(optional_index.num_non_nulls() as usize))
+        }
        ColumnIndex::Multivalued(multivalued_index) => Box::new(
            multivalued_index
                .get_start_index_column()
@@ -124,7 +123,7 @@ fn get_num_values_iterator<'a>(
    }
 }

-impl Iterable<u32> for StackedStartOffsets<'_> {
+impl<'a> Iterable<u32> for StackedStartOffsets<'a> {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = u32> + '_> {
        let num_values_it = (0..self.column_indexes.len()).flat_map(|columnar_id| {
            let num_docs = self.stack_merge_order.columnar_range(columnar_id).len() as u32;
@@ -178,7 +177,7 @@ impl<'a> Iterable<RowId> for StackedOptionalIndex<'a> {
                        ColumnIndex::Full => Box::new(columnar_row_range),
                        ColumnIndex::Optional(optional_index) => Box::new(
                            optional_index
-                                .iter_non_null_docs()
+                                .iter_rows()
                                .map(move |row_id: RowId| columnar_row_range.start + row_id),
                        ),
                        ColumnIndex::Multivalued(_) => {
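Reviewer note: the `get_num_values_iterator` hunk in this file swaps `std::iter::repeat_n(x, n)` for `std::iter::repeat(x).take(n)`. Both spellings should yield the same sequence; `repeat_n` has been stable since Rust 1.82. The sketch below illustrates the equivalence with two hypothetical helper functions that are not part of the crate:

```rust
/// One `1u32` per document, using the `repeat(..).take(..)` spelling.
fn ones_with_take(num_docs: u32) -> Box<dyn Iterator<Item = u32>> {
    Box::new(std::iter::repeat(1u32).take(num_docs as usize))
}

/// The same sequence, using `repeat_n` (requires Rust 1.82 or later).
fn ones_with_repeat_n(num_docs: u32) -> Box<dyn Iterator<Item = u32>> {
    Box::new(std::iter::repeat_n(1u32, num_docs as usize))
}
```

The `repeat(..).take(..)` form compiles on older toolchains, which is presumably the reason for the change in this direction.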
--- a/columnar/src/column_index/mod.rs
+++ b/columnar/src/column_index/mod.rs
@@ -14,7 +14,7 @@ pub use merge::merge_column_index;
 pub(crate) use multivalued_index::SerializableMultivalueIndex;
 pub use optional_index::{OptionalIndex, Set};
 pub use serialize::{
-    SerializableColumnIndex, SerializableOptionalIndex, open_column_index, serialize_column_index,
+    open_column_index, serialize_column_index, SerializableColumnIndex, SerializableOptionalIndex,
 };

 use crate::column_index::multivalued_index::MultiValueIndex;
--- a/columnar/src/column_index/multivalued_index.rs
+++ b/columnar/src/column_index/multivalued_index.rs
@@ -8,7 +8,7 @@ use common::{CountingWriter, OwnedBytes};
 use super::optional_index::{open_optional_index, serialize_optional_index};
 use super::{OptionalIndex, SerializableOptionalIndex, Set};
 use crate::column_values::{
-    CodecType, ColumnValues, load_u64_based_column_values, serialize_u64_based_column_values,
+    load_u64_based_column_values, serialize_u64_based_column_values, CodecType, ColumnValues,
 };
 use crate::iterable::Iterable;
 use crate::{DocId, RowId, Version};
@@ -215,32 +215,6 @@ impl MultiValueIndex {
        }
    }

-    /// Returns an iterator over document ids that have at least one value.
-    pub fn iter_non_null_docs(&self) -> Box<dyn Iterator<Item = DocId> + '_> {
-        match self {
-            MultiValueIndex::MultiValueIndexV1(idx) => {
-                let mut doc: DocId = 0u32;
-                let num_docs = idx.num_docs();
-                Box::new(std::iter::from_fn(move || {
-                    // This is not the most efficient way to do this, but it's legacy code.
-                    while doc < num_docs {
-                        let cur = doc;
-                        doc += 1;
-                        let start = idx.start_index_column.get_val(cur);
-                        let end = idx.start_index_column.get_val(cur + 1);
-                        if end > start {
-                            return Some(cur);
-                        }
-                    }
-                    None
-                }))
-            }
-            MultiValueIndex::MultiValueIndexV2(idx) => {
-                Box::new(idx.optional_index.iter_non_null_docs())
-            }
-        }
-    }
-
    /// Converts a list of ranks (row ids of values) in a 1:n index to the corresponding list of
    /// docids. Positions are converted inplace to docids.
    ///
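Reviewer note: the removed `iter_non_null_docs` walks the V1 start-offset column and keeps every document whose value range is non-empty (`end > start`). A minimal sketch of the same idea on a plain slice, assuming the offsets are stored as `num_docs + 1` non-decreasing entries; `docs_with_values` is a hypothetical name:

```rust
/// Yields the ids of documents that own at least one value.
/// Document `d` owns the values in `start_offsets[d]..start_offsets[d + 1]`,
/// so it has a value exactly when the next offset is strictly larger.
fn docs_with_values(start_offsets: &[u32]) -> impl Iterator<Item = u32> + '_ {
    start_offsets
        .windows(2)
        .enumerate()
        .filter(|(_, pair)| pair[1] > pair[0])
        .map(|(doc, _)| doc as u32)
}
```

For offsets `[0, 2, 2, 5, 5]`, documents 0 and 2 have values while documents 1 and 3 are empty.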
--- a/columnar/src/column_index/optional_index/mod.rs
+++ b/columnar/src/column_index/optional_index/mod.rs
@@ -1,4 +1,4 @@
-use std::io;
+use std::io::{self, Write};
 use std::sync::Arc;

 mod set;
@@ -7,11 +7,11 @@ mod set_block;
 use common::{BinarySerializable, OwnedBytes, VInt};
 pub use set::{SelectCursor, Set, SetCodec};
 use set_block::{
-    DENSE_BLOCK_NUM_BYTES, DenseBlock, DenseBlockCodec, SparseBlock, SparseBlockCodec,
+    DenseBlock, DenseBlockCodec, SparseBlock, SparseBlockCodec, DENSE_BLOCK_NUM_BYTES,
 };

 use crate::iterable::Iterable;
-use crate::{DocId, RowId};
+use crate::{DocId, InvalidData, RowId};

 /// The threshold for for number of elements after which we switch to dense block encoding.
 ///
@@ -80,23 +80,23 @@ impl BlockVariant {
 /// index is the block index. For each block `byte_start` and `offset` is computed.
 #[derive(Clone)]
 pub struct OptionalIndex {
-    num_docs: RowId,
-    num_non_null_docs: RowId,
+    num_rows: RowId,
+    num_non_null_rows: RowId,
    block_data: OwnedBytes,
    block_metas: Arc<[BlockMeta]>,
 }

-impl Iterable<u32> for &OptionalIndex {
+impl<'a> Iterable<u32> for &'a OptionalIndex {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = u32> + '_> {
-        Box::new(self.iter_non_null_docs())
+        Box::new(self.iter_rows())
    }
 }

 impl std::fmt::Debug for OptionalIndex {
    fn fmt(&self, f: &mut std::fmt::Formatter) -> std::fmt::Result {
        f.debug_struct("OptionalIndex")
-            .field("num_docs", &self.num_docs)
-            .field("num_non_null_docs", &self.num_non_null_docs)
+            .field("num_rows", &self.num_rows)
+            .field("num_non_null_rows", &self.num_non_null_rows)
            .finish_non_exhaustive()
    }
 }
@@ -123,7 +123,7 @@ enum BlockSelectCursor<'a> {
    Sparse(<SparseBlock<'a> as Set<u16>>::SelectCursor<'a>),
 }

-impl BlockSelectCursor<'_> {
+impl<'a> BlockSelectCursor<'a> {
    fn select(&mut self, rank: u16) -> u16 {
        match self {
            BlockSelectCursor::Dense(dense_select_cursor) => dense_select_cursor.select(rank),
@@ -141,7 +141,7 @@ pub struct OptionalIndexSelectCursor<'a> {
    num_null_rows_before_block: RowId,
 }

-impl OptionalIndexSelectCursor<'_> {
+impl<'a> OptionalIndexSelectCursor<'a> {
    fn search_and_load_block(&mut self, rank: RowId) {
        if rank < self.current_block_end_rank {
            // we are already in the right block
@@ -165,7 +165,7 @@ impl OptionalIndexSelectCursor<'_> {
    }
 }

-impl SelectCursor<RowId> for OptionalIndexSelectCursor<'_> {
+impl<'a> SelectCursor<RowId> for OptionalIndexSelectCursor<'a> {
    fn select(&mut self, rank: RowId) -> RowId {
        self.search_and_load_block(rank);
        let index_in_block = (rank - self.num_null_rows_before_block) as u16;
@@ -259,13 +259,11 @@ impl Set<RowId> for OptionalIndex {

 impl OptionalIndex {
    pub fn for_test(num_rows: RowId, row_ids: &[RowId]) -> OptionalIndex {
-        assert!(
-            row_ids
-                .last()
-                .copied()
-                .map(|last_row_id| last_row_id < num_rows)
-                .unwrap_or(true)
-        );
+        assert!(row_ids
+            .last()
+            .copied()
+            .map(|last_row_id| last_row_id < num_rows)
+            .unwrap_or(true));
        let mut buffer = Vec::new();
        serialize_optional_index(&row_ids, num_rows, &mut buffer).unwrap();
        let bytes = OwnedBytes::new(buffer);
@@ -273,18 +271,17 @@ impl OptionalIndex {
    }

    pub fn num_docs(&self) -> RowId {
-        self.num_docs
+        self.num_rows
    }

    pub fn num_non_nulls(&self) -> RowId {
-        self.num_non_null_docs
+        self.num_non_null_rows
    }

-    pub fn iter_non_null_docs(&self) -> impl Iterator<Item = RowId> + '_ {
-        // TODO optimize. We could iterate over the blocks directly.
-        // We use the dense value ids and retrieve the doc ids via select.
+    pub fn iter_rows(&self) -> impl Iterator<Item = RowId> + '_ {
+        // TODO optimize
        let mut select_batch = self.select_cursor();
-        (0..self.num_non_null_docs).map(move |rank| select_batch.select(rank))
+        (0..self.num_non_null_rows).map(move |rank| select_batch.select(rank))
    }
    pub fn select_batch(&self, ranks: &mut [RowId]) {
        let mut select_cursor = self.select_cursor();
@@ -335,6 +332,38 @@ enum Block<'a> {
    Sparse(SparseBlock<'a>),
 }

+#[derive(Debug, Copy, Clone)]
+enum OptionalIndexCodec {
+    Dense = 0,
+    Sparse = 1,
+}
+
+impl OptionalIndexCodec {
+    fn to_code(self) -> u8 {
+        self as u8
+    }
+
+    fn try_from_code(code: u8) -> Result<Self, InvalidData> {
+        match code {
+            0 => Ok(Self::Dense),
+            1 => Ok(Self::Sparse),
+            _ => Err(InvalidData),
+        }
+    }
+}
+
+impl BinarySerializable for OptionalIndexCodec {
+    fn serialize<W: Write + ?Sized>(&self, writer: &mut W) -> io::Result<()> {
+        writer.write_all(&[self.to_code()])
+    }
+
+    fn deserialize<R: io::Read>(reader: &mut R) -> io::Result<Self> {
+        let optional_codec_code = u8::deserialize(reader)?;
+        let optional_codec = Self::try_from_code(optional_codec_code)?;
+        Ok(optional_codec)
+    }
+}
+
 fn serialize_optional_index_block(block_els: &[u16], out: &mut impl io::Write) -> io::Result<()> {
    let is_sparse = is_sparse(block_els.len() as u32);
    if is_sparse {
@@ -476,7 +505,7 @@ fn deserialize_optional_index_block_metadatas(
        non_null_rows_before_block += num_non_null_rows;
    }
    block_metas.resize(
-        num_rows.div_ceil(ELEMENTS_PER_BLOCK) as usize,
+        ((num_rows + ELEMENTS_PER_BLOCK - 1) / ELEMENTS_PER_BLOCK) as usize,
        BlockMeta {
            non_null_rows_before_block,
            start_byte_offset,
@@ -490,15 +519,15 @@ pub fn open_optional_index(bytes: OwnedBytes) -> io::Result<OptionalIndex> {
    let (mut bytes, num_non_empty_blocks_bytes) = bytes.rsplit(2);
    let num_non_empty_block_bytes =
        u16::from_le_bytes(num_non_empty_blocks_bytes.as_slice().try_into().unwrap());
-    let num_docs = VInt::deserialize_u64(&mut bytes)? as u32;
+    let num_rows = VInt::deserialize_u64(&mut bytes)? as u32;
    let block_metas_num_bytes =
        num_non_empty_block_bytes as usize * SERIALIZED_BLOCK_META_NUM_BYTES;
    let (block_data, block_metas) = bytes.rsplit(block_metas_num_bytes);
-    let (block_metas, num_non_null_docs) =
-        deserialize_optional_index_block_metadatas(block_metas.as_slice(), num_docs);
+    let (block_metas, num_non_null_rows) =
+        deserialize_optional_index_block_metadatas(block_metas.as_slice(), num_rows);
    let optional_index = OptionalIndex {
-        num_docs,
-        num_non_null_docs,
+        num_rows,
+        num_non_null_rows,
        block_data,
        block_metas: block_metas.into(),
    };
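Reviewer note: the `deserialize_optional_index_block_metadatas` hunk in this file replaces `num_rows.div_ceil(ELEMENTS_PER_BLOCK)` with the manual `(num_rows + ELEMENTS_PER_BLOCK - 1) / ELEMENTS_PER_BLOCK`. The two agree whenever the addition does not overflow. The sketch below compares them; the block size of `1 << 16` is an assumption made for illustration, and both function names are hypothetical:

```rust
/// Assumed block size, for illustration only.
const ELEMENTS_PER_BLOCK: u32 = 1 << 16;

/// Manual ceiling division. The addition overflows once
/// `num_rows + ELEMENTS_PER_BLOCK - 1` exceeds `u32::MAX`.
fn num_blocks_manual(num_rows: u32) -> u32 {
    (num_rows + ELEMENTS_PER_BLOCK - 1) / ELEMENTS_PER_BLOCK
}

/// Ceiling division via `u32::div_ceil`, which has no intermediate addition.
fn num_blocks_div_ceil(num_rows: u32) -> u32 {
    num_rows.div_ceil(ELEMENTS_PER_BLOCK)
}
```

The manual form panics in debug builds (and wraps in release builds) for row counts close to `u32::MAX`, while `div_ceil` handles the full `u32` range.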
--- a/columnar/src/column_index/optional_index/set_block/dense.rs
+++ b/columnar/src/column_index/optional_index/set_block/dense.rs
@@ -2,7 +2,7 @@ use std::io::{self, Write};

 use common::BinarySerializable;

-use crate::column_index::optional_index::{ELEMENTS_PER_BLOCK, SelectCursor, Set, SetCodec};
+use crate::column_index::optional_index::{SelectCursor, Set, SetCodec, ELEMENTS_PER_BLOCK};

 #[inline(always)]
 fn get_bit_at(input: u64, n: u16) -> bool {
@@ -23,6 +23,7 @@ fn set_bit_at(input: &mut u64, n: u16) {
 ///
 /// When translating a dense index to the original index, we can use the offset to find the correct
 /// block. Direct computation is not possible, but we can employ a linear or binary search.
+
 const ELEMENTS_PER_MINI_BLOCK: u16 = 64;
 const MINI_BLOCK_BITVEC_NUM_BYTES: usize = 8;
 const MINI_BLOCK_OFFSET_NUM_BYTES: usize = 2;
@@ -108,7 +109,7 @@ pub struct DenseBlockSelectCursor<'a> {
    dense_block: DenseBlock<'a>,
 }

-impl SelectCursor<u16> for DenseBlockSelectCursor<'_> {
+impl<'a> SelectCursor<u16> for DenseBlockSelectCursor<'a> {
    #[inline]
    fn select(&mut self, rank: u16) -> u16 {
        self.block_id = self
@@ -174,7 +175,7 @@ impl<'a> Set<u16> for DenseBlock<'a> {
    }
 }

-impl DenseBlock<'_> {
+impl<'a> DenseBlock<'a> {
    #[inline]
    fn mini_block(&self, mini_block_id: u16) -> DenseMiniBlock {
        let data_start_pos = mini_block_id as usize * MINI_BLOCK_NUM_BYTES;
--- a/columnar/src/column_index/optional_index/set_block/mod.rs
+++ b/columnar/src/column_index/optional_index/set_block/mod.rs
@@ -1,7 +1,7 @@
 mod dense;
 mod sparse;

-pub use dense::{DENSE_BLOCK_NUM_BYTES, DenseBlock, DenseBlockCodec};
+pub use dense::{DenseBlock, DenseBlockCodec, DENSE_BLOCK_NUM_BYTES};
 pub use sparse::{SparseBlock, SparseBlockCodec};

 #[cfg(test)]
--- a/columnar/src/column_index/optional_index/set_block/sparse.rs
+++ b/columnar/src/column_index/optional_index/set_block/sparse.rs
@@ -31,7 +31,7 @@ impl<'a> SelectCursor<u16> for SparseBlock<'a> {
    }
 }

-impl Set<u16> for SparseBlock<'_> {
+impl<'a> Set<u16> for SparseBlock<'a> {
    type SelectCursor<'b>
        = Self
    where Self: 'b;
@@ -69,7 +69,7 @@ fn get_u16(data: &[u8], byte_position: usize) -> u16 {
    u16::from_le_bytes(bytes)
 }

-impl SparseBlock<'_> {
+impl<'a> SparseBlock<'a> {
    #[inline(always)]
    fn value_at_idx(&self, data: &[u8], idx: u16) -> u16 {
        let start_offset: usize = idx as usize * 2;
--- a/columnar/src/column_index/optional_index/tests.rs
+++ b/columnar/src/column_index/optional_index/tests.rs
@@ -164,11 +164,7 @@ fn test_optional_index_large() {
 fn test_optional_index_iter_aux(row_ids: &[RowId], num_rows: RowId) {
    let optional_index = OptionalIndex::for_test(num_rows, row_ids);
    assert_eq!(optional_index.num_docs(), num_rows);
-    assert!(
-        optional_index
-            .iter_non_null_docs()
-            .eq(row_ids.iter().copied())
-    );
+    assert!(optional_index.iter_rows().eq(row_ids.iter().copied()));
 }

 #[test]
@@ -223,3 +219,174 @@ fn test_optional_index_for_tests() {
    assert!(!optional_index.contains(3));
    assert_eq!(optional_index.num_docs(), 4);
 }
+
+#[cfg(all(test, feature = "unstable"))]
+mod bench {
+
+    use rand::rngs::StdRng;
+    use rand::{Rng, SeedableRng};
+    use test::Bencher;
+
+    use super::*;
+
+    const TOTAL_NUM_VALUES: u32 = 1_000_000;
+    fn gen_bools(fill_ratio: f64) -> OptionalIndex {
+        let mut out = Vec::new();
+        let mut rng: StdRng = StdRng::from_seed([1u8; 32]);
+        let vals: Vec<RowId> = (0..TOTAL_NUM_VALUES)
+            .map(|_| rng.gen_bool(fill_ratio))
+            .enumerate()
+            .filter(|(_pos, val)| *val)
+            .map(|(pos, _)| pos as RowId)
+            .collect();
+        serialize_optional_index(&&vals[..], TOTAL_NUM_VALUES, &mut out).unwrap();
+
+        open_optional_index(OwnedBytes::new(out)).unwrap()
+    }
+
+    fn random_range_iterator(
+        start: u32,
+        end: u32,
+        avg_step_size: u32,
+        avg_deviation: u32,
+    ) -> impl Iterator<Item = u32> {
+        let mut rng: StdRng = StdRng::from_seed([1u8; 32]);
+        let mut current = start;
+        std::iter::from_fn(move || {
+            current += rng.gen_range(avg_step_size - avg_deviation..=avg_step_size + avg_deviation);
+            if current >= end {
+                None
+            } else {
+                Some(current)
+            }
+        })
+    }
+
+    fn n_percent_step_iterator(percent: f32, num_values: u32) -> impl Iterator<Item = u32> {
+        let ratio = percent / 100.0;
+        let step_size = (1f32 / ratio) as u32;
+        let deviation = step_size - 1;
+        random_range_iterator(0, num_values, step_size, deviation)
+    }
+
+    fn walk_over_data(codec: &OptionalIndex, avg_step_size: u32) -> Option<u32> {
+        walk_over_data_from_positions(
+            codec,
+            random_range_iterator(0, TOTAL_NUM_VALUES, avg_step_size, 0),
+        )
+    }
+
+    fn walk_over_data_from_positions(
+        codec: &OptionalIndex,
+        positions: impl Iterator<Item = u32>,
+    ) -> Option<u32> {
+        let mut dense_idx: Option<u32> = None;
+        for idx in positions {
+            dense_idx = dense_idx.or(codec.rank_if_exists(idx));
+        }
+        dense_idx
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_1percent_filled_10percent_hit(bench: &mut Bencher) {
+        let codec = gen_bools(0.01f64);
+        bench.iter(|| walk_over_data(&codec, 100));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_5percent_filled_10percent_hit(bench: &mut Bencher) {
+        let codec = gen_bools(0.05f64);
+        bench.iter(|| walk_over_data(&codec, 100));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_5percent_filled_1percent_hit(bench: &mut Bencher) {
+        let codec = gen_bools(0.05f64);
+        bench.iter(|| walk_over_data(&codec, 1000));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_full_scan_1percent_filled(bench: &mut Bencher) {
+        let codec = gen_bools(0.01f64);
+        bench.iter(|| walk_over_data_from_positions(&codec, 0..TOTAL_NUM_VALUES));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_full_scan_10percent_filled(bench: &mut Bencher) {
+        let codec = gen_bools(0.1f64);
+        bench.iter(|| walk_over_data_from_positions(&codec, 0..TOTAL_NUM_VALUES));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_full_scan_90percent_filled(bench: &mut Bencher) {
+        let codec = gen_bools(0.9f64);
+        bench.iter(|| walk_over_data_from_positions(&codec, 0..TOTAL_NUM_VALUES));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_10percent_filled_1percent_hit(bench: &mut Bencher) {
+        let codec = gen_bools(0.1f64);
+        bench.iter(|| walk_over_data(&codec, 100));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_50percent_filled_1percent_hit(bench: &mut Bencher) {
+        let codec = gen_bools(0.5f64);
+        bench.iter(|| walk_over_data(&codec, 100));
+    }
+
+    #[bench]
+    fn bench_translate_orig_to_codec_90percent_filled_1percent_hit(bench: &mut Bencher) {
+        let codec = gen_bools(0.9f64);
+        bench.iter(|| walk_over_data(&codec, 100));
+    }
+
+    #[bench]
+    fn bench_translate_codec_to_orig_1percent_filled_0comma005percent_hit(bench: &mut Bencher) {
+        bench_translate_codec_to_orig_util(0.01f64, 0.005f32, bench);
+    }
+
+    #[bench]
+    fn bench_translate_codec_to_orig_10percent_filled_0comma005percent_hit(bench: &mut Bencher) {
+        bench_translate_codec_to_orig_util(0.1f64, 0.005f32, bench);
+    }
+
+    #[bench]
+    fn bench_translate_codec_to_orig_1percent_filled_10percent_hit(bench: &mut Bencher) {
+        bench_translate_codec_to_orig_util(0.01f64, 10f32, bench);
+    }
+
+    #[bench]
+    fn bench_translate_codec_to_orig_1percent_filled_full_scan(bench: &mut Bencher) {
+        bench_translate_codec_to_orig_util(0.01f64, 100f32, bench);
+    }
+
+    fn bench_translate_codec_to_orig_util(
+        percent_filled: f64,
+        percent_hit: f32,
+        bench: &mut Bencher,
+    ) {
+        let codec = gen_bools(percent_filled);
+        let num_non_nulls = codec.num_non_nulls();
+        let idxs: Vec<u32> = if percent_hit == 100.0f32 {
+            (0..num_non_nulls).collect()
+        } else {
+            n_percent_step_iterator(percent_hit, num_non_nulls).collect()
+        };
+        let mut output = vec![0u32; idxs.len()];
+        bench.iter(|| {
+            output.copy_from_slice(&idxs[..]);
+            codec.select_batch(&mut output);
+        });
+    }
+
+    #[bench]
+    fn bench_translate_codec_to_orig_90percent_filled_0comma005percent_hit(bench: &mut Bencher) {
+        bench_translate_codec_to_orig_util(0.9f64, 0.005, bench);
+    }
+
+    #[bench]
+    fn bench_translate_codec_to_orig_90percent_filled_full_scan(bench: &mut Bencher) {
+        bench_translate_codec_to_orig_util(0.9f64, 100.0f32, bench);
+    }
+}
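Reviewer note: the benchmarks above exercise two translations, original row id to dense index (`rank_if_exists`) and dense index back to original row id (`select`). The toy model below shows the relationship between the two on a sorted vector. It is only a sketch: `SortedRows` is a hypothetical stand-in and does not reflect the block-based layout of `OptionalIndex`.

```rust
/// Toy stand-in for an optional index: the sorted row ids that hold a value.
struct SortedRows(Vec<u32>);

impl SortedRows {
    /// Original row id to dense rank, or `None` if the row holds no value.
    fn rank_if_exists(&self, row_id: u32) -> Option<u32> {
        self.0.binary_search(&row_id).ok().map(|rank| rank as u32)
    }

    /// Dense rank to original row id. Panics if `rank` is out of bounds.
    fn select(&self, rank: u32) -> u32 {
        self.0[rank as usize]
    }
}
```

For every rank `r` below the number of stored rows, `rank_if_exists(select(r))` returns `Some(r)`, which is the round trip the `orig_to_codec` and `codec_to_orig` benchmark families measure from either side.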
--- a/columnar/src/column_index/serialize.rs
+++ b/columnar/src/column_index/serialize.rs
@@ -3,11 +3,11 @@ use std::io::Write;

 use common::{CountingWriter, OwnedBytes};

-use super::OptionalIndex;
 use super::multivalued_index::SerializableMultivalueIndex;
-use crate::column_index::ColumnIndex;
+use super::OptionalIndex;
 use crate::column_index::multivalued_index::serialize_multivalued_index;
 use crate::column_index::optional_index::serialize_optional_index;
+use crate::column_index::ColumnIndex;
 use crate::iterable::Iterable;
 use crate::{Cardinality, RowId, Version};

@@ -31,7 +31,7 @@ pub enum SerializableColumnIndex<'a> {
    Multivalued(SerializableMultivalueIndex<'a>),
 }

-impl SerializableColumnIndex<'_> {
+impl<'a> SerializableColumnIndex<'a> {
    pub fn get_cardinality(&self) -> Cardinality {
        match self {
            SerializableColumnIndex::Full => Cardinality::Full,
--- a/columnar/src/column_values/bench.rs
+++ b/columnar/src/column_values/bench.rs
@@ -0,0 +1,139 @@
+use std::sync::Arc;
+
+use common::OwnedBytes;
+use rand::rngs::StdRng;
+use rand::{Rng, SeedableRng};
+use test::{self, Bencher};
+
+use super::*;
+use crate::column_values::u64_based::*;
+
+fn get_data() -> Vec<u64> {
+    let mut rng = StdRng::seed_from_u64(2u64);
+    let mut data: Vec<_> = (100..55000_u64)
+        .map(|num| num + rng.gen::<u8>() as u64)
+        .collect();
+    data.push(99_000);
+    data.insert(1000, 2000);
+    data.insert(2000, 100);
+    data.insert(3000, 4100);
+    data.insert(4000, 100);
+    data.insert(5000, 800);
+    data
+}
+
+fn compute_stats(vals: impl Iterator<Item = u64>) -> ColumnStats {
+    let mut stats_collector = StatsCollector::default();
+    for val in vals {
+        stats_collector.collect(val);
+    }
+    stats_collector.stats()
+}
+
+#[inline(never)]
+fn value_iter() -> impl Iterator<Item = u64> {
+    0..20_000
+}
+
+fn get_reader_for_bench<Codec: ColumnCodec>(data: &[u64]) -> Codec::ColumnValues {
+    let mut bytes = Vec::new();
+    let stats = compute_stats(data.iter().cloned());
+    let mut codec_serializer = Codec::estimator();
+    for val in data {
+        codec_serializer.collect(*val);
+    }
+    codec_serializer
+        .serialize(&stats, Box::new(data.iter().copied()).as_mut(), &mut bytes)
+        .unwrap();
+
+    Codec::load(OwnedBytes::new(bytes)).unwrap()
+}
+
+fn bench_get<Codec: ColumnCodec>(b: &mut Bencher, data: &[u64]) {
+    let col = get_reader_for_bench::<Codec>(data);
+    b.iter(|| {
+        let mut sum = 0u64;
+        for pos in value_iter() {
+            let val = col.get_val(pos as u32);
+            sum = sum.wrapping_add(val);
+        }
+        sum
+    });
+}
+
+#[inline(never)]
+fn bench_get_dynamic_helper(b: &mut Bencher, col: Arc<dyn ColumnValues>) {
+    b.iter(|| {
+        let mut sum = 0u64;
+        for pos in value_iter() {
+            let val = col.get_val(pos as u32);
+            sum = sum.wrapping_add(val);
+        }
+        sum
+    });
+}
+
+fn bench_get_dynamic<Codec: ColumnCodec>(b: &mut Bencher, data: &[u64]) {
+    let col = Arc::new(get_reader_for_bench::<Codec>(data));
+    bench_get_dynamic_helper(b, col);
+}
+fn bench_create<Codec: ColumnCodec>(b: &mut Bencher, data: &[u64]) {
+    let stats = compute_stats(data.iter().cloned());
+
+    let mut bytes = Vec::new();
+    b.iter(|| {
+        bytes.clear();
+        let mut codec_serializer = Codec::estimator();
+        for val in data.iter().take(1024) {
+            codec_serializer.collect(*val);
+        }
+
+        codec_serializer.serialize(&stats, Box::new(data.iter().copied()).as_mut(), &mut bytes)
+    });
+}
+
+#[bench]
+fn bench_fastfield_bitpack_create(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_create::<BitpackedCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_linearinterpol_create(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_create::<LinearCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_multilinearinterpol_create(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_create::<BlockwiseLinearCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_bitpack_get(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_get::<BitpackedCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_bitpack_get_dynamic(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_get_dynamic::<BitpackedCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_linearinterpol_get(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_get::<LinearCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_linearinterpol_get_dynamic(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_get_dynamic::<LinearCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_multilinearinterpol_get(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_get::<BlockwiseLinearCodec>(b, &data);
+}
+#[bench]
+fn bench_fastfield_multilinearinterpol_get_dynamic(b: &mut Bencher) {
+    let data: Vec<_> = get_data();
+    bench_get_dynamic::<BlockwiseLinearCodec>(b, &data);
+}
--- a/columnar/src/column_values/merge.rs
+++ b/columnar/src/column_values/merge.rs
@@ -10,7 +10,7 @@ pub(crate) struct MergedColumnValues<'a, T> {
    pub(crate) merge_row_order: &'a MergeRowOrder,
 }

-impl<T: Copy + PartialOrd + Debug + 'static> Iterable<T> for MergedColumnValues<'_, T> {
+impl<'a, T: Copy + PartialOrd + Debug + 'static> Iterable<T> for MergedColumnValues<'a, T> {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = T> + '_> {
        match self.merge_row_order {
            MergeRowOrder::Stack(_) => Box::new(
--- a/columnar/src/column_values/mod.rs
+++ b/columnar/src/column_values/mod.rs
@@ -26,13 +26,13 @@ mod monotonic_column;

 pub(crate) use merge::MergedColumnValues;
 pub use stats::ColumnStats;
-pub use u64_based::{
-    ALL_U64_CODEC_TYPES, CodecType, load_u64_based_column_values,
-    serialize_and_load_u64_based_column_values, serialize_u64_based_column_values,
-};
 pub use u128_based::{
-    CompactSpaceU64Accessor, open_u128_as_compact_u64, open_u128_mapped,
-    serialize_column_values_u128,
+    open_u128_as_compact_u64, open_u128_mapped, serialize_column_values_u128,
+    CompactSpaceU64Accessor,
+};
+pub use u64_based::{
+    load_u64_based_column_values, serialize_and_load_u64_based_column_values,
+    serialize_u64_based_column_values, CodecType, ALL_U64_CODEC_TYPES,
 };
 pub use vec_column::VecColumn;

@@ -242,3 +242,6 @@ impl<T: Copy + PartialOrd + Debug + 'static> ColumnValues<T> for Arc<dyn ColumnV
            .get_row_ids_for_value_range(range, doc_id_range, positions)
    }
 }
+
+#[cfg(all(test, feature = "unstable"))]
+mod bench;
--- a/columnar/src/column_values/monotonic_column.rs
+++ b/columnar/src/column_values/monotonic_column.rs
@@ -2,8 +2,8 @@ use std::fmt::Debug;
 use std::marker::PhantomData;
 use std::ops::{Range, RangeInclusive};

-use crate::ColumnValues;
 use crate::column_values::monotonic_mapping::StrictlyMonotonicFn;
+use crate::ColumnValues;

 struct MonotonicMappingColumn<C, T, Input> {
    from_column: C,
@@ -99,10 +99,10 @@ where
 #[cfg(test)]
 mod tests {
    use super::*;
-    use crate::column_values::VecColumn;
    use crate::column_values::monotonic_mapping::{
        StrictlyMonotonicMappingInverter, StrictlyMonotonicMappingToInternal,
    };
+    use crate::column_values::VecColumn;

    #[test]
    fn test_monotonic_mapping_iter() {
--- a/columnar/src/column_values/u128_based/compact_space/build_compact_space.rs
+++ b/columnar/src/column_values/u128_based/compact_space/build_compact_space.rs
@@ -185,10 +185,10 @@ impl CompactSpaceBuilder {
        let mut covered_space = Vec::with_capacity(self.blanks.len());

        // beginning of the blanks
-        if let Some(first_blank_start) = self.blanks.first().map(RangeInclusive::start)
-            && *first_blank_start != 0
-        {
-            covered_space.push(0..=first_blank_start - 1);
+        if let Some(first_blank_start) = self.blanks.first().map(RangeInclusive::start) {
+            if *first_blank_start != 0 {
+                covered_space.push(0..=first_blank_start - 1);
+            }
        }

        // Between the blanks
@@ -202,10 +202,10 @@ impl CompactSpaceBuilder {
        covered_space.extend(between_blanks);

        // end of the blanks
-        if let Some(last_blank_end) = self.blanks.last().map(RangeInclusive::end)
-            && *last_blank_end != u128::MAX
-        {
-            covered_space.push(last_blank_end + 1..=u128::MAX);
+        if let Some(last_blank_end) = self.blanks.last().map(RangeInclusive::end) {
+            if *last_blank_end != u128::MAX {
+                covered_space.push(last_blank_end + 1..=u128::MAX);
+            }
        }

        if covered_space.is_empty() {
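Review note: the two hunks above replace `if let ... && cond` chains with nested `if let` / `if` blocks, which behave the same and also compile on editions that predate let chains. A minimal sketch of the covered-space computation follows. It assumes the blanks are sorted, non-overlapping, and separated by at least one value; `covered_space` is a hypothetical free function, not the crate's `CompactSpaceBuilder` method, and the "between the blanks" step is reconstructed from the comments rather than from the elided code.

```rust
use std::ops::RangeInclusive;

/// Hypothetical sketch: the value ranges NOT covered by any blank.
/// Assumes `blanks` is sorted, non-overlapping, and non-adjacent.
fn covered_space(blanks: &[RangeInclusive<u128>]) -> Vec<RangeInclusive<u128>> {
    let mut covered = Vec::with_capacity(blanks.len() + 1);

    // Before the first blank (nested form, no let chain needed).
    if let Some(first) = blanks.first() {
        if *first.start() != 0 {
            covered.push(0..=first.start() - 1);
        }
    }

    // Between consecutive blanks.
    for pair in blanks.windows(2) {
        let (left, right) = (&pair[0], &pair[1]);
        covered.push(left.end() + 1..=right.start() - 1);
    }

    // After the last blank.
    if let Some(last) = blanks.last() {
        if *last.end() != u128::MAX {
            covered.push(last.end() + 1..=u128::MAX);
        }
    }

    covered
}
```

With no blanks at all the sketch returns an empty vector; the real code handles that case in the `covered_space.is_empty()` branch that follows, which is not shown in this hunk.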
--- a/columnar/src/column_values/u128_based/compact_space/mod.rs
+++ b/columnar/src/column_values/u128_based/compact_space/mod.rs
@@ -24,8 +24,8 @@ use build_compact_space::get_compact_space;
 use common::{BinarySerializable, CountingWriter, OwnedBytes, VInt, VIntU128};
 use tantivy_bitpacker::{BitPacker, BitUnpacker};

-use crate::RowId;
 use crate::column_values::ColumnValues;
+use crate::RowId;

 /// The cost per blank is quite hard actually, since blanks are delta encoded, the actual cost of
 /// blanks depends on the number of blanks.
@@ -653,14 +653,12 @@ mod tests {
            ),
            &[3]
        );
-        assert!(
-            get_positions_for_value_range_helper(
-                &decomp,
-                99998u128..=99998u128,
-                complete_range.clone()
-            )
-            .is_empty()
-        );
+        assert!(get_positions_for_value_range_helper(
+            &decomp,
+            99998u128..=99998u128,
+            complete_range.clone()
+        )
+        .is_empty());
        assert_eq!(
            &get_positions_for_value_range_helper(
                &decomp,
--- a/columnar/src/column_values/u128_based/mod.rs
+++ b/columnar/src/column_values/u128_based/mod.rs
@@ -130,11 +130,11 @@ pub fn open_u128_as_compact_u64(mut bytes: OwnedBytes) -> io::Result<Arc<dyn Col
 #[cfg(test)]
 pub(crate) mod tests {
    use super::*;
-    use crate::column_values::CodecType;
    use crate::column_values::u64_based::{
-        ALL_U64_CODEC_TYPES, serialize_and_load_u64_based_column_values,
-        serialize_u64_based_column_values,
+        serialize_and_load_u64_based_column_values, serialize_u64_based_column_values,
+        ALL_U64_CODEC_TYPES,
    };
+    use crate::column_values::CodecType;

    #[test]
    fn test_serialize_deserialize_u128_header() {
--- a/columnar/src/column_values/u64_based/bitpacked.rs
+++ b/columnar/src/column_values/u64_based/bitpacked.rs
@@ -4,7 +4,7 @@ use std::ops::{Range, RangeInclusive};

 use common::{BinarySerializable, OwnedBytes};
 use fastdivide::DividerU64;
-use tantivy_bitpacker::{BitPacker, BitUnpacker, compute_num_bits};
+use tantivy_bitpacker::{compute_num_bits, BitPacker, BitUnpacker};

 use crate::column_values::u64_based::{ColumnCodec, ColumnCodecEstimator, ColumnStats};
 use crate::{ColumnValues, RowId};
@@ -23,7 +23,11 @@ const fn div_ceil(n: u64, q: NonZeroU64) -> u64 {
    // copied from unstable rust standard library.
    let d = n / q.get();
    let r = n % q.get();
-    if r > 0 { d + 1 } else { d }
+    if r > 0 {
+        d + 1
+    } else {
+        d
+    }
 }

 // The bitpacked codec applies a linear transformation `f` over data that are bitpacked.
@@ -105,7 +109,7 @@ impl ColumnCodecEstimator for BitpackedCodecEstimator {

    fn estimate(&self, stats: &ColumnStats) -> Option<u64> {
        let num_bits_per_value = num_bits(stats);
-        Some(stats.num_bytes() + (stats.num_rows as u64 * (num_bits_per_value as u64)).div_ceil(8))
+        Some(stats.num_bytes() + (stats.num_rows as u64 * (num_bits_per_value as u64) + 7) / 8)
    }

    fn serialize(
--- a/columnar/src/column_values/u64_based/blockwise_linear.rs
+++ b/columnar/src/column_values/u64_based/blockwise_linear.rs
@@ -4,12 +4,12 @@ use std::{io, iter};

 use common::{BinarySerializable, CountingWriter, DeserializeFrom, OwnedBytes};
 use fastdivide::DividerU64;
-use tantivy_bitpacker::{BitPacker, BitUnpacker, compute_num_bits};
+use tantivy_bitpacker::{compute_num_bits, BitPacker, BitUnpacker};

-use crate::MonotonicallyMappableToU64;
 use crate::column_values::u64_based::line::Line;
 use crate::column_values::u64_based::{ColumnCodec, ColumnCodecEstimator, ColumnStats};
 use crate::column_values::{ColumnValues, VecColumn};
+use crate::MonotonicallyMappableToU64;

 const BLOCK_SIZE: u32 = 512u32;

@@ -39,7 +39,7 @@ impl BinarySerializable for Block {
 }

 fn compute_num_blocks(num_vals: u32) -> u32 {
-    num_vals.div_ceil(BLOCK_SIZE)
+    (num_vals + BLOCK_SIZE - 1) / BLOCK_SIZE
 }

 pub struct BlockwiseLinearEstimator {
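Review note: several hunks in this diff swap `x.div_ceil(d)` for the manual `(x + d - 1) / d` form. The two agree whenever `x + d - 1` does not overflow. The sketch below compares them; `div_ceil_manual`, `div_ceil_std`, and `num_blocks` are illustrative names, and `num_blocks` only mirrors `compute_num_blocks` under the block size of 512 shown above.

```rust
/// Manual ceiling division, as on the `+` side of the hunk.
/// Caveat: `n + d - 1` overflows when `n` is close to `u32::MAX`.
fn div_ceil_manual(n: u32, d: u32) -> u32 {
    (n + d - 1) / d
}

/// Standard-library ceiling division, as on the `-` side of the hunk.
/// It does not form the intermediate sum, so it cannot overflow.
fn div_ceil_std(n: u32, d: u32) -> u32 {
    n.div_ceil(d)
}

/// Hypothetical stand-in for `compute_num_blocks` with BLOCK_SIZE = 512.
fn num_blocks(num_vals: u32) -> u32 {
    const BLOCK_SIZE: u32 = 512;
    div_ceil_std(num_vals, BLOCK_SIZE)
}
```

For row counts far below `u32::MAX` the choice is cosmetic; the overflow caveat only matters near the top of the integer range.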
--- a/columnar/src/column_values/u64_based/linear.rs
+++ b/columnar/src/column_values/u64_based/linear.rs
@@ -1,13 +1,13 @@
 use std::io;

 use common::{BinarySerializable, OwnedBytes};
-use tantivy_bitpacker::{BitPacker, BitUnpacker, compute_num_bits};
+use tantivy_bitpacker::{compute_num_bits, BitPacker, BitUnpacker};

-use super::ColumnValues;
 use super::line::Line;
-use crate::RowId;
-use crate::column_values::VecColumn;
+use super::ColumnValues;
 use crate::column_values::u64_based::{ColumnCodec, ColumnCodecEstimator, ColumnStats};
+use crate::column_values::VecColumn;
+use crate::RowId;

 const HALF_SPACE: u64 = u64::MAX / 2;
 const LINE_ESTIMATION_BLOCK_LEN: usize = 512;
@@ -117,7 +117,7 @@ impl ColumnCodecEstimator for LinearCodecEstimator {
        Some(
            stats.num_bytes()
                + linear_params.num_bytes()
-                + (num_bits as u64 * stats.num_rows as u64).div_ceil(8),
+                + (num_bits as u64 * stats.num_rows as u64 + 7) / 8,
        )
    }

--- a/columnar/src/column_values/u64_based/mod.rs
+++ b/columnar/src/column_values/u64_based/mod.rs
@@ -17,7 +17,7 @@ pub use crate::column_values::u64_based::bitpacked::BitpackedCodec;
 pub use crate::column_values::u64_based::blockwise_linear::BlockwiseLinearCodec;
 pub use crate::column_values::u64_based::linear::LinearCodec;
 pub use crate::column_values::u64_based::stats_collector::StatsCollector;
-use crate::column_values::{ColumnStats, monotonic_map_column};
+use crate::column_values::{monotonic_map_column, ColumnStats};
 use crate::iterable::Iterable;
 use crate::{ColumnValues, MonotonicallyMappableToU64};

--- a/columnar/src/column_values/u64_based/stats_collector.rs
+++ b/columnar/src/column_values/u64_based/stats_collector.rs
@@ -2,8 +2,8 @@ use std::num::NonZeroU64;

 use fastdivide::DividerU64;

-use crate::RowId;
 use crate::column_values::ColumnStats;
+use crate::RowId;

 /// Compute the gcd of two non null numbers.
 ///
@@ -96,8 +96,8 @@ impl StatsCollector {
 mod tests {
    use std::num::NonZeroU64;

+    use crate::column_values::u64_based::stats_collector::{compute_gcd, StatsCollector};
    use crate::column_values::u64_based::ColumnStats;
-    use crate::column_values::u64_based::stats_collector::{StatsCollector, compute_gcd};

    fn compute_stats(vals: impl Iterator<Item = u64>) -> ColumnStats {
        let mut stats_collector = StatsCollector::default();
--- a/columnar/src/column_values/u64_based/tests.rs
+++ b/columnar/src/column_values/u64_based/tests.rs
@@ -1,6 +1,5 @@
 use proptest::prelude::*;
 use proptest::{prop_oneof, proptest};
-use rand::Rng;

 #[test]
 fn test_serialize_and_load_simple() {
--- a/columnar/src/columnar/column_type.rs
+++ b/columnar/src/columnar/column_type.rs
@@ -4,8 +4,8 @@ use std::net::Ipv6Addr;

 use serde::{Deserialize, Serialize};

-use crate::InvalidData;
 use crate::value::NumericalType;
+use crate::InvalidData;

 /// The column type represents the column type.
 /// Any changes need to be propagated to `COLUMN_TYPES`.
--- a/columnar/src/columnar/merge/merge_dict_column.rs
+++ b/columnar/src/columnar/merge/merge_dict_column.rs
@@ -3,7 +3,7 @@ use std::io::{self, Write};
 use common::{BitSet, CountingWriter, ReadOnlyBitSet};
 use sstable::{SSTable, Streamer, TermOrdinal, VoidSSTable};

-use super::term_merger::{TermMerger, TermsWithSegmentOrd};
+use super::term_merger::TermMerger;
 use crate::column::serialize_column_mappable_to_u64;
 use crate::column_index::SerializableColumnIndex;
 use crate::iterable::Iterable;
@@ -39,7 +39,7 @@ struct RemappedTermOrdinalsValues<'a> {
    merge_row_order: &'a MergeRowOrder,
 }

-impl Iterable for RemappedTermOrdinalsValues<'_> {
+impl<'a> Iterable for RemappedTermOrdinalsValues<'a> {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = u64> + '_> {
        match self.merge_row_order {
            MergeRowOrder::Stack(_) => self.boxed_iter_stacked(),
@@ -50,7 +50,7 @@ impl Iterable for RemappedTermOrdinalsValues<'_> {
    }
 }

-impl RemappedTermOrdinalsValues<'_> {
+impl<'a> RemappedTermOrdinalsValues<'a> {
    fn boxed_iter_stacked(&self) -> Box<dyn Iterator<Item = u64> + '_> {
        let iter = self
            .bytes_columns
@@ -126,17 +126,14 @@ fn serialize_merged_dict(
    let mut term_ord_mapping = TermOrdinalMapping::default();

    let mut field_term_streams = Vec::new();
-    for (segment_ord, column_opt) in bytes_columns.iter().enumerate() {
+    for column_opt in bytes_columns.iter() {
        if let Some(column) = column_opt {
            term_ord_mapping.add_segment(column.dictionary.num_terms());
            let terms: Streamer<VoidSSTable> = column.dictionary.stream()?;
-            field_term_streams.push(TermsWithSegmentOrd { terms, segment_ord });
+            field_term_streams.push(terms);
        } else {
            term_ord_mapping.add_segment(0);
-            field_term_streams.push(TermsWithSegmentOrd {
-                terms: Streamer::empty(),
-                segment_ord,
-            });
+            field_term_streams.push(Streamer::empty());
        }
    }

@@ -194,7 +191,6 @@ fn serialize_merged_dict(

 #[derive(Default, Debug)]
 struct TermOrdinalMapping {
-    /// Contains the new term ordinals for each segment.
    per_segment_new_term_ordinals: Vec<Vec<TermOrdinal>>,
 }

@@ -209,6 +205,6 @@ impl TermOrdinalMapping {
    }

    fn get_segment(&self, segment_ord: u32) -> &[TermOrdinal] {
-        &self.per_segment_new_term_ordinals[segment_ord as usize]
+        &(self.per_segment_new_term_ordinals[segment_ord as usize])[..]
    }
 }
--- a/columnar/src/columnar/merge/merge_mapping.rs
+++ b/columnar/src/columnar/merge/merge_mapping.rs
@@ -26,7 +26,7 @@ impl StackMergeOrder {
        let mut cumulated_row_ids: Vec<RowId> = Vec::with_capacity(columnars.len());
        let mut cumulated_row_id = 0;
        for columnar in columnars {
-            cumulated_row_id += columnar.num_docs();
+            cumulated_row_id += columnar.num_rows();
            cumulated_row_ids.push(cumulated_row_id);
        }
        StackMergeOrder { cumulated_row_ids }
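Review note: the rename from `num_docs()` to `num_rows()` does not change what the loop computes, which is a running total of rows per input. A minimal sketch of that bookkeeping, assuming plain `u32` counts instead of `ColumnarReader`s; `cumulated_row_ids` and `row_range` are hypothetical helpers, not part of the crate.

```rust
use std::ops::Range;

/// Hypothetical sketch: exclusive end row id of each input after stacking.
fn cumulated_row_ids(num_rows_per_input: &[u32]) -> Vec<u32> {
    let mut cumulated = Vec::with_capacity(num_rows_per_input.len());
    let mut total = 0u32;
    for &num_rows in num_rows_per_input {
        total += num_rows;
        cumulated.push(total);
    }
    cumulated
}

/// Row-id range that input `input_ord` occupies in the stacked output.
fn row_range(cumulated: &[u32], input_ord: usize) -> Range<u32> {
    let start = if input_ord == 0 {
        0
    } else {
        cumulated[input_ord - 1]
    };
    start..cumulated[input_ord]
}
```

An input with zero rows still gets an entry, so its range is empty and the positions of later inputs stay aligned with their ordinals.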
--- a/columnar/src/columnar/merge/mod.rs
+++ b/columnar/src/columnar/merge/mod.rs
@@ -10,11 +10,11 @@ use std::sync::Arc;
 pub use merge_mapping::{MergeRowOrder, ShuffleMergeOrder, StackMergeOrder};

 use super::writer::ColumnarSerializer;
-use crate::column::{serialize_column_mappable_to_u64, serialize_column_mappable_to_u128};
+use crate::column::{serialize_column_mappable_to_u128, serialize_column_mappable_to_u64};
 use crate::column_values::MergedColumnValues;
-use crate::columnar::ColumnarReader;
 use crate::columnar::merge::merge_dict_column::merge_bytes_or_str_column;
 use crate::columnar::writer::CompatibleNumericalTypes;
+use crate::columnar::ColumnarReader;
 use crate::dynamic_column::DynamicColumn;
 use crate::{
    BytesColumn, Column, ColumnIndex, ColumnType, ColumnValues, DynamicColumnHandle, NumericalType,
@@ -80,12 +80,13 @@ pub fn merge_columnar(
    output: &mut impl io::Write,
 ) -> io::Result<()> {
    let mut serializer = ColumnarSerializer::new(output);
-    let num_docs_per_columnar = columnar_readers
+    let num_rows_per_columnar = columnar_readers
        .iter()
-        .map(|reader| reader.num_docs())
+        .map(|reader| reader.num_rows())
        .collect::<Vec<u32>>();

-    let columns_to_merge = group_columns_for_merge(columnar_readers, required_columns)?;
+    let columns_to_merge =
+        group_columns_for_merge(columnar_readers, required_columns, &merge_row_order)?;
    for res in columns_to_merge {
        let ((column_name, _column_type_category), grouped_columns) = res;
        let grouped_columns = grouped_columns.open(&merge_row_order)?;
@@ -93,18 +94,15 @@ pub fn merge_columnar(
            continue;
        }

-        let column_type_after_merge = grouped_columns.column_type_after_merge();
+        let column_type = grouped_columns.column_type_after_merge();
        let mut columns = grouped_columns.columns;
-        // Make sure the number of columns is the same as the number of columnar readers.
-        // Or num_docs_per_columnar would be incorrect.
-        assert_eq!(columns.len(), columnar_readers.len());
-        coerce_columns(column_type_after_merge, &mut columns)?;
+        coerce_columns(column_type, &mut columns)?;

        let mut column_serializer =
-            serializer.start_serialize_column(column_name.as_bytes(), column_type_after_merge);
+            serializer.start_serialize_column(column_name.as_bytes(), column_type);
        merge_column(
-            column_type_after_merge,
-            &num_docs_per_columnar,
+            column_type,
+            &num_rows_per_columnar,
            columns,
            &merge_row_order,
            &mut column_serializer,
@@ -130,7 +128,7 @@ fn dynamic_column_to_u64_monotonic(dynamic_column: DynamicColumn) -> Option<Colu
 fn merge_column(
    column_type: ColumnType,
    num_docs_per_column: &[u32],
-    columns_to_merge: Vec<Option<DynamicColumn>>,
+    columns: Vec<Option<DynamicColumn>>,
    merge_row_order: &MergeRowOrder,
    wrt: &mut impl io::Write,
 ) -> io::Result<()> {
@@ -140,21 +138,20 @@ fn merge_column(
        | ColumnType::F64
        | ColumnType::DateTime
        | ColumnType::Bool => {
-            let mut column_indexes: Vec<ColumnIndex> = Vec::with_capacity(columns_to_merge.len());
+            let mut column_indexes: Vec<ColumnIndex> = Vec::with_capacity(columns.len());
            let mut column_values: Vec<Option<Arc<dyn ColumnValues>>> =
-                Vec::with_capacity(columns_to_merge.len());
-            for (i, dynamic_column_opt) in columns_to_merge.into_iter().enumerate() {
-                match dynamic_column_opt.and_then(dynamic_column_to_u64_monotonic) {
-                    Some(Column { index: idx, values }) => {
-                        column_indexes.push(idx);
-                        column_values.push(Some(values));
-                    }
-                    None => {
-                        column_indexes.push(ColumnIndex::Empty {
-                            num_docs: num_docs_per_column[i],
-                        });
-                        column_values.push(None);
-                    }
+                Vec::with_capacity(columns.len());
+            for (i, dynamic_column_opt) in columns.into_iter().enumerate() {
+                if let Some(Column { index: idx, values }) =
+                    dynamic_column_opt.and_then(dynamic_column_to_u64_monotonic)
+                {
+                    column_indexes.push(idx);
+                    column_values.push(Some(values));
+                } else {
+                    column_indexes.push(ColumnIndex::Empty {
+                        num_docs: num_docs_per_column[i],
+                    });
+                    column_values.push(None);
                }
            }
            let merged_column_index =
@@ -167,10 +164,10 @@ fn merge_column(
            serialize_column_mappable_to_u64(merged_column_index, &merge_column_values, wrt)?;
        }
        ColumnType::IpAddr => {
-            let mut column_indexes: Vec<ColumnIndex> = Vec::with_capacity(columns_to_merge.len());
+            let mut column_indexes: Vec<ColumnIndex> = Vec::with_capacity(columns.len());
            let mut column_values: Vec<Option<Arc<dyn ColumnValues<Ipv6Addr>>>> =
-                Vec::with_capacity(columns_to_merge.len());
-            for (i, dynamic_column_opt) in columns_to_merge.into_iter().enumerate() {
+                Vec::with_capacity(columns.len());
+            for (i, dynamic_column_opt) in columns.into_iter().enumerate() {
                if let Some(DynamicColumn::IpAddr(Column { index: idx, values })) =
                    dynamic_column_opt
                {
@@ -195,10 +192,9 @@ fn merge_column(
            serialize_column_mappable_to_u128(merged_column_index, &merge_column_values, wrt)?;
        }
        ColumnType::Bytes | ColumnType::Str => {
-            let mut column_indexes: Vec<ColumnIndex> = Vec::with_capacity(columns_to_merge.len());
-            let mut bytes_columns: Vec<Option<BytesColumn>> =
-                Vec::with_capacity(columns_to_merge.len());
-            for (i, dynamic_column_opt) in columns_to_merge.into_iter().enumerate() {
+            let mut column_indexes: Vec<ColumnIndex> = Vec::with_capacity(columns.len());
+            let mut bytes_columns: Vec<Option<BytesColumn>> = Vec::with_capacity(columns.len());
+            for (i, dynamic_column_opt) in columns.into_iter().enumerate() {
                match dynamic_column_opt {
                    Some(DynamicColumn::Str(str_column)) => {
                        column_indexes.push(str_column.term_ord_column.index.clone());
@@ -252,15 +248,13 @@ impl GroupedColumns {
        if column_type.len() == 1 {
            return column_type.into_iter().next().unwrap();
        }
-        // At the moment, only the numerical column type category has more than one possible
+        // At the moment, only the numerical categorical column type has more than one possible
        // column type.
-        assert!(
-            self.columns
-                .iter()
-                .flatten()
-                .all(|el| ColumnTypeCategory::from(el.column_type())
-                    == ColumnTypeCategory::Numerical)
-        );
+        assert!(self
+            .columns
+            .iter()
+            .flatten()
+            .all(|el| ColumnTypeCategory::from(el.column_type()) == ColumnTypeCategory::Numerical));
        merged_numerical_columns_type(self.columns.iter().flatten()).into()
    }
 }
@@ -367,7 +361,7 @@ fn is_empty_after_merge(
                    ColumnIndex::Empty { .. } => true,
                    ColumnIndex::Full => alive_bitset.len() == 0,
                    ColumnIndex::Optional(optional_index) => {
-                        for doc in optional_index.iter_non_null_docs() {
+                        for doc in optional_index.iter_rows() {
                            if alive_bitset.contains(doc) {
                                return false;
                            }
@@ -397,6 +391,7 @@ fn is_empty_after_merge(
 fn group_columns_for_merge<'a>(
    columnar_readers: &'a [&'a ColumnarReader],
    required_columns: &'a [(String, ColumnType)],
+    _merge_row_order: &'a MergeRowOrder,
 ) -> io::Result<BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle>> {
    let mut columns: BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle> = BTreeMap::new();

--- a/columnar/src/columnar/merge/term_merger.rs
+++ b/columnar/src/columnar/merge/term_merger.rs
@@ -5,29 +5,28 @@ use sstable::TermOrdinal;

 use crate::Streamer;

-/// The terms of a column with the ordinal of the segment.
-pub struct TermsWithSegmentOrd<'a> {
-    pub terms: Streamer<'a>,
+pub struct HeapItem<'a> {
+    pub streamer: Streamer<'a>,
    pub segment_ord: usize,
 }

-impl PartialEq for TermsWithSegmentOrd<'_> {
+impl<'a> PartialEq for HeapItem<'a> {
    fn eq(&self, other: &Self) -> bool {
        self.segment_ord == other.segment_ord
    }
 }

-impl Eq for TermsWithSegmentOrd<'_> {}
+impl<'a> Eq for HeapItem<'a> {}

-impl<'a> PartialOrd for TermsWithSegmentOrd<'a> {
-    fn partial_cmp(&self, other: &TermsWithSegmentOrd<'a>) -> Option<Ordering> {
+impl<'a> PartialOrd for HeapItem<'a> {
+    fn partial_cmp(&self, other: &HeapItem<'a>) -> Option<Ordering> {
        Some(self.cmp(other))
    }
 }

-impl<'a> Ord for TermsWithSegmentOrd<'a> {
-    fn cmp(&self, other: &TermsWithSegmentOrd<'a>) -> Ordering {
-        (&other.terms.key(), &other.segment_ord).cmp(&(&self.terms.key(), &self.segment_ord))
+impl<'a> Ord for HeapItem<'a> {
+    fn cmp(&self, other: &HeapItem<'a>) -> Ordering {
+        (&other.streamer.key(), &other.segment_ord).cmp(&(&self.streamer.key(), &self.segment_ord))
    }
 }

@@ -38,32 +37,39 @@ impl<'a> Ord for TermsWithSegmentOrd<'a> {
 /// - the term
 /// - a slice with the ordinal of the segments containing the terms.
 pub struct TermMerger<'a> {
-    heap: BinaryHeap<TermsWithSegmentOrd<'a>>,
-    term_streams_with_segment: Vec<TermsWithSegmentOrd<'a>>,
+    heap: BinaryHeap<HeapItem<'a>>,
+    current_streamers: Vec<HeapItem<'a>>,
 }

 impl<'a> TermMerger<'a> {
    /// Stream of merged term dictionary
-    pub fn new(term_streams_with_segment: Vec<TermsWithSegmentOrd<'a>>) -> TermMerger<'a> {
+    pub fn new(streams: Vec<Streamer<'a>>) -> TermMerger<'a> {
        TermMerger {
            heap: BinaryHeap::new(),
-            term_streams_with_segment,
+            current_streamers: streams
+                .into_iter()
+                .enumerate()
+                .map(|(ord, streamer)| HeapItem {
+                    streamer,
+                    segment_ord: ord,
+                })
+                .collect(),
        }
    }

    pub(crate) fn matching_segments<'b: 'a>(
        &'b self,
    ) -> impl 'b + Iterator<Item = (usize, TermOrdinal)> {
-        self.term_streams_with_segment
+        self.current_streamers
            .iter()
-            .map(|heap_item| (heap_item.segment_ord, heap_item.terms.term_ord()))
+            .map(|heap_item| (heap_item.segment_ord, heap_item.streamer.term_ord()))
    }

    fn advance_segments(&mut self) {
-        let streamers = &mut self.term_streams_with_segment;
+        let streamers = &mut self.current_streamers;
        let heap = &mut self.heap;
        for mut heap_item in streamers.drain(..) {
-            if heap_item.terms.advance() {
+            if heap_item.streamer.advance() {
                heap.push(heap_item);
            }
        }
@@ -74,19 +80,18 @@ impl<'a> TermMerger<'a> {
    /// False if there is none.
    pub fn advance(&mut self) -> bool {
        self.advance_segments();
-        match self.heap.pop() {
-            Some(head) => {
-                self.term_streams_with_segment.push(head);
-                while let Some(next_streamer) = self.heap.peek() {
-                    if self.term_streams_with_segment[0].terms.key() != next_streamer.terms.key() {
-                        break;
-                    }
-                    let next_heap_it = self.heap.pop().unwrap(); // safe : we peeked beforehand
-                    self.term_streams_with_segment.push(next_heap_it);
+        if let Some(head) = self.heap.pop() {
+            self.current_streamers.push(head);
+            while let Some(next_streamer) = self.heap.peek() {
+                if self.current_streamers[0].streamer.key() != next_streamer.streamer.key() {
+                    break;
                }
-                true
+                let next_heap_it = self.heap.pop().unwrap(); // safe : we peeked beforehand
+                self.current_streamers.push(next_heap_it);
            }
-            _ => false,
+            true
+        } else {
+            false
        }
    }

@@ -96,6 +101,6 @@ impl<'a> TermMerger<'a> {
    /// if and only if advance() has been called before
    /// and "true" was returned.
    pub fn key(&self) -> &[u8] {
-        self.term_streams_with_segment[0].terms.key()
+        self.current_streamers[0].streamer.key()
    }
 }
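Review note: both versions of `TermMerger` rely on the same trick. `BinaryHeap` is a max-heap, so `Ord` is reversed on `(key, segment_ord)` to pop the smallest term first, and every stream sitting on that same term is drained before advancing. The sketch below illustrates the idea over in-memory sorted string slices; `Cursor` and `merge_terms` are hypothetical stand-ins, not the crate's `Streamer` or `TermMerger`.

```rust
use std::cmp::Ordering;
use std::collections::BinaryHeap;

/// Hypothetical stand-in for a term stream: a cursor over sorted terms.
struct Cursor<'a> {
    terms: &'a [&'a str],
    pos: usize,
    segment_ord: usize,
}

impl<'a> Cursor<'a> {
    fn key(&self) -> &'a str {
        self.terms[self.pos]
    }
}

impl PartialEq for Cursor<'_> {
    fn eq(&self, other: &Self) -> bool {
        self.cmp(other) == Ordering::Equal
    }
}

impl Eq for Cursor<'_> {}

impl PartialOrd for Cursor<'_> {
    fn partial_cmp(&self, other: &Self) -> Option<Ordering> {
        Some(self.cmp(other))
    }
}

impl Ord for Cursor<'_> {
    fn cmp(&self, other: &Self) -> Ordering {
        // Reversed comparison: the max-heap then pops the smallest key first.
        (other.key(), other.segment_ord).cmp(&(self.key(), self.segment_ord))
    }
}

/// Merges sorted term lists; yields each distinct term together with the
/// ordinals of the segments that contain it.
fn merge_terms(segments: &[&[&str]]) -> Vec<(String, Vec<usize>)> {
    let mut heap = BinaryHeap::new();
    for (segment_ord, &terms) in segments.iter().enumerate() {
        if !terms.is_empty() {
            heap.push(Cursor { terms, pos: 0, segment_ord });
        }
    }

    let mut merged = Vec::new();
    while let Some(head) = heap.pop() {
        let key = head.key();
        let mut current = vec![head];
        // Pull every other cursor that currently sits on the same term.
        while let Some(next) = heap.peek() {
            if next.key() != key {
                break;
            }
            current.push(heap.pop().unwrap()); // safe: we peeked beforehand
        }
        let segment_ords: Vec<usize> = current.iter().map(|c| c.segment_ord).collect();
        merged.push((key.to_string(), segment_ords));
        // Advance the consumed cursors and re-insert those with terms left.
        for mut cursor in current {
            cursor.pos += 1;
            if cursor.pos < cursor.terms.len() {
                heap.push(cursor);
            }
        }
    }
    merged
}
```

Including `segment_ord` in the comparison breaks ties deterministically, so segments sharing a term come out in ascending ordinal order.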
--- a/columnar/src/columnar/merge/tests.rs
+++ b/columnar/src/columnar/merge/tests.rs
@@ -1,10 +1,7 @@
 use itertools::Itertools;
-use proptest::collection::vec;
-use proptest::prelude::*;

 use super::*;
-use crate::columnar::{ColumnarReader, MergeRowOrder, StackMergeOrder, merge_columnar};
-use crate::{Cardinality, ColumnarWriter, DynamicColumn, HasAssociatedColumnType, RowId};
+use crate::{Cardinality, ColumnarWriter, HasAssociatedColumnType, RowId};

 fn make_columnar<T: Into<NumericalValue> + HasAssociatedColumnType + Copy>(
    column_name: &str,
@@ -29,8 +26,9 @@ fn test_column_coercion_to_u64() {
    // u64 type
    let columnar2 = make_columnar("numbers", &[u64::MAX]);
    let columnars = &[&columnar1, &columnar2];
+    let merge_order = StackMergeOrder::stack(columnars).into();
    let column_map: BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle> =
-        group_columns_for_merge(columnars, &[]).unwrap();
+        group_columns_for_merge(columnars, &[], &merge_order).unwrap();
    assert_eq!(column_map.len(), 1);
    assert!(column_map.contains_key(&("numbers".to_string(), ColumnTypeCategory::Numerical)));
 }
@@ -40,8 +38,9 @@ fn test_column_coercion_to_i64() {
    let columnar1 = make_columnar("numbers", &[-1i64]);
    let columnar2 = make_columnar("numbers", &[2u64]);
    let columnars = &[&columnar1, &columnar2];
+    let merge_order = StackMergeOrder::stack(columnars).into();
    let column_map: BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle> =
-        group_columns_for_merge(columnars, &[]).unwrap();
+        group_columns_for_merge(columnars, &[], &merge_order).unwrap();
    assert_eq!(column_map.len(), 1);
    assert!(column_map.contains_key(&("numbers".to_string(), ColumnTypeCategory::Numerical)));
 }
@@ -64,8 +63,14 @@ fn test_group_columns_with_required_column() {
    let columnar1 = make_columnar("numbers", &[1i64]);
    let columnar2 = make_columnar("numbers", &[2u64]);
    let columnars = &[&columnar1, &columnar2];
+    let merge_order = StackMergeOrder::stack(columnars).into();
    let column_map: BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle> =
-        group_columns_for_merge(columnars, &[("numbers".to_string(), ColumnType::U64)]).unwrap();
+        group_columns_for_merge(
+            &[&columnar1, &columnar2],
+            &[("numbers".to_string(), ColumnType::U64)],
+            &merge_order,
+        )
+        .unwrap();
    assert_eq!(column_map.len(), 1);
    assert!(column_map.contains_key(&("numbers".to_string(), ColumnTypeCategory::Numerical)));
 }
@@ -75,9 +80,13 @@ fn test_group_columns_required_column_with_no_existing_columns() {
    let columnar1 = make_columnar("numbers", &[2u64]);
    let columnar2 = make_columnar("numbers", &[2u64]);
    let columnars = &[&columnar1, &columnar2];
-    let column_map: BTreeMap<_, _> =
-        group_columns_for_merge(columnars, &[("required_col".to_string(), ColumnType::Str)])
-            .unwrap();
+    let merge_order = StackMergeOrder::stack(columnars).into();
+    let column_map: BTreeMap<_, _> = group_columns_for_merge(
+        columnars,
+        &[("required_col".to_string(), ColumnType::Str)],
+        &merge_order,
+    )
+    .unwrap();
    assert_eq!(column_map.len(), 2);
    let columns = &column_map
        .get(&("required_col".to_string(), ColumnTypeCategory::Str))
@@ -93,8 +102,14 @@ fn test_group_columns_required_column_is_above_all_columns_have_the_same_type_ru
    let columnar1 = make_columnar("numbers", &[2i64]);
    let columnar2 = make_columnar("numbers", &[2i64]);
    let columnars = &[&columnar1, &columnar2];
+    let merge_order = StackMergeOrder::stack(columnars).into();
    let column_map: BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle> =
-        group_columns_for_merge(columnars, &[("numbers".to_string(), ColumnType::U64)]).unwrap();
+        group_columns_for_merge(
+            columnars,
+            &[("numbers".to_string(), ColumnType::U64)],
+            &merge_order,
+        )
+        .unwrap();
    assert_eq!(column_map.len(), 1);
    assert!(column_map.contains_key(&("numbers".to_string(), ColumnTypeCategory::Numerical)));
 }
@@ -104,8 +119,9 @@ fn test_missing_column() {
    let columnar1 = make_columnar("numbers", &[-1i64]);
    let columnar2 = make_columnar("numbers2", &[2u64]);
    let columnars = &[&columnar1, &columnar2];
+    let merge_order = StackMergeOrder::stack(columnars).into();
    let column_map: BTreeMap<(String, ColumnTypeCategory), GroupedColumnsHandle> =
-        group_columns_for_merge(columnars, &[]).unwrap();
+        group_columns_for_merge(columnars, &[], &merge_order).unwrap();
    assert_eq!(column_map.len(), 2);
    assert!(column_map.contains_key(&("numbers".to_string(), ColumnTypeCategory::Numerical)));
    {
@@ -208,7 +224,7 @@ fn test_merge_columnar_numbers() {
    )
    .unwrap();
    let columnar_reader = ColumnarReader::open(buffer).unwrap();
-    assert_eq!(columnar_reader.num_docs(), 3);
+    assert_eq!(columnar_reader.num_rows(), 3);
    assert_eq!(columnar_reader.num_columns(), 1);
    let cols = columnar_reader.read_columns("numbers").unwrap();
    let dynamic_column = cols[0].open().unwrap();
@@ -236,7 +252,7 @@ fn test_merge_columnar_texts() {
    )
    .unwrap();
    let columnar_reader = ColumnarReader::open(buffer).unwrap();
-    assert_eq!(columnar_reader.num_docs(), 3);
+    assert_eq!(columnar_reader.num_rows(), 3);
    assert_eq!(columnar_reader.num_columns(), 1);
    let cols = columnar_reader.read_columns("texts").unwrap();
    let dynamic_column = cols[0].open().unwrap();
@@ -285,7 +301,7 @@ fn test_merge_columnar_byte() {
    )
    .unwrap();
    let columnar_reader = ColumnarReader::open(buffer).unwrap();
-    assert_eq!(columnar_reader.num_docs(), 4);
+    assert_eq!(columnar_reader.num_rows(), 4);
    assert_eq!(columnar_reader.num_columns(), 1);
    let cols = columnar_reader.read_columns("bytes").unwrap();
    let dynamic_column = cols[0].open().unwrap();
@@ -341,7 +357,7 @@ fn test_merge_columnar_byte_with_missing() {
    )
    .unwrap();
    let columnar_reader = ColumnarReader::open(buffer).unwrap();
-    assert_eq!(columnar_reader.num_docs(), 3 + 2 + 3);
+    assert_eq!(columnar_reader.num_rows(), 3 + 2 + 3);
    assert_eq!(columnar_reader.num_columns(), 2);
    let cols = columnar_reader.read_columns("col").unwrap();
    let dynamic_column = cols[0].open().unwrap();
@@ -393,7 +409,7 @@ fn test_merge_columnar_different_types() {
    )
    .unwrap();
    let columnar_reader = ColumnarReader::open(buffer).unwrap();
-    assert_eq!(columnar_reader.num_docs(), 4);
+    assert_eq!(columnar_reader.num_rows(), 4);
    assert_eq!(columnar_reader.num_columns(), 2);
    let cols = columnar_reader.read_columns("mixed").unwrap();

@@ -403,11 +419,11 @@ fn test_merge_columnar_different_types() {
        panic!()
    };
    assert_eq!(vals.get_cardinality(), Cardinality::Optional);
-    assert_eq!(vals.values_for_doc(0).collect_vec(), Vec::<i64>::new());
-    assert_eq!(vals.values_for_doc(1).collect_vec(), Vec::<i64>::new());
-    assert_eq!(vals.values_for_doc(2).collect_vec(), Vec::<i64>::new());
+    assert_eq!(vals.values_for_doc(0).collect_vec(), vec![]);
+    assert_eq!(vals.values_for_doc(1).collect_vec(), vec![]);
+    assert_eq!(vals.values_for_doc(2).collect_vec(), vec![]);
    assert_eq!(vals.values_for_doc(3).collect_vec(), vec![1]);
-    assert_eq!(vals.values_for_doc(4).collect_vec(), Vec::<i64>::new());
+    assert_eq!(vals.values_for_doc(4).collect_vec(), vec![]);

    // text column
    let dynamic_column = cols[1].open().unwrap();
@@ -458,7 +474,7 @@ fn test_merge_columnar_different_empty_cardinality() {
    )
    .unwrap();
    let columnar_reader = ColumnarReader::open(buffer).unwrap();
-    assert_eq!(columnar_reader.num_docs(), 2);
+    assert_eq!(columnar_reader.num_rows(), 2);
    assert_eq!(columnar_reader.num_columns(), 2);
    let cols = columnar_reader.read_columns("mixed").unwrap();

@@ -470,119 +486,3 @@ fn test_merge_columnar_different_empty_cardinality() {
    let dynamic_column = cols[1].open().unwrap();
    assert_eq!(dynamic_column.get_cardinality(), Cardinality::Optional);
 }
-
-#[derive(Debug, Clone)]
-struct ColumnSpec {
-    column_name: String,
-    /// (row_id, term)
-    terms: Vec<(RowId, Vec<u8>)>,
-}
-
-#[derive(Clone, Debug)]
-struct ColumnarSpec {
-    columns: Vec<ColumnSpec>,
-}
-
-/// Generate a random (row_id, term) pair:
-///  - row_id in [0..10]
-///  - term is either from POSSIBLE_TERMS or random bytes
-fn rowid_and_term_strategy() -> impl Strategy<Value = (RowId, Vec<u8>)> {
-    const POSSIBLE_TERMS: &[&[u8]] = &[b"a", b"b", b"allo"];
-
-    let term_strat = prop_oneof![
-        // pick from the fixed list
-        (0..POSSIBLE_TERMS.len()).prop_map(|i| POSSIBLE_TERMS[i].to_vec()),
-        // or random bytes (length 0..10)
-        prop::collection::vec(any::<u8>(), 0..10),
-    ];
-
-    (0u32..11, term_strat)
-}
-
-/// Generate one ColumnSpec, with a random name and a random list of (row_id, term).
-/// We sort it by row_id so that data is in ascending order.
-fn column_spec_strategy() -> impl Strategy<Value = ColumnSpec> {
-    let column_name = prop_oneof![
-        Just("col".to_string()),
-        Just("col2".to_string()),
-        "col.*".prop_map(|s| s),
-    ];
-
-    // We'll produce 0..8 (rowid,term) entries for this column
-    let data_strat = vec(rowid_and_term_strategy(), 0..8).prop_map(|mut pairs| {
-        // Sort by row_id
-        pairs.sort_by_key(|(row_id, _)| *row_id);
-        pairs
-    });
-
-    (column_name, data_strat).prop_map(|(name, data)| ColumnSpec {
-        column_name: name,
-        terms: data,
-    })
-}
-
-/// Strategy to generate an ColumnarSpec
-fn columnar_strategy() -> impl Strategy<Value = ColumnarSpec> {
-    vec(column_spec_strategy(), 0..3).prop_map(|columns| ColumnarSpec { columns })
-}
-
-/// Strategy to generate multiple ColumnarSpecs, each of which we will treat
-/// as one "columnar" to be merged together.
-fn columnars_strategy() -> impl Strategy<Value = Vec<ColumnarSpec>> {
-    vec(columnar_strategy(), 1..4)
-}
-
-/// Build a `ColumnarReader` from a `ColumnarSpec`
-fn build_columnar(spec: &ColumnarSpec) -> ColumnarReader {
-    let mut writer = ColumnarWriter::default();
-    let mut max_row_id = 0;
-    for col in &spec.columns {
-        for &(row_id, ref term) in &col.terms {
-            writer.record_bytes(row_id, &col.column_name, term);
-            max_row_id = max_row_id.max(row_id);
-        }
-    }
-
-    let mut buffer = Vec::new();
-    writer.serialize(max_row_id + 1, &mut buffer).unwrap();
-    ColumnarReader::open(buffer).unwrap()
-}
-
-proptest! {
-    // We just test that the merge_columnar function doesn't crash.
-    #![proptest_config(ProptestConfig::with_cases(256))]
-    #[test]
-    fn test_merge_columnar_bytes_no_crash(columnars in columnars_strategy(), second_merge_columnars in columnars_strategy()) {
-        let columnars: Vec<ColumnarReader> = columnars.iter()
-            .map(build_columnar)
-            .collect();
-
-        let mut out = Vec::new();
-        let columnar_refs: Vec<&ColumnarReader> = columnars.iter().collect();
-        let stack_merge_order = StackMergeOrder::stack(&columnar_refs);
-        merge_columnar(
-            &columnar_refs,
-            &[],
-            MergeRowOrder::Stack(stack_merge_order),
-            &mut out,
-        ).unwrap();
-
-        let merged_reader = ColumnarReader::open(out).unwrap();
-
-        // Merge the second set of columnars with the result of the first merge
-        let mut columnars: Vec<ColumnarReader> = second_merge_columnars.iter()
-            .map(build_columnar)
-            .collect();
-        columnars.push(merged_reader);
-        let mut out = Vec::new();
-        let columnar_refs: Vec<&ColumnarReader> = columnars.iter().collect();
-        let stack_merge_order = StackMergeOrder::stack(&columnar_refs);
-        merge_columnar(
-            &columnar_refs,
-            &[],
-            MergeRowOrder::Stack(stack_merge_order),
-            &mut out,
-        ).unwrap();
-
-    }
-}
--- a/columnar/src/columnar/mod.rs
+++ b/columnar/src/columnar/mod.rs
@@ -5,9 +5,9 @@ mod reader;
 mod writer;

 pub use column_type::{ColumnType, HasAssociatedColumnType};
-pub use format_version::{CURRENT_VERSION, Version};
+pub use format_version::{Version, CURRENT_VERSION};
 #[cfg(test)]
 pub(crate) use merge::ColumnTypeCategory;
-pub use merge::{MergeRowOrder, ShuffleMergeOrder, StackMergeOrder, merge_columnar};
+pub use merge::{merge_columnar, MergeRowOrder, ShuffleMergeOrder, StackMergeOrder};
 pub use reader::ColumnarReader;
 pub use writer::ColumnarWriter;
--- a/columnar/src/columnar/reader/mod.rs
+++ b/columnar/src/columnar/reader/mod.rs
@@ -1,11 +1,10 @@
 use std::{fmt, io, mem};

-use common::BinarySerializable;
 use common::file_slice::FileSlice;
-use common::json_path_writer::JSON_PATH_SEGMENT_SEP;
+use common::BinarySerializable;
 use sstable::{Dictionary, RangeSSTable};

-use crate::columnar::{ColumnType, format_version};
+use crate::columnar::{format_version, ColumnType};
 use crate::dynamic_column::DynamicColumnHandle;
 use crate::{RowId, Version};

@@ -19,13 +18,13 @@ fn io_invalid_data(msg: String) -> io::Error {
 pub struct ColumnarReader {
    column_dictionary: Dictionary<RangeSSTable>,
    column_data: FileSlice,
-    num_docs: RowId,
+    num_rows: RowId,
    format_version: Version,
 }

 impl fmt::Debug for ColumnarReader {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
-        let num_rows = self.num_docs();
+        let num_rows = self.num_rows();
        let columns = self.list_columns().unwrap();
        let num_cols = columns.len();
        let mut debug_struct = f.debug_struct("Columnar");
@@ -77,19 +76,6 @@ fn read_all_columns_in_stream(
    Ok(results)
 }

-fn column_dictionary_prefix_for_column_name(column_name: &str) -> String {
-    // Each column is a associated to a given `column_key`,
-    // that starts by `column_name\0column_header`.
-    //
-    // Listing the columns associated to the given column name is therefore equivalent to
-    // listing `column_key` with the prefix `column_name\0`.
-    format!("{}{}", column_name, '\0')
-}
-
-fn column_dictionary_prefix_for_subpath(root_path: &str) -> String {
-    format!("{}{}", root_path, JSON_PATH_SEGMENT_SEP as char)
-}
-
 impl ColumnarReader {
    /// Opens a new Columnar file.
    pub fn open<F>(file_slice: F) -> io::Result<ColumnarReader>
@@ -112,13 +98,13 @@ impl ColumnarReader {
        Ok(ColumnarReader {
            column_dictionary,
            column_data,
-            num_docs: num_rows,
+            num_rows,
            format_version,
        })
    }

-    pub fn num_docs(&self) -> RowId {
-        self.num_docs
+    pub fn num_rows(&self) -> RowId {
+        self.num_rows
    }
    // Iterate over the columns in a sorted way
    pub fn iter_columns(
@@ -158,14 +144,32 @@ impl ColumnarReader {
        Ok(self.iter_columns()?.collect())
    }

+    fn stream_for_column_range(&self, column_name: &str) -> sstable::StreamerBuilder<RangeSSTable> {
+        // Each column is associated with a given `column_key`,
+        // that starts with `column_name\0column_header`.
+        //
+        // Listing the columns associated to the given column name is therefore equivalent to
+        // listing `column_key` with the prefix `column_name\0`.
+        //
+        // This is in turn equivalent to searching for the half-open range
+        // `[column_name\0..column_name\1)`.
+        // TODO: can we get some more generic `prefix(..)` logic in the dictionary?
+        let mut start_key = column_name.to_string();
+        start_key.push('\0');
+        let mut end_key = column_name.to_string();
+        end_key.push(1u8 as char);
+        self.column_dictionary
+            .range()
+            .ge(start_key.as_bytes())
+            .lt(end_key.as_bytes())
+    }
+
    pub async fn read_columns_async(
        &self,
        column_name: &str,
    ) -> io::Result<Vec<DynamicColumnHandle>> {
-        let prefix = column_dictionary_prefix_for_column_name(column_name);
        let stream = self
-            .column_dictionary
-            .prefix_range(prefix)
+            .stream_for_column_range(column_name)
            .into_stream_async()
            .await?;
        read_all_columns_in_stream(stream, &self.column_data, self.format_version)
@@ -176,35 +180,7 @@ impl ColumnarReader {
    /// There can be more than one column associated to a given column name, provided they have
    /// different types.
    pub fn read_columns(&self, column_name: &str) -> io::Result<Vec<DynamicColumnHandle>> {
-        let prefix = column_dictionary_prefix_for_column_name(column_name);
-        let stream = self.column_dictionary.prefix_range(prefix).into_stream()?;
-        read_all_columns_in_stream(stream, &self.column_data, self.format_version)
-    }
-
-    pub async fn read_subpath_columns_async(
-        &self,
-        root_path: &str,
-    ) -> io::Result<Vec<DynamicColumnHandle>> {
-        let prefix = column_dictionary_prefix_for_subpath(root_path);
-        let stream = self
-            .column_dictionary
-            .prefix_range(prefix)
-            .into_stream_async()
-            .await?;
-        read_all_columns_in_stream(stream, &self.column_data, self.format_version)
-    }
-
-    /// Get all inner columns for a given JSON prefix, i.e columns for which the name starts
-    /// with the prefix then contain the [`JSON_PATH_SEGMENT_SEP`].
-    ///
-    /// There can be more than one column associated to each path within the JSON structure,
-    /// provided they have different types.
-    pub fn read_subpath_columns(&self, root_path: &str) -> io::Result<Vec<DynamicColumnHandle>> {
-        let prefix = column_dictionary_prefix_for_subpath(root_path);
-        let stream = self
-            .column_dictionary
-            .prefix_range(prefix.as_bytes())
-            .into_stream()?;
+        let stream = self.stream_for_column_range(column_name).into_stream()?;
        read_all_columns_in_stream(stream, &self.column_data, self.format_version)
    }

@@ -216,8 +192,6 @@ impl ColumnarReader {

 #[cfg(test)]
 mod tests {
-    use common::json_path_writer::JSON_PATH_SEGMENT_SEP;
-
    use crate::{ColumnType, ColumnarReader, ColumnarWriter};

    #[test]
@@ -250,64 +224,6 @@ mod tests {
        assert_eq!(columns[0].1.column_type(), ColumnType::U64);
    }

-    #[test]
-    fn test_read_columns() {
-        let mut columnar_writer = ColumnarWriter::default();
-        columnar_writer.record_column_type("col", ColumnType::U64, false);
-        columnar_writer.record_numerical(1, "col", 1u64);
-        let mut buffer = Vec::new();
-        columnar_writer.serialize(2, &mut buffer).unwrap();
-        let columnar = ColumnarReader::open(buffer).unwrap();
-        {
-            let columns = columnar.read_columns("col").unwrap();
-            assert_eq!(columns.len(), 1);
-            assert_eq!(columns[0].column_type(), ColumnType::U64);
-        }
-        {
-            let columns = columnar.read_columns("other").unwrap();
-            assert_eq!(columns.len(), 0);
-        }
-    }
-
-    #[test]
-    fn test_read_subpath_columns() {
-        let mut columnar_writer = ColumnarWriter::default();
-        columnar_writer.record_str(
-            0,
-            &format!("col1{}subcol1", JSON_PATH_SEGMENT_SEP as char),
-            "hello",
-        );
-        columnar_writer.record_numerical(
-            0,
-            &format!("col1{}subcol2", JSON_PATH_SEGMENT_SEP as char),
-            1i64,
-        );
-        columnar_writer.record_str(1, "col1", "hello");
-        columnar_writer.record_str(0, "col2", "hello");
-        let mut buffer = Vec::new();
-        columnar_writer.serialize(2, &mut buffer).unwrap();
-
-        let columnar = ColumnarReader::open(buffer).unwrap();
-        {
-            let columns = columnar.read_subpath_columns("col1").unwrap();
-            assert_eq!(columns.len(), 2);
-            assert_eq!(columns[0].column_type(), ColumnType::Str);
-            assert_eq!(columns[1].column_type(), ColumnType::I64);
-        }
-        {
-            let columns = columnar.read_subpath_columns("col1.subcol1").unwrap();
-            assert_eq!(columns.len(), 0);
-        }
-        {
-            let columns = columnar.read_subpath_columns("col2").unwrap();
-            assert_eq!(columns.len(), 0);
-        }
-        {
-            let columns = columnar.read_subpath_columns("other").unwrap();
-            assert_eq!(columns.len(), 0);
-        }
-    }
-
    #[test]
    #[should_panic(expected = "Input type forbidden")]
    fn test_list_columns_strict_typing_panics_on_wrong_types() {
--- a/columnar/src/columnar/writer/column_operation.rs
+++ b/columnar/src/columnar/writer/column_operation.rs
@@ -244,7 +244,7 @@ impl SymbolValue for UnorderedId {

 fn compute_num_bytes_for_u64(val: u64) -> usize {
    let msb = (64u32 - val.leading_zeros()) as usize;
-    msb.div_ceil(8)
+    (msb + 7) / 8
 }

 fn encode_zig_zag(n: i64) -> u64 {
--- a/columnar/src/columnar/writer/column_writers.rs
+++ b/columnar/src/columnar/writer/column_writers.rs
@@ -42,7 +42,7 @@ impl ColumnWriter {
        &self,
        arena: &MemoryArena,
        buffer: &'a mut Vec<u8>,
-    ) -> impl Iterator<Item = ColumnOperation<V>> + 'a + use<'a, V> {
+    ) -> impl Iterator<Item = ColumnOperation<V>> + 'a {
        buffer.clear();
        self.values.read_to_end(arena, buffer);
        let mut cursor: &[u8] = &buffer[..];
@@ -104,10 +104,9 @@ pub(crate) struct NumericalColumnWriter {

 impl NumericalColumnWriter {
    pub fn force_numerical_type(&mut self, numerical_type: NumericalType) {
-        assert!(
-            self.compatible_numerical_types
-                .is_type_accepted(numerical_type)
-        );
+        assert!(self
+            .compatible_numerical_types
+            .is_type_accepted(numerical_type));
        self.compatible_numerical_types = CompatibleNumericalTypes::StaticType(numerical_type);
    }
 }
@@ -212,7 +211,7 @@ impl NumericalColumnWriter {
        self,
        arena: &MemoryArena,
        buffer: &'a mut Vec<u8>,
-    ) -> impl Iterator<Item = ColumnOperation<NumericalValue>> + 'a + use<'a> {
+    ) -> impl Iterator<Item = ColumnOperation<NumericalValue>> + 'a {
        self.column_writer.operation_iterator(arena, buffer)
    }
 }
@@ -256,7 +255,7 @@ impl StrOrBytesColumnWriter {
        &self,
        arena: &MemoryArena,
        byte_buffer: &'a mut Vec<u8>,
-    ) -> impl Iterator<Item = ColumnOperation<UnorderedId>> + 'a + use<'a> {
+    ) -> impl Iterator<Item = ColumnOperation<UnorderedId>> + 'a {
        self.column_writer.operation_iterator(arena, byte_buffer)
    }
 }
--- a/columnar/src/columnar/writer/mod.rs
+++ b/columnar/src/columnar/writer/mod.rs
@@ -8,13 +8,13 @@ use std::net::Ipv6Addr;

 use column_operation::ColumnOperation;
 pub(crate) use column_writers::CompatibleNumericalTypes;
-use common::CountingWriter;
 use common::json_path_writer::JSON_END_OF_PATH;
+use common::CountingWriter;
 pub(crate) use serializer::ColumnarSerializer;
 use stacker::{Addr, ArenaHashMap, MemoryArena};

 use crate::column_index::{SerializableColumnIndex, SerializableOptionalIndex};
-use crate::column_values::{MonotonicallyMappableToU64, MonotonicallyMappableToU128};
+use crate::column_values::{MonotonicallyMappableToU128, MonotonicallyMappableToU64};
 use crate::columnar::column_type::ColumnType;
 use crate::columnar::writer::column_writers::{
    ColumnWriter, NumericalColumnWriter, StrOrBytesColumnWriter,
@@ -285,6 +285,7 @@ impl ColumnarWriter {
                .map(|(column_name, addr)| (column_name, ColumnType::DateTime, addr)),
        );
        columns.sort_unstable_by_key(|(column_name, col_type, _)| (*column_name, *col_type));
+
        let (arena, buffers, dictionaries) = (&self.arena, &mut self.buffers, &self.dictionaries);
        let mut symbol_byte_buffer: Vec<u8> = Vec::new();
        for (column_name, column_type, addr) in columns {
--- a/columnar/src/columnar/writer/serializer.rs
+++ b/columnar/src/columnar/writer/serializer.rs
@@ -3,11 +3,11 @@ use std::io::Write;

 use common::json_path_writer::JSON_END_OF_PATH;
 use common::{BinarySerializable, CountingWriter};
-use sstable::RangeSSTable;
 use sstable::value::RangeValueWriter;
+use sstable::RangeSSTable;

-use crate::RowId;
 use crate::columnar::ColumnType;
+use crate::RowId;

 pub struct ColumnarSerializer<W: io::Write> {
    wrt: CountingWriter<W>,
@@ -67,7 +67,7 @@ pub struct ColumnSerializer<'a, W: io::Write> {
    start_offset: u64,
 }

-impl<W: io::Write> ColumnSerializer<'_, W> {
+impl<'a, W: io::Write> ColumnSerializer<'a, W> {
    pub fn finalize(self) -> io::Result<()> {
        let end_offset: u64 = self.columnar_serializer.wrt.written_bytes();
        let byte_range = self.start_offset..end_offset;
@@ -80,7 +80,7 @@ impl<W: io::Write> ColumnSerializer<'_, W> {
    }
 }

-impl<W: io::Write> io::Write for ColumnSerializer<'_, W> {
+impl<'a, W: io::Write> io::Write for ColumnSerializer<'a, W> {
    fn write(&mut self, buf: &[u8]) -> io::Result<usize> {
        self.columnar_serializer.wrt.write(buf)
    }
--- a/columnar/src/columnar/writer/value_index.rs
+++ b/columnar/src/columnar/writer/value_index.rs
@@ -1,6 +1,6 @@
-use crate::RowId;
 use crate::column_index::{SerializableMultivalueIndex, SerializableOptionalIndex};
 use crate::iterable::Iterable;
+use crate::RowId;

 /// The `IndexBuilder` interprets a sequence of
 /// calls of the form:
@@ -31,13 +31,12 @@ pub struct OptionalIndexBuilder {

 impl OptionalIndexBuilder {
    pub fn finish(&mut self, num_rows: RowId) -> impl Iterable<RowId> + '_ {
-        debug_assert!(
-            self.docs
-                .last()
-                .copied()
-                .map(|last_doc| last_doc < num_rows)
-                .unwrap_or(true)
-        );
+        debug_assert!(self
+            .docs
+            .last()
+            .copied()
+            .map(|last_doc| last_doc < num_rows)
+            .unwrap_or(true));
        &self.docs[..]
    }

@@ -49,13 +48,12 @@ impl OptionalIndexBuilder {
 impl IndexBuilder for OptionalIndexBuilder {
    #[inline(always)]
    fn record_row(&mut self, doc: RowId) {
-        debug_assert!(
-            self.docs
-                .last()
-                .copied()
-                .map(|prev_doc| doc > prev_doc)
-                .unwrap_or(true)
-        );
+        debug_assert!(self
+            .docs
+            .last()
+            .copied()
+            .map(|prev_doc| doc > prev_doc)
+            .unwrap_or(true));
        self.docs.push(doc);
    }
 }
--- a/columnar/src/compat_tests.rs
+++ b/columnar/src/compat_tests.rs
@@ -3,8 +3,8 @@ use std::path::PathBuf;
 use itertools::Itertools;

 use crate::{
-    CURRENT_VERSION, Cardinality, Column, ColumnarReader, DynamicColumn, StackMergeOrder,
-    merge_columnar,
+    merge_columnar, Cardinality, Column, ColumnarReader, DynamicColumn, StackMergeOrder,
+    CURRENT_VERSION,
 };

 const NUM_DOCS: u32 = u16::MAX as u32;
--- a/columnar/src/dynamic_column.rs
+++ b/columnar/src/dynamic_column.rs
@@ -6,7 +6,7 @@ use common::file_slice::FileSlice;
 use common::{ByteCount, DateTime, HasLen, OwnedBytes};

 use crate::column::{BytesColumn, Column, StrColumn};
-use crate::column_values::{StrictlyMonotonicFn, monotonic_map_column};
+use crate::column_values::{monotonic_map_column, StrictlyMonotonicFn};
 use crate::columnar::ColumnType;
 use crate::{Cardinality, ColumnIndex, ColumnValues, NumericalType, Version};

--- a/columnar/src/iterable.rs
+++ b/columnar/src/iterable.rs
@@ -7,7 +7,7 @@ pub trait Iterable<T = u64> {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = T> + '_>;
 }

-impl<T: Copy> Iterable<T> for &[T] {
+impl<'a, T: Copy> Iterable<T> for &'a [T] {
    fn boxed_iter(&self) -> Box<dyn Iterator<Item = T> + '_> {
        Box::new(self.iter().copied())
    }
--- a/columnar/src/lib.rs
+++ b/columnar/src/lib.rs
@@ -17,10 +17,15 @@
 //!       column.
 //!     - [column_values]: Stores the values of a column in a dense format.

+#![cfg_attr(all(feature = "unstable", test), feature(test))]
+
 #[cfg(test)]
 #[macro_use]
 extern crate more_asserts;

+#[cfg(all(test, feature = "unstable"))]
+extern crate test;
+
 use std::fmt::Display;
 use std::io;

@@ -39,11 +44,11 @@ pub use block_accessor::ColumnBlockAccessor;
 pub use column::{BytesColumn, Column, StrColumn};
 pub use column_index::ColumnIndex;
 pub use column_values::{
-    ColumnValues, EmptyColumnValues, MonotonicallyMappableToU64, MonotonicallyMappableToU128,
+    ColumnValues, EmptyColumnValues, MonotonicallyMappableToU128, MonotonicallyMappableToU64,
 };
 pub use columnar::{
-    CURRENT_VERSION, ColumnType, ColumnarReader, ColumnarWriter, HasAssociatedColumnType,
-    MergeRowOrder, ShuffleMergeOrder, StackMergeOrder, Version, merge_columnar,
+    merge_columnar, ColumnType, ColumnarReader, ColumnarWriter, HasAssociatedColumnType,
+    MergeRowOrder, ShuffleMergeOrder, StackMergeOrder, Version, CURRENT_VERSION,
 };
 use sstable::VoidSSTable;
 pub use value::{NumericalType, NumericalValue};
--- a/columnar/src/tests.rs
+++ b/columnar/src/tests.rs
@@ -380,7 +380,7 @@ fn assert_columnar_eq(
    right: &ColumnarReader,
    lenient_on_numerical_value: bool,
 ) {
-    assert_eq!(left.num_docs(), right.num_docs());
+    assert_eq!(left.num_rows(), right.num_rows());
    let left_columns = left.list_columns().unwrap();
    let right_columns = right.list_columns().unwrap();
    assert_eq!(left_columns.len(), right_columns.len());
@@ -588,7 +588,7 @@ proptest! {
    #[test]
    fn test_single_columnar_builder_proptest(docs in columnar_docs_strategy()) {
        let columnar = build_columnar(&docs[..]);
-        assert_eq!(columnar.num_docs() as usize, docs.len());
+        assert_eq!(columnar.num_rows() as usize, docs.len());
        let mut expected_columns: HashMap<(&str, ColumnTypeCategory), HashMap<u32, Vec<&ColumnValue>> > = Default::default();
        for (doc_id, doc_vals) in docs.iter().enumerate() {
            for (col_name, col_val) in doc_vals {
@@ -715,9 +715,8 @@ fn test_columnar_merging_number_columns() {
 // TODO test required_columns
 // TODO document edge case: required_columns incompatible with values.

-#[allow(clippy::type_complexity)]
-fn columnar_docs_and_remap()
-> impl Strategy<Value = (Vec<Vec<Vec<(&'static str, ColumnValue)>>>, Vec<RowAddr>)> {
+fn columnar_docs_and_remap(
+) -> impl Strategy<Value = (Vec<Vec<Vec<(&'static str, ColumnValue)>>>, Vec<RowAddr>)> {
    proptest::collection::vec(columnar_docs_strategy(), 2..=3).prop_flat_map(
        |columnars_docs: Vec<Vec<Vec<(&str, ColumnValue)>>>| {
            let row_addrs: Vec<RowAddr> = columnars_docs
@@ -820,7 +819,7 @@ fn test_columnar_merge_empty() {
    )
    .unwrap();
    let merged_columnar = ColumnarReader::open(output).unwrap();
-    assert_eq!(merged_columnar.num_docs(), 0);
+    assert_eq!(merged_columnar.num_rows(), 0);
    assert_eq!(merged_columnar.num_columns(), 0);
 }

@@ -846,7 +845,7 @@ fn test_columnar_merge_single_str_column() {
    )
    .unwrap();
    let merged_columnar = ColumnarReader::open(output).unwrap();
-    assert_eq!(merged_columnar.num_docs(), 1);
+    assert_eq!(merged_columnar.num_rows(), 1);
    assert_eq!(merged_columnar.num_columns(), 1);
 }

@@ -878,7 +877,7 @@ fn test_delete_decrease_cardinality() {
    )
    .unwrap();
    let merged_columnar = ColumnarReader::open(output).unwrap();
-    assert_eq!(merged_columnar.num_docs(), 1);
+    assert_eq!(merged_columnar.num_rows(), 1);
    assert_eq!(merged_columnar.num_columns(), 1);
    let cols = merged_columnar.read_columns("c").unwrap();
    assert_eq!(cols.len(), 1);
--- a/columnar/src/value.rs
+++ b/columnar/src/value.rs
@@ -1,5 +1,3 @@
-use std::str::FromStr;
-
 use common::DateTime;

 use crate::InvalidData;
@@ -11,23 +9,6 @@ pub enum NumericalValue {
    F64(f64),
 }

-impl FromStr for NumericalValue {
-    type Err = ();
-
-    fn from_str(s: &str) -> Result<Self, ()> {
-        if let Ok(val_i64) = s.parse::<i64>() {
-            return Ok(val_i64.into());
-        }
-        if let Ok(val_u64) = s.parse::<u64>() {
-            return Ok(val_u64.into());
-        }
-        if let Ok(val_f64) = s.parse::<f64>() {
-            return Ok(NumericalValue::from(val_f64).normalize());
-        }
-        Err(())
-    }
-}
-
 impl NumericalValue {
    pub fn numerical_type(&self) -> NumericalType {
        match self {
@@ -45,7 +26,7 @@ impl NumericalValue {
                if val <= i64::MAX as u64 {
                    NumericalValue::I64(val as i64)
                } else {
-                    NumericalValue::U64(val)
+                    NumericalValue::F64(val as f64)
                }
            }
            NumericalValue::I64(val) => NumericalValue::I64(val),
@@ -160,7 +141,6 @@ impl Coerce for DateTime {
 #[cfg(test)]
 mod tests {
    use super::NumericalType;
-    use crate::NumericalValue;

    #[test]
    fn test_numerical_type_code() {
@@ -173,58 +153,4 @@ mod tests {
        }
        assert_eq!(num_numerical_type, 3);
    }
-
-    #[test]
-    fn test_parse_numerical() {
-        assert_eq!(
-            "123".parse::<NumericalValue>().unwrap(),
-            NumericalValue::I64(123)
-        );
-        assert_eq!(
-            "18446744073709551615".parse::<NumericalValue>().unwrap(),
-            NumericalValue::U64(18446744073709551615u64)
-        );
-        assert_eq!(
-            "1.0".parse::<NumericalValue>().unwrap(),
-            NumericalValue::I64(1i64)
-        );
-        assert_eq!(
-            "1.1".parse::<NumericalValue>().unwrap(),
-            NumericalValue::F64(1.1f64)
-        );
-        assert_eq!(
-            "-1.0".parse::<NumericalValue>().unwrap(),
-            NumericalValue::I64(-1i64)
-        );
-    }
-
-    #[test]
-    fn test_normalize_numerical() {
-        assert_eq!(
-            NumericalValue::from(1u64).normalize(),
-            NumericalValue::I64(1i64),
-        );
-        let limit_val = i64::MAX as u64 + 1u64;
-        assert_eq!(
-            NumericalValue::from(limit_val).normalize(),
-            NumericalValue::U64(limit_val),
-        );
-        assert_eq!(
-            NumericalValue::from(-1i64).normalize(),
-            NumericalValue::I64(-1i64),
-        );
-        assert_eq!(
-            NumericalValue::from(-2.0f64).normalize(),
-            NumericalValue::I64(-2i64),
-        );
-        assert_eq!(
-            NumericalValue::from(-2.1f64).normalize(),
-            NumericalValue::F64(-2.1f64),
-        );
-        let large_float = 2.0f64.powf(70.0f64);
-        assert_eq!(
-            NumericalValue::from(large_float).normalize(),
-            NumericalValue::F64(large_float),
-        );
-    }
 }
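
Reviewer note on the `normalize` hunk in `columnar/src/value.rs`: the `-` side keeps a `u64` above `i64::MAX` as `U64`, while the `+` side casts it to `F64`. The sketch below illustrates what that cast can cost in precision. It uses a local stand-in enum, not the crate's real `NumericalValue`; the names `Num`, `normalize_keep_u64` and `normalize_to_f64` are hypothetical and exist only for this illustration.

```rust
// Stand-in for the three numerical variants shown in the diff (hypothetical type).
#[derive(Debug, PartialEq)]
enum Num {
    I64(i64),
    U64(u64),
    F64(f64),
}

/// Mirrors the `-` side of the hunk: large values stay `U64`.
fn normalize_keep_u64(val: u64) -> Num {
    if val <= i64::MAX as u64 {
        Num::I64(val as i64)
    } else {
        Num::U64(val)
    }
}

/// Mirrors the `+` side of the hunk: large values are cast to `F64`.
fn normalize_to_f64(val: u64) -> Num {
    if val <= i64::MAX as u64 {
        Num::I64(val as i64)
    } else {
        Num::F64(val as f64)
    }
}

fn main() {
    // Small values take the same path on both sides.
    assert_eq!(normalize_keep_u64(1), Num::I64(1));
    assert_eq!(normalize_to_f64(1), Num::I64(1));

    // Above i64::MAX the two sides diverge.
    assert_eq!(normalize_keep_u64(u64::MAX), Num::U64(u64::MAX));
    // An f64 has a 53-bit significand, so u64::MAX rounds up to 2^64.
    assert_eq!(normalize_to_f64(u64::MAX), Num::F64(18446744073709551616.0));
    // Distinct u64 inputs can therefore collapse to the same F64.
    assert_eq!(normalize_to_f64(u64::MAX), normalize_to_f64(u64::MAX - 1));
}
```

If exact round-tripping of large unsigned values matters to callers, this is the behavior to check when comparing the two sides of the diff.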
--- a/common/Cargo.toml
+++ b/common/Cargo.toml
@@ -1,9 +1,9 @@
 [package]
 name = "tantivy-common"
-version = "0.10.0"
+version = "0.7.0"
 authors = ["Paul Masurel <paul@quickwit.io>", "Pascal Seitz <pascal@quickwit.io>"]
 license = "MIT"
-edition = "2024"
+edition = "2021"
 description = "common traits and utility functions used by multiple tantivy subcrates"
 documentation = "https://docs.rs/tantivy_common/"
 homepage = "https://github.com/quickwit-oss/tantivy"
@@ -13,7 +13,7 @@ repository = "https://github.com/quickwit-oss/tantivy"

 [dependencies]
 byteorder = "1.4.3"
-ownedbytes = { version= "0.9", path="../ownedbytes" }
+ownedbytes = { version= "0.7", path="../ownedbytes" }
 async-trait = "0.1"
 time = { version = "0.3.10", features = ["serde-well-known"] }
 serde = { version = "1.0.136", features = ["derive"] }
--- a/common/benches/bench.rs
+++ b/common/benches/bench.rs
@@ -1,7 +1,7 @@
-use binggan::{BenchRunner, black_box};
+use binggan::{black_box, BenchRunner};
 use rand::seq::IteratorRandom;
 use rand::thread_rng;
-use tantivy_common::{BitSet, TinySet, serialize_vint_u32};
+use tantivy_common::{serialize_vint_u32, BitSet, TinySet};

 fn bench_vint() {
    let mut runner = BenchRunner::new();
--- a/common/src/bitset.rs
+++ b/common/src/bitset.rs
@@ -183,7 +183,7 @@ pub struct BitSet {
 }

 fn num_buckets(max_val: u32) -> u32 {
-    max_val.div_ceil(64u32)
+    (max_val + 63u32) / 64u32
 }

 impl BitSet {
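
Reviewer note on the `num_buckets` hunk in `common/src/bitset.rs`: the two spellings agree for ordinary inputs but differ once `max_val + 63` exceeds `u32::MAX`. The sketch below compares them. The function names are hypothetical, and the additive form is written with `checked_add` (an assumption made for this illustration) so the overflow shows up as `None`; the plain `+` in the diff would instead panic in debug builds and wrap in release builds.

```rust
/// Mirrors the `-` side: `u32::div_ceil`, which does not overflow.
fn num_buckets_div_ceil(max_val: u32) -> u32 {
    max_val.div_ceil(64u32)
}

/// Mirrors the `+` side, with `checked_add` substituted for `+`
/// so that overflow is reported instead of panicking or wrapping.
fn num_buckets_add(max_val: u32) -> Option<u32> {
    max_val.checked_add(63u32).map(|v| v / 64u32)
}

fn main() {
    // Both forms agree wherever the addition does not overflow.
    for v in [0u32, 1, 63, 64, 65, 1_000_000] {
        assert_eq!(Some(num_buckets_div_ceil(v)), num_buckets_add(v));
    }

    // Near the top of the u32 range only `div_ceil` yields a result.
    assert_eq!(num_buckets_div_ceil(u32::MAX), 67_108_864);
    assert_eq!(num_buckets_add(u32::MAX), None);
}
```

Whether the overflow case is reachable depends on how callers bound `max_val`, which this hunk does not show.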
--- a/common/src/bounds.rs
+++ b/common/src/bounds.rs
@@ -65,11 +65,11 @@ pub fn transform_bound_inner_res<TFrom, TTo>(
 ) -> io::Result<Bound<TTo>> {
    use self::Bound::*;
    Ok(match bound {
-        Excluded(from_val) => match transform(from_val)? {
+        Excluded(ref from_val) => match transform(from_val)? {
            TransformBound::NewBound(new_val) => new_val,
            TransformBound::Existing(new_val) => Excluded(new_val),
        },
-        Included(from_val) => match transform(from_val)? {
+        Included(ref from_val) => match transform(from_val)? {
            TransformBound::NewBound(new_val) => new_val,
            TransformBound::Existing(new_val) => Included(new_val),
        },
@@ -85,11 +85,11 @@ pub fn transform_bound_inner<TFrom, TTo>(
 ) -> Bound<TTo> {
    use self::Bound::*;
    match bound {
-        Excluded(from_val) => match transform(from_val) {
+        Excluded(ref from_val) => match transform(from_val) {
            TransformBound::NewBound(new_val) => new_val,
            TransformBound::Existing(new_val) => Excluded(new_val),
        },
-        Included(from_val) => match transform(from_val) {
+        Included(ref from_val) => match transform(from_val) {
            TransformBound::NewBound(new_val) => new_val,
            TransformBound::Existing(new_val) => Included(new_val),
        },
@@ -111,8 +111,8 @@ pub fn map_bound<TFrom, TTo>(
 ) -> Bound<TTo> {
    use self::Bound::*;
    match bound {
-        Excluded(from_val) => Bound::Excluded(transform(from_val)),
-        Included(from_val) => Bound::Included(transform(from_val)),
+        Excluded(ref from_val) => Bound::Excluded(transform(from_val)),
+        Included(ref from_val) => Bound::Included(transform(from_val)),
        Unbounded => Unbounded,
    }
 }
@@ -123,8 +123,8 @@ pub fn map_bound_res<TFrom, TTo, Err>(
 ) -> Result<Bound<TTo>, Err> {
    use self::Bound::*;
    Ok(match bound {
-        Excluded(from_val) => Excluded(transform(from_val)?),
-        Included(from_val) => Included(transform(from_val)?),
+        Excluded(ref from_val) => Excluded(transform(from_val)?),
+        Included(ref from_val) => Included(transform(from_val)?),
        Unbounded => Unbounded,
    })
 }
--- a/common/src/file_slice.rs
+++ b/common/src/file_slice.rs
@@ -1,6 +1,5 @@
 use std::fs::File;
 use std::ops::{Deref, Range, RangeBounds};
-use std::path::Path;
 use std::sync::Arc;
 use std::{fmt, io};

@@ -74,7 +73,7 @@ impl FileHandle for WrapFile {
        {
            use std::io::{Read, Seek};
            let mut file = self.file.try_clone()?; // Clone the file to read from it separately
-            // Seek to the start position in the file
+                                                   // Seek to the start position in the file
            file.seek(io::SeekFrom::Start(start as u64))?;
            // Read the data into the buffer
            file.read_exact(&mut buffer)?;
@@ -178,12 +177,6 @@ fn combine_ranges<R: RangeBounds<usize>>(orig_range: Range<usize>, rel_range: R)
 }

 impl FileSlice {
-    /// Creates a FileSlice from a path.
-    pub fn open(path: &Path) -> io::Result<FileSlice> {
-        let wrap_file = WrapFile::new(File::open(path)?)?;
-        Ok(FileSlice::new(Arc::new(wrap_file)))
-    }
-
    /// Wraps a FileHandle.
    pub fn new(file_handle: Arc<dyn FileHandle>) -> Self {
        let num_bytes = file_handle.len();
@@ -346,8 +339,8 @@ mod tests {
    use std::sync::Arc;

    use super::{FileHandle, FileSlice};
-    use crate::HasLen;
    use crate::file_slice::combine_ranges;
+    use crate::HasLen;

    #[test]
    fn test_file_slice() -> io::Result<()> {
--- a/common/src/lib.rs
+++ b/common/src/lib.rs
@@ -22,7 +22,7 @@ pub use json_path_writer::JsonPathWriter;
 pub use ownedbytes::{OwnedBytes, StableDeref};
 pub use serialize::{BinarySerializable, DeserializeFrom, FixedSize};
 pub use vint::{
-    VInt, VIntU128, read_u32_vint, read_u32_vint_no_advance, serialize_vint_u32, write_u32_vint,
+    read_u32_vint, read_u32_vint_no_advance, serialize_vint_u32, write_u32_vint, VInt, VIntU128,
 };
 pub use writer::{AntiCallToken, CountingWriter, TerminatingWrite};

@@ -177,10 +177,8 @@ pub(crate) mod test {

    #[test]
    fn test_f64_order() {
-        assert!(
-            !(f64_to_u64(f64::NEG_INFINITY)..f64_to_u64(f64::INFINITY))
-                .contains(&f64_to_u64(f64::NAN))
-        ); // nan is not a number
+        assert!(!(f64_to_u64(f64::NEG_INFINITY)..f64_to_u64(f64::INFINITY))
+            .contains(&f64_to_u64(f64::NAN))); // nan is not a number
        assert!(f64_to_u64(1.5) > f64_to_u64(1.0)); // same exponent, different mantissa
        assert!(f64_to_u64(2.0) > f64_to_u64(1.0)); // same mantissa, different exponent
        assert!(f64_to_u64(2.0) > f64_to_u64(1.5)); // different exponent and mantissa
--- a/common/src/vint.rs
+++ b/common/src/vint.rs
@@ -222,7 +222,7 @@ impl BinarySerializable for VInt {
 #[cfg(test)]
 mod tests {

-    use super::{BinarySerializable, VInt, serialize_vint_u32};
+    use super::{serialize_vint_u32, BinarySerializable, VInt};

    fn aux_test_vint(val: u64) {
        let mut v = [14u8; 10];
--- a/common/src/writer.rs
+++ b/common/src/writer.rs
@@ -87,7 +87,7 @@ impl<W: TerminatingWrite> TerminatingWrite for BufWriter<W> {
    }
 }

-impl TerminatingWrite for &mut Vec<u8> {
+impl<'a> TerminatingWrite for &'a mut Vec<u8> {
    fn terminate_ref(&mut self, _a: AntiCallToken) -> io::Result<()> {
        self.flush()
    }
--- a/doc/assets/images/paradedb.png
+++ b/doc/assets/images/paradedb.png
--- a/doc/assets/images/searchbenchmark.png
+++ b/doc/assets/images/searchbenchmark.png
--- a/examples/basic_search.rs
+++ b/examples/basic_search.rs
@@ -51,7 +51,7 @@ fn main() -> tantivy::Result<()> {

    // Our second field is body.
    // We want full-text search for it, but we do not
-    // need to be able to retrieve it
+    // need to be able to be able to retrieve it
    // for our application.
    //
    // We can make our index lighter by omitting the `STORED` flag.
--- a/examples/filter_aggregation.rs
+++ b/examples/filter_aggregation.rs
@@ -1,212 +0,0 @@
-// # Filter Aggregation Example
-//
-// This example demonstrates filter aggregations - creating buckets of documents
-// matching specific queries, with nested aggregations computed on each bucket.
-//
-// Filter aggregations are useful for computing metrics on different subsets of
-// your data in a single query, like "average price overall + average price for
-// electronics + count of in-stock items".
-
-use serde_json::json;
-use tantivy::aggregation::agg_req::Aggregations;
-use tantivy::aggregation::AggregationCollector;
-use tantivy::query::AllQuery;
-use tantivy::schema::{Schema, FAST, INDEXED, TEXT};
-use tantivy::{doc, Index};
-
-fn main() -> tantivy::Result<()> {
-    // Create a simple product schema
-    let mut schema_builder = Schema::builder();
-    schema_builder.add_text_field("category", TEXT | FAST);
-    schema_builder.add_text_field("brand", TEXT | FAST);
-    schema_builder.add_u64_field("price", FAST);
-    schema_builder.add_f64_field("rating", FAST);
-    schema_builder.add_bool_field("in_stock", FAST | INDEXED);
-    let schema = schema_builder.build();
-
-    // Create index and add sample products
-    let index = Index::create_in_ram(schema.clone());
-    let mut writer = index.writer(50_000_000)?;
-
-    writer.add_document(doc!(
-        schema.get_field("category")? => "electronics",
-        schema.get_field("brand")? => "apple",
-        schema.get_field("price")? => 999u64,
-        schema.get_field("rating")? => 4.5f64,
-        schema.get_field("in_stock")? => true
-    ))?;
-    writer.add_document(doc!(
-        schema.get_field("category")? => "electronics",
-        schema.get_field("brand")? => "samsung",
-        schema.get_field("price")? => 799u64,
-        schema.get_field("rating")? => 4.2f64,
-        schema.get_field("in_stock")? => true
-    ))?;
-    writer.add_document(doc!(
-        schema.get_field("category")? => "clothing",
-        schema.get_field("brand")? => "nike",
-        schema.get_field("price")? => 120u64,
-        schema.get_field("rating")? => 4.1f64,
-        schema.get_field("in_stock")? => false
-    ))?;
-    writer.add_document(doc!(
-        schema.get_field("category")? => "books",
-        schema.get_field("brand")? => "penguin",
-        schema.get_field("price")? => 25u64,
-        schema.get_field("rating")? => 4.8f64,
-        schema.get_field("in_stock")? => true
-    ))?;
-
-    writer.commit()?;
-
-    let reader = index.reader()?;
-    let searcher = reader.searcher();
-
-    // Example 1: Basic filter with metric aggregation
-    println!("=== Example 1: Electronics average price ===");
-    let agg_req = json!({
-        "electronics": {
-            "filter": "category:electronics",
-            "aggs": {
-                "avg_price": { "avg": { "field": "price" } }
-            }
-        }
-    });
-
-    let agg: Aggregations = serde_json::from_value(agg_req)?;
-    let collector = AggregationCollector::from_aggs(agg, Default::default());
-    let result = searcher.search(&AllQuery, &collector)?;
-
-    let expected = json!({
-        "electronics": {
-            "doc_count": 2,
-            "avg_price": { "value": 899.0 }
-        }
-    });
-    assert_eq!(serde_json::to_value(&result)?, expected);
-    println!("{}\n", serde_json::to_string_pretty(&result)?);
-
-    // Example 2: Multiple independent filters
-    println!("=== Example 2: Multiple filters in one query ===");
-    let agg_req = json!({
-        "electronics": {
-            "filter": "category:electronics",
-            "aggs": { "avg_price": { "avg": { "field": "price" } } }
-        },
-        "in_stock": {
-            "filter": "in_stock:true",
-            "aggs": { "count": { "value_count": { "field": "brand" } } }
-        },
-        "high_rated": {
-            "filter": "rating:[4.5 TO *]",
-            "aggs": { "count": { "value_count": { "field": "brand" } } }
-        }
-    });
-
-    let agg: Aggregations = serde_json::from_value(agg_req)?;
-    let collector = AggregationCollector::from_aggs(agg, Default::default());
-    let result = searcher.search(&AllQuery, &collector)?;
-
-    let expected = json!({
-        "electronics": {
-            "doc_count": 2,
-            "avg_price": { "value": 899.0 }
-        },
-        "in_stock": {
-            "doc_count": 3,
-            "count": { "value": 3.0 }
-        },
-        "high_rated": {
-            "doc_count": 2,
-            "count": { "value": 2.0 }
-        }
-    });
-    assert_eq!(serde_json::to_value(&result)?, expected);
-    println!("{}\n", serde_json::to_string_pretty(&result)?);
-
-    // Example 3: Nested filters - progressive refinement
-    println!("=== Example 3: Nested filters ===");
-    let agg_req = json!({
-        "in_stock": {
-            "filter": "in_stock:true",
-            "aggs": {
-                "electronics": {
-                    "filter": "category:electronics",
-                    "aggs": {
-                        "expensive": {
-                            "filter": "price:[800 TO *]",
-                            "aggs": {
-                                "avg_rating": { "avg": { "field": "rating" } }
-                            }
-                        }
-                    }
-                }
-            }
-        }
-    });
-
-    let agg: Aggregations = serde_json::from_value(agg_req)?;
-    let collector = AggregationCollector::from_aggs(agg, Default::default());
-    let result = searcher.search(&AllQuery, &collector)?;
-
-    let expected = json!({
-        "in_stock": {
-            "doc_count": 3,  // apple, samsung, penguin
-            "electronics": {
-                "doc_count": 2,  // apple, samsung
-                "expensive": {
-                    "doc_count": 1,  // only apple (999)
-                    "avg_rating": { "value": 4.5 }
-                }
-            }
-        }
-    });
-    assert_eq!(serde_json::to_value(&result)?, expected);
-    println!("{}\n", serde_json::to_string_pretty(&result)?);
-
-    // Example 4: Filter with sub-aggregation (terms)
-    println!("=== Example 4: Filter with terms sub-aggregation ===");
-    let agg_req = json!({
-        "electronics": {
-            "filter": "category:electronics",
-            "aggs": {
-                "by_brand": {
-                    "terms": { "field": "brand" },
-                    "aggs": {
-                        "avg_price": { "avg": { "field": "price" } }
-                    }
-                }
-            }
-        }
-    });
-
-    let agg: Aggregations = serde_json::from_value(agg_req)?;
-    let collector = AggregationCollector::from_aggs(agg, Default::default());
-    let result = searcher.search(&AllQuery, &collector)?;
-
-    let expected = json!({
-        "electronics": {
-            "doc_count": 2,
-            "by_brand": {
-                "buckets": [
-                    {
-                        "key": "samsung",
-                        "doc_count": 1,
-                        "avg_price": { "value": 799.0 }
-                    },
-                    {
-                        "key": "apple",
-                        "doc_count": 1,
-                        "avg_price": { "value": 999.0 }
-                    }
-                ],
-                "sum_other_doc_count": 0,
-                "doc_count_error_upper_bound": 0
-            }
-        }
-    });
-    assert_eq!(serde_json::to_value(&result)?, expected);
-    println!("{}", serde_json::to_string_pretty(&result)?);
-
-    Ok(())
-}
--- a/examples/fuzzy_search.rs
+++ b/examples/fuzzy_search.rs
@@ -85,6 +85,7 @@ fn main() -> tantivy::Result<()> {
    index_writer.add_document(doc!(
        title => "The Diary of a Young Girl",
    ))?;
+    index_writer.commit()?;

    // ### Committing
    //
--- a/ownedbytes/Cargo.toml
+++ b/ownedbytes/Cargo.toml
@@ -1,7 +1,7 @@
 [package]
 authors = ["Paul Masurel <paul@quickwit.io>", "Pascal Seitz <pascal@quickwit.io>"]
 name = "ownedbytes"
-version = "0.9.0"
+version = "0.7.0"
 edition = "2021"
 description = "Expose data as static slice"
 license = "MIT"
--- a/query-grammar/Cargo.toml
+++ b/query-grammar/Cargo.toml
@@ -1,6 +1,6 @@
 [package]
 name = "tantivy-query-grammar"
-version = "0.25.0"
+version = "0.22.0"
 authors = ["Paul Masurel <paul.masurel@gmail.com>"]
 license = "MIT"
 categories = ["database-implementations", "data-structures"]
@@ -9,11 +9,7 @@ homepage = "https://github.com/quickwit-oss/tantivy"
 repository = "https://github.com/quickwit-oss/tantivy"
 readme = "README.md"
 keywords = ["search", "information", "retrieval"]
-edition = "2024"
+edition = "2021"

 [dependencies]
 nom = "7"
-serde = { version = "1.0.219", features = ["derive"] }
-serde_json = "1.0.140"
-ordered-float = "5.0.0"
-fnv = "1.0.7"
--- a/query-grammar/src/infallible.rs
+++ b/query-grammar/src/infallible.rs
@@ -3,7 +3,6 @@
 use std::convert::Infallible;

 use nom::{AsChar, IResult, InputLength, InputTakeAtPosition};
-use serde::Serialize;

 pub(crate) type ErrorList = Vec<LenientErrorInternal>;
 pub(crate) type JResult<I, O> = IResult<I, (O, ErrorList), Infallible>;
@@ -16,8 +15,7 @@ pub(crate) struct LenientErrorInternal {
 }

 /// A recoverable error and the position it happened at
-#[derive(Debug, PartialEq, Serialize)]
-#[serde(rename_all = "snake_case")]
+#[derive(Debug, PartialEq)]
 pub struct LenientError {
    pub pos: usize,
    pub message: String,
@@ -117,22 +115,6 @@ where F: nom::Parser<I, (O, ErrorList), Infallible> {
    }
 }

-pub(crate) fn terminated_infallible<I, O1, O2, F, G>(
-    mut first: F,
-    mut second: G,
-) -> impl FnMut(I) -> JResult<I, O1>
-where
-    F: nom::Parser<I, (O1, ErrorList), Infallible>,
-    G: nom::Parser<I, (O2, ErrorList), Infallible>,
-{
-    move |input: I| {
-        let (input, (o1, mut err)) = first.parse(input)?;
-        let (input, (_, mut err2)) = second.parse(input)?;
-        err.append(&mut err2);
-        Ok((input, (o1, err)))
-    }
-}
-
 pub(crate) fn delimited_infallible<I, O1, O2, O3, F, G, H>(
    mut first: F,
    mut second: G,
@@ -202,19 +184,19 @@ macro_rules! tuple_trait_impl(
 );

 macro_rules! tuple_trait_inner(
-  ($it:tt, $self:expr_2021, $input:expr_2021, (), $error_list:expr_2021, $head:ident $($id:ident)+) => ({
+  ($it:tt, $self:expr, $input:expr, (), $error_list:expr, $head:ident $($id:ident)+) => ({
    let (i, (o, mut err)) = $self.$it.parse($input.clone())?;
    $error_list.append(&mut err);

    succ!($it, tuple_trait_inner!($self, i, ( o ), $error_list, $($id)+))
  });
-  ($it:tt, $self:expr_2021, $input:expr_2021, ($($parsed:tt)*), $error_list:expr_2021, $head:ident $($id:ident)+) => ({
+  ($it:tt, $self:expr, $input:expr, ($($parsed:tt)*), $error_list:expr, $head:ident $($id:ident)+) => ({
    let (i, (o, mut err)) = $self.$it.parse($input.clone())?;
    $error_list.append(&mut err);

    succ!($it, tuple_trait_inner!($self, i, ($($parsed)* , o), $error_list, $($id)+))
  });
-  ($it:tt, $self:expr_2021, $input:expr_2021, ($($parsed:tt)*), $error_list:expr_2021, $head:ident) => ({
+  ($it:tt, $self:expr, $input:expr, ($($parsed:tt)*), $error_list:expr, $head:ident) => ({
    let (i, (o, mut err)) = $self.$it.parse($input.clone())?;
    $error_list.append(&mut err);

@@ -344,13 +326,13 @@ macro_rules! alt_trait_impl(
 );

 macro_rules! alt_trait_inner(
-  ($it:tt, $self:expr_2021, $input:expr_2021, $head_cond:ident $head:ident, $($id_cond:ident $id:ident),+) => (
+  ($it:tt, $self:expr, $input:expr, $head_cond:ident $head:ident, $($id_cond:ident $id:ident),+) => (
    match $self.$it.0.parse($input.clone()) {
      Err(_) => succ!($it, alt_trait_inner!($self, $input, $($id_cond $id),+)),
      Ok((input_left, _)) => Some($self.$it.1.parse(input_left)),
    }
  );
-  ($it:tt, $self:expr_2021, $input:expr_2021, $head_cond:ident $head:ident) => (
+  ($it:tt, $self:expr, $input:expr, $head_cond:ident $head:ident) => (
    None
  );
 );
@@ -371,21 +353,3 @@ where
 {
    move |i: I| l.choice(i.clone()).unwrap_or_else(|| default.parse(i))
 }
-
-#[cfg(test)]
-mod tests {
-    use super::*;
-
-    #[test]
-    fn test_lenient_error_serialization() {
-        let error = LenientError {
-            pos: 42,
-            message: "test error message".to_string(),
-        };
-
-        assert_eq!(
-            serde_json::to_string(&error).unwrap(),
-            "{\"pos\":42,\"message\":\"test error message\"}"
-        );
-    }
-}
--- a/query-grammar/src/lib.rs
+++ b/query-grammar/src/lib.rs
@@ -1,7 +1,5 @@
 #![allow(clippy::derive_partial_eq_without_eq)]

-use serde::Serialize;
-
 mod infallible;
 mod occur;
 mod query_grammar;
@@ -14,8 +12,6 @@ pub use crate::user_input_ast::{
    Delimiter, UserInputAst, UserInputBound, UserInputLeaf, UserInputLiteral,
 };

-#[derive(Debug, Serialize)]
-#[serde(rename_all = "snake_case")]
 pub struct Error;

 /// Parse a query
@@ -28,41 +24,3 @@ pub fn parse_query(query: &str) -> Result<UserInputAst, Error> {
 pub fn parse_query_lenient(query: &str) -> (UserInputAst, Vec<LenientError>) {
    parse_to_ast_lenient(query)
 }
-
-#[cfg(test)]
-mod tests {
-    use crate::{UserInputAst, parse_query, parse_query_lenient};
-
-    #[test]
-    fn test_deduplication() {
-        let ast: UserInputAst = parse_query("a a").unwrap();
-        let json = serde_json::to_string(&ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"bool","clauses":[[null,{"type":"literal","field_name":null,"phrase":"a","delimiter":"none","slop":0,"prefix":false}]]}"#
-        );
-    }
-
-    #[test]
-    fn test_parse_query_serialization() {
-        let ast = parse_query("title:hello OR title:x").unwrap();
-        let json = serde_json::to_string(&ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"bool","clauses":[["should",{"type":"literal","field_name":"title","phrase":"hello","delimiter":"none","slop":0,"prefix":false}],["should",{"type":"literal","field_name":"title","phrase":"x","delimiter":"none","slop":0,"prefix":false}]]}"#
-        );
-    }
-
-    #[test]
-    fn test_parse_query_wrong_query() {
-        assert!(parse_query("title:").is_err());
-    }
-
-    #[test]
-    fn test_parse_query_lenient_wrong_query() {
-        let (_, errors) = parse_query_lenient("title:");
-        assert!(errors.len() == 1);
-        let json = serde_json::to_string(&errors).unwrap();
-        assert_eq!(json, r#"[{"pos":6,"message":"expected word"}]"#);
-    }
-}
--- a/query-grammar/src/occur.rs
+++ b/query-grammar/src/occur.rs
@@ -1,12 +1,9 @@
 use std::fmt;
 use std::fmt::Write;

-use serde::Serialize;
-
 /// Defines whether a term in a query must be present,
 /// should be present or must not be present.
-#[derive(Debug, Clone, Hash, Copy, Eq, PartialEq, Serialize)]
-#[serde(rename_all = "snake_case")]
+#[derive(Debug, Clone, Hash, Copy, Eq, PartialEq)]
 pub enum Occur {
    /// For a given document to be considered for scoring,
    /// at least one of the queries with the Should or the Must
--- a/query-grammar/src/query_grammar.rs
+++ b/query-grammar/src/query_grammar.rs
@@ -1,8 +1,6 @@
 use std::borrow::Cow;
 use std::iter::once;

-use fnv::FnvHashSet;
-use nom::IResult;
 use nom::branch::alt;
 use nom::bytes::complete::tag;
 use nom::character::complete::{
@@ -12,11 +10,12 @@ use nom::combinator::{eof, map, map_res, opt, peek, recognize, value, verify};
 use nom::error::{Error, ErrorKind};
 use nom::multi::{many0, many1, separated_list0};
 use nom::sequence::{delimited, preceded, separated_pair, terminated, tuple};
+use nom::IResult;

 use super::user_input_ast::{UserInputAst, UserInputBound, UserInputLeaf, UserInputLiteral};
-use crate::Occur;
 use crate::infallible::*;
 use crate::user_input_ast::Delimiter;
+use crate::Occur;

 // Note: '-' char is only forbidden at the beginning of a field name, would be clearer to add it to
 // special characters.
@@ -37,7 +36,7 @@ fn field_name(inp: &str) -> IResult<&str, String> {
                alt((first_char, escape_sequence())),
                many0(alt((simple_char, escape_sequence(), char('\\')))),
            )),
-            tuple((multispace0, char(':'), multispace0)),
+            char(':'),
        ),
        |(first_char, next)| once(first_char).chain(next).collect(),
    )(inp)
@@ -69,7 +68,7 @@ fn interpret_escape(source: &str) -> String {

 /// Consume a word outside of any context.
 // TODO should support escape sequences
-fn word(inp: &str) -> IResult<&str, Cow<'_, str>> {
+fn word(inp: &str) -> IResult<&str, Cow<str>> {
    map_res(
        recognize(tuple((
            alt((
@@ -306,14 +305,15 @@ fn term_group_infallible(inp: &str) -> JResult<&str, UserInputAst> {
    let (inp, (field_name, _, _, _)) =
        tuple((field_name, multispace0, char('('), multispace0))(inp).expect("precondition failed");

-    delimited_infallible(
+    let res = delimited_infallible(
        nothing,
        map(ast_infallible, |(mut ast, errors)| {
            ast.set_default_field(field_name.to_string());
            (ast, errors)
        }),
        opt_i_err(char(')'), "expected ')'"),
-    )(inp)
+    )(inp);
+    res
 }

 fn exists(inp: &str) -> IResult<&str, UserInputLeaf> {
@@ -321,17 +321,7 @@ fn exists(inp: &str) -> IResult<&str, UserInputLeaf> {
        UserInputLeaf::Exists {
            field: String::new(),
        },
-        tuple((
-            multispace0,
-            char('*'),
-            peek(alt((
-                value(
-                    "",
-                    satisfy(|c: char| c.is_whitespace() || ESCAPE_IN_WORD.contains(&c)),
-                ),
-                eof,
-            ))),
-        )),
+        tuple((multispace0, char('*'))),
    )(inp)
 }

@@ -341,14 +331,7 @@ fn exists_precond(inp: &str) -> IResult<&str, (), ()> {
        peek(tuple((
            field_name,
            multispace0,
-            char('*'),
-            peek(alt((
-                value(
-                    "",
-                    satisfy(|c: char| c.is_whitespace() || ESCAPE_IN_WORD.contains(&c)),
-                ),
-                eof,
-            ))), // we need to check this isn't a wildcard query
+            char('*'), // when we are here, we know it can't be anything but a exists
        ))),
    )(inp)
    .map_err(|e| e.map(|_| ()))
@@ -367,10 +350,7 @@ fn literal(inp: &str) -> IResult<&str, UserInputAst> {
    // something (a field name) got parsed before
    alt((
        map(
-            tuple((
-                opt(field_name),
-                alt((range, set, exists, regex, term_or_phrase)),
-            )),
+            tuple((opt(field_name), alt((range, set, exists, term_or_phrase)))),
            |(field_name, leaf): (Option<String>, UserInputLeaf)| leaf.set_field(field_name).into(),
        ),
        term_group,
@@ -392,10 +372,6 @@ fn literal_no_group_infallible(inp: &str) -> JResult<&str, Option<UserInputAst>>
                        value((), peek(one_of("{[><"))),
                        map(range_infallible, |(range, errs)| (Some(range), errs)),
                    ),
-                    (
-                        value((), peek(one_of("/"))),
-                        map(regex_infallible, |(regex, errs)| (Some(regex), errs)),
-                    ),
                ),
                delimited_infallible(space0_infallible, term_or_phrase_infallible, nothing),
            ),
@@ -696,61 +672,6 @@ fn set_infallible(mut inp: &str) -> JResult<&str, UserInputLeaf> {
    }
 }

-fn regex(inp: &str) -> IResult<&str, UserInputLeaf> {
-    map(
-        terminated(
-            delimited(
-                char('/'),
-                many1(alt((preceded(char('\\'), char('/')), none_of("/")))),
-                char('/'),
-            ),
-            peek(alt((multispace1, eof))),
-        ),
-        |elements| UserInputLeaf::Regex {
-            field: None,
-            pattern: elements.into_iter().collect::<String>(),
-        },
-    )(inp)
-}
-
-fn regex_infallible(inp: &str) -> JResult<&str, UserInputLeaf> {
-    match terminated_infallible(
-        delimited_infallible(
-            opt_i_err(char('/'), "missing delimiter /"),
-            opt_i(many1(alt((preceded(char('\\'), char('/')), none_of("/"))))),
-            opt_i_err(char('/'), "missing delimiter /"),
-        ),
-        opt_i_err(
-            peek(alt((multispace1, eof))),
-            "expected whitespace or end of input",
-        ),
-    )(inp)
-    {
-        Ok((rest, (elements_part, errors))) => {
-            let pattern = match elements_part {
-                Some(elements_part) => elements_part.into_iter().collect(),
-                None => String::new(),
-            };
-            let res = UserInputLeaf::Regex {
-                field: None,
-                pattern,
-            };
-            Ok((rest, (res, errors)))
-        }
-        Err(e) => {
-            let errs = vec![LenientErrorInternal {
-                pos: inp.len(),
-                message: e.to_string(),
-            }];
-            let res = UserInputLeaf::Regex {
-                field: None,
-                pattern: String::new(),
-            };
-            Ok((inp, (res, errs)))
-        }
-    }
-}
-
 fn negate(expr: UserInputAst) -> UserInputAst {
    expr.unary(Occur::MustNot)
 }
@@ -815,7 +736,7 @@ fn boosted_leaf(inp: &str) -> IResult<&str, UserInputAst> {
        tuple((leaf, fallible(boost))),
        |(leaf, boost_opt)| match boost_opt {
            Some(boost) if (boost - 1.0).abs() > f64::EPSILON => {
-                UserInputAst::Boost(Box::new(leaf), boost.into())
+                UserInputAst::Boost(Box::new(leaf), boost)
            }
            _ => leaf,
        },
@@ -827,7 +748,7 @@ fn boosted_leaf_infallible(inp: &str) -> JResult<&str, Option<UserInputAst>> {
        tuple_infallible((leaf_infallible, boost)),
        |((leaf, boost_opt), error)| match boost_opt {
            Some(boost) if (boost - 1.0).abs() > f64::EPSILON => (
-                leaf.map(|leaf| UserInputAst::Boost(Box::new(leaf), boost.into())),
+                leaf.map(|leaf| UserInputAst::Boost(Box::new(leaf), boost)),
                error,
            ),
            _ => (leaf, error),
@@ -1078,25 +999,12 @@ pub fn parse_to_ast_lenient(query_str: &str) -> (UserInputAst, Vec<LenientError>
    (rewrite_ast(res), errors)
 }

+/// Removes unnecessary children clauses in AST
+///
+/// Motivated by [issue #1433](https://github.com/quickwit-oss/tantivy/issues/1433)
 fn rewrite_ast(mut input: UserInputAst) -> UserInputAst {
-    if let UserInputAst::Clause(sub_clauses) = &mut input {
-        // call rewrite_ast recursively on children clauses if applicable
-        let mut new_clauses = Vec::with_capacity(sub_clauses.len());
-        for (occur, clause) in sub_clauses.drain(..) {
-            let rewritten_clause = rewrite_ast(clause);
-            new_clauses.push((occur, rewritten_clause));
-        }
-        *sub_clauses = new_clauses;
-
-        // remove duplicate child clauses
-        // e.g. (+a +b) OR (+c +d) OR (+a +b)  => (+a +b) OR (+c +d)
-        let mut seen = FnvHashSet::default();
-        sub_clauses.retain(|term| seen.insert(term.clone()));
-
-        // Removes unnecessary children clauses in AST
-        //
-        // Motivated by [issue #1433](https://github.com/quickwit-oss/tantivy/issues/1433)
-        for term in sub_clauses {
+    if let UserInputAst::Clause(terms) = &mut input {
+        for term in terms {
            rewrite_ast_clause(term);
        }
    }
@@ -1105,7 +1013,7 @@ fn rewrite_ast(mut input: UserInputAst) -> UserInputAst {

 fn rewrite_ast_clause(input: &mut (Option<Occur>, UserInputAst)) {
    match input {
-        (None, UserInputAst::Clause(clauses)) if clauses.len() == 1 => {
+        (None, UserInputAst::Clause(ref mut clauses)) if clauses.len() == 1 => {
            *input = clauses.pop().unwrap(); // safe because clauses.len() == 1
        }
        _ => {}
@@ -1358,10 +1266,6 @@ mod test {
            super::field_name("~my~field:a"),
            Ok(("a", "~my~field".to_string()))
        );
-        assert_eq!(
-            super::field_name(".my.field.name : a"),
-            Ok(("a", ".my.field.name".to_string()))
-        );
        for special_char in SPECIAL_CHARS.iter() {
            let query = &format!("\\{special_char}my\\{special_char}field:a");
            assert_eq!(
@@ -1455,7 +1359,7 @@ mod test {

    #[test]
    fn test_range_parser_lenient() {
-        let literal = |query| literal_infallible(query).unwrap().1.0.unwrap();
+        let literal = |query| literal_infallible(query).unwrap().1 .0.unwrap();

        // same tests as non-lenient
        let res = literal("title: <hello");
@@ -1593,11 +1497,6 @@ mod test {
        test_is_parse_err(r#"field:(+a -"b c""#, r#"(+"field":a -"field":"b c")"#);
    }

-    #[test]
-    fn field_re_specification() {
-        test_parse_query_to_ast_helper(r#"field:(abc AND b:cde)"#, r#"(+"field":abc +"b":cde)"#);
-    }
-
    #[test]
    fn test_parse_query_single_term() {
        test_parse_query_to_ast_helper("abc", "abc");
@@ -1720,19 +1619,13 @@ mod test {

    #[test]
    fn test_exist_query() {
-        test_parse_query_to_ast_helper("a:*", "$exists(\"a\")");
-        test_parse_query_to_ast_helper("a: *", "$exists(\"a\")");
+        test_parse_query_to_ast_helper("a:*", "\"a\":*");
+        test_parse_query_to_ast_helper("a: *", "\"a\":*");
+        // an exists query followed by a default-field term `b`
+        test_is_parse_err("a:*b", "(*\"a\":* *b)");

-        test_parse_query_to_ast_helper(
-            "(hello AND toto:*) OR happy",
-            "(?(+hello +$exists(\"toto\")) ?happy)",
-        );
-        test_parse_query_to_ast_helper("(a:*)", "$exists(\"a\")");
-
-        // these are term/wildcard query (not a phrase prefix)
+        // this is a term query (not a phrase prefix)
        test_parse_query_to_ast_helper("a:b*", "\"a\":b*");
-        test_parse_query_to_ast_helper("a:*b", "\"a\":*b");
-        test_parse_query_to_ast_helper(r#"a:*def*"#, "\"a\":*def*");
    }

    #[test]
@@ -1768,72 +1661,4 @@ mod test {
    fn test_invalid_field() {
        test_is_parse_err(r#"!bc:def"#, "!bc:def");
    }
-
-    #[test]
-    fn test_regex_parser() {
-        let r = parse_to_ast(r#"a:/joh?n(ath[oa]n)/"#);
-        assert!(r.is_ok(), "Failed to parse custom query: {r:?}");
-        let (_, input) = r.unwrap();
-        match input {
-            UserInputAst::Leaf(leaf) => match leaf.as_ref() {
-                UserInputLeaf::Regex { field, pattern } => {
-                    assert_eq!(field, &Some("a".to_string()));
-                    assert_eq!(pattern, "joh?n(ath[oa]n)");
-                }
-                _ => panic!("Expected a regex leaf, got {leaf:?}"),
-            },
-            _ => panic!("Expected a leaf"),
-        }
-        let r = parse_to_ast(r#"a:/\\/cgi-bin\\/luci.*/"#);
-        assert!(r.is_ok(), "Failed to parse custom query: {r:?}");
-        let (_, input) = r.unwrap();
-        match input {
-            UserInputAst::Leaf(leaf) => match leaf.as_ref() {
-                UserInputLeaf::Regex { field, pattern } => {
-                    assert_eq!(field, &Some("a".to_string()));
-                    assert_eq!(pattern, "\\/cgi-bin\\/luci.*");
-                }
-                _ => panic!("Expected a regex leaf, got {leaf:?}"),
-            },
-            _ => panic!("Expected a leaf"),
-        }
-    }
-
-    #[test]
-    fn test_regex_parser_lenient() {
-        let literal = |query| literal_infallible(query).unwrap().1;
-
-        let (res, errs) = literal(r#"a:/joh?n(ath[oa]n)/"#);
-        let expected = UserInputLeaf::Regex {
-            field: Some("a".to_string()),
-            pattern: "joh?n(ath[oa]n)".to_string(),
-        }
-        .into();
-        assert_eq!(res.unwrap(), expected);
-        assert!(errs.is_empty(), "Expected no errors, got: {errs:?}");
-
-        let (res, errs) = literal("title:/joh?n(ath[oa]n)");
-        let expected = UserInputLeaf::Regex {
-            field: Some("title".to_string()),
-            pattern: "joh?n(ath[oa]n)".to_string(),
-        }
-        .into();
-        assert_eq!(res.unwrap(), expected);
-        assert_eq!(errs.len(), 1, "Expected 1 error, got: {errs:?}");
-        assert_eq!(
-            errs[0].message, "missing delimiter /",
-            "Unexpected error message",
-        );
-    }
-
-    #[test]
-    fn test_space_before_value() {
-        test_parse_query_to_ast_helper("field : a", r#""field":a"#);
-        test_parse_query_to_ast_helper("field:    a", r#""field":a"#);
-        test_parse_query_to_ast_helper("field         :a", r#""field":a"#);
-        test_parse_query_to_ast_helper(
-            "field : 'happy tax payer' AND other_field  : 1",
-            r#"(+"field":'happy tax payer' +"other_field":1)"#,
-        );
-    }
 }
--- a/query-grammar/src/user_input_ast.rs
+++ b/query-grammar/src/user_input_ast.rs
@@ -1,13 +1,9 @@
 use std::fmt;
 use std::fmt::{Debug, Formatter};

-use serde::Serialize;
-
 use crate::Occur;

-#[derive(PartialEq, Eq, Hash, Clone, Serialize)]
-#[serde(tag = "type")]
-#[serde(rename_all = "snake_case")]
+#[derive(PartialEq, Clone)]
 pub enum UserInputLeaf {
    Literal(UserInputLiteral),
    All,
@@ -23,10 +19,6 @@ pub enum UserInputLeaf {
    Exists {
        field: String,
    },
-    Regex {
-        field: Option<String>,
-        pattern: String,
-    },
 }

 impl UserInputLeaf {
@@ -50,13 +42,12 @@ impl UserInputLeaf {
            UserInputLeaf::Exists { field: _ } => UserInputLeaf::Exists {
                field: field.expect("Exist query without a field isn't allowed"),
            },
-            UserInputLeaf::Regex { field: _, pattern } => UserInputLeaf::Regex { field, pattern },
        }
    }

    pub(crate) fn set_default_field(&mut self, default_field: String) {
        match self {
-            UserInputLeaf::Literal(literal) if literal.field_name.is_none() => {
+            UserInputLeaf::Literal(ref mut literal) if literal.field_name.is_none() => {
                literal.field_name = Some(default_field)
            }
            UserInputLeaf::All => {
@@ -64,8 +55,12 @@ impl UserInputLeaf {
                    field: default_field,
                }
            }
-            UserInputLeaf::Range { field, .. } if field.is_none() => *field = Some(default_field),
-            UserInputLeaf::Set { field, .. } if field.is_none() => *field = Some(default_field),
+            UserInputLeaf::Range { ref mut field, .. } if field.is_none() => {
+                *field = Some(default_field)
+            }
+            UserInputLeaf::Set { ref mut field, .. } if field.is_none() => {
+                *field = Some(default_field)
+            }
            _ => (), // field was already set, do nothing
        }
    }
@@ -76,11 +71,11 @@ impl Debug for UserInputLeaf {
        match self {
            UserInputLeaf::Literal(literal) => literal.fmt(formatter),
            UserInputLeaf::Range {
-                field,
-                lower,
-                upper,
+                ref field,
+                ref lower,
+                ref upper,
            } => {
-                if let Some(field) = field {
+                if let Some(ref field) = field {
                    // TODO properly escape field (in case of \")
                    write!(formatter, "\"{field}\":")?;
                }
@@ -90,7 +85,7 @@ impl Debug for UserInputLeaf {
                Ok(())
            }
            UserInputLeaf::Set { field, elements } => {
-                if let Some(field) = field {
+                if let Some(ref field) = field {
                    // TODO properly escape field (in case of \")
                    write!(formatter, "\"{field}\": ")?;
                }
@@ -106,30 +101,20 @@ impl Debug for UserInputLeaf {
            }
            UserInputLeaf::All => write!(formatter, "*"),
            UserInputLeaf::Exists { field } => {
-                write!(formatter, "$exists(\"{field}\")")
-            }
-            UserInputLeaf::Regex { field, pattern } => {
-                if let Some(field) = field {
-                    // TODO properly escape field (in case of \")
-                    write!(formatter, "\"{field}\":")?;
-                }
-                // TODO properly escape pattern (in case of \")
-                write!(formatter, "/{pattern}/")
+                write!(formatter, "\"{field}\":*")
            }
        }
    }
 }

-#[derive(Copy, Clone, Eq, PartialEq, Hash, Debug, Serialize)]
-#[serde(rename_all = "snake_case")]
+#[derive(Copy, Clone, Eq, PartialEq, Debug)]
 pub enum Delimiter {
    SingleQuotes,
    DoubleQuotes,
    None,
 }

-#[derive(PartialEq, Eq, Hash, Clone, Serialize)]
-#[serde(rename_all = "snake_case")]
+#[derive(PartialEq, Clone)]
 pub struct UserInputLiteral {
    pub field_name: Option<String>,
    pub phrase: String,
@@ -167,9 +152,7 @@ impl fmt::Debug for UserInputLiteral {
    }
 }

-#[derive(PartialEq, Eq, Hash, Debug, Clone, Serialize)]
-#[serde(tag = "type", content = "value")]
-#[serde(rename_all = "snake_case")]
+#[derive(PartialEq, Debug, Clone)]
 pub enum UserInputBound {
    Inclusive(String),
    Exclusive(String),
@@ -204,39 +187,11 @@ impl UserInputBound {
    }
 }

-#[derive(PartialEq, Eq, Hash, Clone, Serialize)]
-#[serde(into = "UserInputAstSerde")]
+#[derive(PartialEq, Clone)]
 pub enum UserInputAst {
    Clause(Vec<(Option<Occur>, UserInputAst)>),
-    Boost(Box<UserInputAst>, ordered_float::OrderedFloat<f64>),
    Leaf(Box<UserInputLeaf>),
-}
-
-#[derive(Serialize)]
-#[serde(tag = "type", rename_all = "snake_case")]
-enum UserInputAstSerde {
-    Bool {
-        clauses: Vec<(Option<Occur>, UserInputAst)>,
-    },
-    Boost {
-        underlying: Box<UserInputAst>,
-        boost: f64,
-    },
-    #[serde(untagged)]
-    Leaf(Box<UserInputLeaf>),
-}
-
-impl From<UserInputAst> for UserInputAstSerde {
-    fn from(ast: UserInputAst) -> Self {
-        match ast {
-            UserInputAst::Clause(clause) => UserInputAstSerde::Bool { clauses: clause },
-            UserInputAst::Boost(underlying, boost) => UserInputAstSerde::Boost {
-                underlying,
-                boost: boost.into_inner(),
-            },
-            UserInputAst::Leaf(leaf) => UserInputAstSerde::Leaf(leaf),
-        }
-    }
+    Boost(Box<UserInputAst>, f64),
 }

 impl UserInputAst {
@@ -277,7 +232,7 @@ impl UserInputAst {
                .iter_mut()
                .for_each(|(_, ast)| ast.set_default_field(field.clone())),
            UserInputAst::Leaf(leaf) => leaf.set_default_field(field),
-            UserInputAst::Boost(ast, _) => ast.set_default_field(field),
+            UserInputAst::Boost(ref mut ast, _) => ast.set_default_field(field),
        }
    }
 }
@@ -330,126 +285,3 @@ impl fmt::Debug for UserInputAst {
        }
    }
 }
-
-#[cfg(test)]
-mod tests {
-    use super::*;
-
-    #[test]
-    fn test_all_leaf_serialization() {
-        let ast = UserInputAst::Leaf(Box::new(UserInputLeaf::All));
-        let json = serde_json::to_string(&ast).unwrap();
-        assert_eq!(json, r#"{"type":"all"}"#);
-    }
-
-    #[test]
-    fn test_literal_leaf_serialization() {
-        let literal = UserInputLiteral {
-            field_name: Some("title".to_string()),
-            phrase: "hello".to_string(),
-            delimiter: Delimiter::None,
-            slop: 0,
-            prefix: false,
-        };
-        let ast = UserInputAst::Leaf(Box::new(UserInputLeaf::Literal(literal)));
-        let json = serde_json::to_string(&ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"literal","field_name":"title","phrase":"hello","delimiter":"none","slop":0,"prefix":false}"#
-        );
-    }
-
-    #[test]
-    fn test_range_leaf_serialization() {
-        let range = UserInputLeaf::Range {
-            field: Some("price".to_string()),
-            lower: UserInputBound::Inclusive("10".to_string()),
-            upper: UserInputBound::Exclusive("100".to_string()),
-        };
-        let ast = UserInputAst::Leaf(Box::new(range));
-        let json = serde_json::to_string(&ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"range","field":"price","lower":{"type":"inclusive","value":"10"},"upper":{"type":"exclusive","value":"100"}}"#
-        );
-    }
-
-    #[test]
-    fn test_range_leaf_unbounded_serialization() {
-        let range = UserInputLeaf::Range {
-            field: Some("price".to_string()),
-            lower: UserInputBound::Inclusive("10".to_string()),
-            upper: UserInputBound::Unbounded,
-        };
-        let ast = UserInputAst::Leaf(Box::new(range));
-        let json = serde_json::to_string(&ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"range","field":"price","lower":{"type":"inclusive","value":"10"},"upper":{"type":"unbounded"}}"#
-        );
-    }
-
-    #[test]
-    fn test_boost_serialization() {
-        let inner_ast = UserInputAst::Leaf(Box::new(UserInputLeaf::All));
-        let boost_ast = UserInputAst::Boost(Box::new(inner_ast), 2.5.into());
-        let json = serde_json::to_string(&boost_ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"boost","underlying":{"type":"all"},"boost":2.5}"#
-        );
-    }
-
-    #[test]
-    fn test_boost_serialization2() {
-        let boost_ast = UserInputAst::Boost(
-            Box::new(UserInputAst::Clause(vec![
-                (
-                    Some(Occur::Must),
-                    UserInputAst::Leaf(Box::new(UserInputLeaf::All)),
-                ),
-                (
-                    Some(Occur::Should),
-                    UserInputAst::Leaf(Box::new(UserInputLeaf::Literal(UserInputLiteral {
-                        field_name: Some("title".to_string()),
-                        phrase: "hello".to_string(),
-                        delimiter: Delimiter::None,
-                        slop: 0,
-                        prefix: false,
-                    }))),
-                ),
-            ])),
-            2.5.into(),
-        );
-        let json = serde_json::to_string(&boost_ast).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"boost","underlying":{"type":"bool","clauses":[["must",{"type":"all"}],["should",{"type":"literal","field_name":"title","phrase":"hello","delimiter":"none","slop":0,"prefix":false}]]},"boost":2.5}"#
-        );
-    }
-
-    #[test]
-    fn test_clause_serialization() {
-        let clause = UserInputAst::Clause(vec![
-            (
-                Some(Occur::Must),
-                UserInputAst::Leaf(Box::new(UserInputLeaf::All)),
-            ),
-            (
-                Some(Occur::Should),
-                UserInputAst::Leaf(Box::new(UserInputLeaf::Literal(UserInputLiteral {
-                    field_name: Some("title".to_string()),
-                    phrase: "hello".to_string(),
-                    delimiter: Delimiter::None,
-                    slop: 0,
-                    prefix: false,
-                }))),
-            ),
-        ]);
-        let json = serde_json::to_string(&clause).unwrap();
-        assert_eq!(
-            json,
-            r#"{"type":"bool","clauses":[["must",{"type":"all"}],["should",{"type":"literal","field_name":"title","phrase":"hello","delimiter":"none","slop":0,"prefix":false}]]}"#
-        );
-    }
-}
--- a/src/aggregation/README.md
+++ b/src/aggregation/README.md
@@ -20,16 +20,17 @@ Contains all metric aggregations, like average aggregation. Metric aggregations
 #### agg_req
 agg_req contains the users aggregation request. Deserialization from json is compatible with elasticsearch aggregation requests.

-#### agg_data
-agg_data contains the users aggregation request enriched with fast field accessors etc, which are
+#### agg_req_with_accessor
+agg_req_with_accessor contains the user's aggregation request enriched with fast field accessors etc., which are
 used during collection.

 #### segment_agg_result
 segment_agg_result contains the aggregation result tree, which is used for collection of a segment.
-agg_data is passed during collection.
+The tree from agg_req_with_accessor is passed during collection.

 #### intermediate_agg_result
 intermediate_agg_result contains the aggregation tree for merging with other trees.

 #### agg_result
 agg_result contains the final aggregation tree.
+
--- a/src/aggregation/accessor_helpers.rs
+++ b/src/aggregation/accessor_helpers.rs
@@ -1,104 +0,0 @@
-//! This will enhance the request tree with access to the fastfield and metadata.
-
-use std::io;
-
-use columnar::{Column, ColumnType};
-
-use crate::aggregation::{f64_to_fastfield_u64, Key};
-use crate::index::SegmentReader;
-
-/// Get the missing value as internal u64 representation
-///
-/// For terms we use u64::MAX as sentinel value
-/// For numerical data we convert the value into the representation
-/// we would get from the fast field, when we open it as u64_lenient_for_type.
-///
-/// That way we can use it the same way as if it would come from the fastfield.
-pub(crate) fn get_missing_val_as_u64_lenient(
-    column_type: ColumnType,
-    missing: &Key,
-    field_name: &str,
-) -> crate::Result<Option<u64>> {
-    let missing_val = match missing {
-        Key::Str(_) if column_type == ColumnType::Str => Some(u64::MAX),
-        // Allow fallback to number on text fields
-        Key::F64(_) if column_type == ColumnType::Str => Some(u64::MAX),
-        Key::U64(_) if column_type == ColumnType::Str => Some(u64::MAX),
-        Key::I64(_) if column_type == ColumnType::Str => Some(u64::MAX),
-        Key::F64(val) if column_type.numerical_type().is_some() => {
-            f64_to_fastfield_u64(*val, &column_type)
-        }
-        // NOTE: We may loose precision of the passed missing value by casting i64 and u64 to f64.
-        Key::I64(val) if column_type.numerical_type().is_some() => {
-            f64_to_fastfield_u64(*val as f64, &column_type)
-        }
-        Key::U64(val) if column_type.numerical_type().is_some() => {
-            f64_to_fastfield_u64(*val as f64, &column_type)
-        }
-        _ => {
-            return Err(crate::TantivyError::InvalidArgument(format!(
-                "Missing value {missing:?} for field {field_name} is not supported for column \
-                 type {column_type:?}"
-            )));
-        }
-    };
-    Ok(missing_val)
-}
-
-pub(crate) fn get_numeric_or_date_column_types() -> &'static [ColumnType] {
-    &[
-        ColumnType::F64,
-        ColumnType::U64,
-        ColumnType::I64,
-        ColumnType::DateTime,
-    ]
-}
-
-/// Get fast field reader or empty as default.
-pub(crate) fn get_ff_reader(
-    reader: &SegmentReader,
-    field_name: &str,
-    allowed_column_types: Option<&[ColumnType]>,
-) -> crate::Result<(columnar::Column<u64>, ColumnType)> {
-    let ff_fields = reader.fast_fields();
-    let ff_field_with_type = ff_fields
-        .u64_lenient_for_type(allowed_column_types, field_name)?
-        .unwrap_or_else(|| {
-            (
-                Column::build_empty_column(reader.num_docs()),
-                ColumnType::U64,
-            )
-        });
-    Ok(ff_field_with_type)
-}
-
-pub(crate) fn get_dynamic_columns(
-    reader: &SegmentReader,
-    field_name: &str,
-) -> crate::Result<Vec<columnar::DynamicColumn>> {
-    let ff_fields = reader.fast_fields().dynamic_column_handles(field_name)?;
-    let cols = ff_fields
-        .iter()
-        .map(|h| h.open())
-        .collect::<io::Result<_>>()?;
-    assert!(!ff_fields.is_empty(), "field {field_name} not found");
-    Ok(cols)
-}
-
-/// Get all fast field reader or empty as default.
-///
-/// Is guaranteed to return at least one column.
-pub(crate) fn get_all_ff_reader_or_empty(
-    reader: &SegmentReader,
-    field_name: &str,
-    allowed_column_types: Option<&[ColumnType]>,
-    fallback_type: ColumnType,
-) -> crate::Result<Vec<(columnar::Column<u64>, ColumnType)>> {
-    let ff_fields = reader.fast_fields();
-    let mut ff_field_with_type =
-        ff_fields.u64_lenient_for_type_all(allowed_column_types, field_name)?;
-    if ff_field_with_type.is_empty() {
-        ff_field_with_type.push((Column::build_empty_column(reader.num_docs()), fallback_type));
-    }
-    Ok(ff_field_with_type)
-}
--- a/src/aggregation/agg_data.rs
+++ b/src/aggregation/agg_data.rs
--- a/src/aggregation/agg_limits.rs
+++ b/src/aggregation/agg_limits.rs
@@ -70,7 +70,7 @@ impl AggregationLimitsGuard {
    /// *memory_limit*
    /// memory_limit is defined in bytes.
    /// Aggregation fails when the estimated memory consumption of the aggregation is higher than
-    /// memory_limit.
+    /// memory_limit.     
    /// memory_limit will default to `DEFAULT_MEMORY_LIMIT` (500MB)
    ///
    /// *bucket_limit*
--- a/src/aggregation/agg_req.rs
+++ b/src/aggregation/agg_req.rs
@@ -26,14 +26,12 @@
 //! let _agg_req: Aggregations = serde_json::from_str(elasticsearch_compatible_json_req).unwrap();
 //! ```

-use std::collections::HashSet;
+use std::collections::{HashMap, HashSet};

-use rustc_hash::FxHashMap;
 use serde::{Deserialize, Serialize};

 use super::bucket::{
-    DateHistogramAggregationReq, FilterAggregation, HistogramAggregation, RangeAggregation,
-    TermsAggregation,
+    DateHistogramAggregationReq, HistogramAggregation, RangeAggregation, TermsAggregation,
 };
 use super::metric::{
    AverageAggregation, CardinalityAggregationReq, CountAggregation, ExtendedStatsAggregation,
@@ -45,7 +43,7 @@ use super::metric::{
 /// defined names. It is also used in buckets aggregations to define sub-aggregations.
 ///
 /// The key is the user defined name of the aggregation.
-pub type Aggregations = FxHashMap<String, Aggregation>;
+pub type Aggregations = HashMap<String, Aggregation>;

 /// Aggregation request.
 ///
@@ -131,9 +129,6 @@ pub enum AggregationVariants {
    /// Put data into buckets of terms.
    #[serde(rename = "terms")]
    Terms(TermsAggregation),
-    /// Filter documents into a single bucket.
-    #[serde(rename = "filter")]
-    Filter(FilterAggregation),

    // Metric aggregation types
    /// Computes the average of the extracted values.
@@ -179,7 +174,6 @@ impl AggregationVariants {
            AggregationVariants::Range(range) => vec![range.field.as_str()],
            AggregationVariants::Histogram(histogram) => vec![histogram.field.as_str()],
            AggregationVariants::DateHistogram(histogram) => vec![histogram.field.as_str()],
-            AggregationVariants::Filter(filter) => filter.get_fast_field_names(),
            AggregationVariants::Average(avg) => vec![avg.field_name()],
            AggregationVariants::Count(count) => vec![count.field_name()],
            AggregationVariants::Max(max) => vec![max.field_name()],
@@ -214,6 +208,13 @@ impl AggregationVariants {
            _ => None,
        }
    }
+    pub(crate) fn as_top_hits(&self) -> Option<&TopHitsAggregationReq> {
+        match &self {
+            AggregationVariants::TopHits(top_hits) => Some(top_hits),
+            _ => None,
+        }
+    }
+
    pub(crate) fn as_percentile(&self) -> Option<&PercentilesAggregationReq> {
        match &self {
            AggregationVariants::Percentiles(percentile_req) => Some(percentile_req),
--- a/src/aggregation/agg_req_with_accessor.rs
+++ b/src/aggregation/agg_req_with_accessor.rs
@@ -0,0 +1,457 @@
+//! This will enhance the request tree with access to the fastfield and metadata.
+
+use std::collections::HashMap;
+use std::io;
+
+use columnar::{Column, ColumnBlockAccessor, ColumnType, DynamicColumn, StrColumn};
+
+use super::agg_req::{Aggregation, AggregationVariants, Aggregations};
+use super::bucket::{
+    DateHistogramAggregationReq, HistogramAggregation, RangeAggregation, TermsAggregation,
+};
+use super::metric::{
+    AverageAggregation, CardinalityAggregationReq, CountAggregation, ExtendedStatsAggregation,
+    MaxAggregation, MinAggregation, StatsAggregation, SumAggregation,
+};
+use super::segment_agg_result::AggregationLimitsGuard;
+use super::VecWithNames;
+use crate::aggregation::{f64_to_fastfield_u64, Key};
+use crate::index::SegmentReader;
+use crate::SegmentOrdinal;
+
+#[derive(Default)]
+pub(crate) struct AggregationsWithAccessor {
+    pub aggs: VecWithNames<AggregationWithAccessor>,
+}
+
+impl AggregationsWithAccessor {
+    fn from_data(aggs: VecWithNames<AggregationWithAccessor>) -> Self {
+        Self { aggs }
+    }
+
+    pub fn is_empty(&self) -> bool {
+        self.aggs.is_empty()
+    }
+}
+
+pub struct AggregationWithAccessor {
+    pub(crate) segment_ordinal: SegmentOrdinal,
+    /// In general there can be buckets without fast field access, e.g. buckets that are created
+    /// based on search terms. That is not the case currently, but eventually this needs to be
+    /// an Option or moved.
+    pub(crate) accessor: Column<u64>,
+    /// u64 value to insert when a document has no value (the `missing` use case)
+    pub(crate) missing_value_for_accessor: Option<u64>,
+    pub(crate) str_dict_column: Option<StrColumn>,
+    pub(crate) field_type: ColumnType,
+    pub(crate) sub_aggregation: AggregationsWithAccessor,
+    pub(crate) limits: AggregationLimitsGuard,
+    pub(crate) column_block_accessor: ColumnBlockAccessor<u64>,
+    /// Used for the missing term aggregation, which checks all columns for existence,
+    /// and also for the `top_hits` aggregation, which may sort on multiple fields.
+    /// By convention the missing aggregation is chosen when this property is set
+    /// (instead of being set in `agg`).
+    /// If this needs to be used by other aggregations, we need to refactor this.
+    // NOTE: we can make all other aggregations use this instead of the `accessor` and `field_type`
+    // (making them obsolete) But will it have a performance impact?
+    pub(crate) accessors: Vec<(Column<u64>, ColumnType)>,
+    /// Map field names to all associated column accessors.
+    /// This field is used for `docvalue_fields`, which is currently only supported for `top_hits`.
+    pub(crate) value_accessors: HashMap<String, Vec<DynamicColumn>>,
+    pub(crate) agg: Aggregation,
+}
+
+impl AggregationWithAccessor {
+    /// May return multiple accessors if the aggregation is e.g. on mixed field types.
+    fn try_from_agg(
+        agg: &Aggregation,
+        sub_aggregation: &Aggregations,
+        reader: &SegmentReader,
+        segment_ordinal: SegmentOrdinal,
+        limits: AggregationLimitsGuard,
+    ) -> crate::Result<Vec<AggregationWithAccessor>> {
+        let mut agg = agg.clone();
+
+        let add_agg_with_accessor = |agg: &Aggregation,
+                                     accessor: Column<u64>,
+                                     column_type: ColumnType,
+                                     aggs: &mut Vec<AggregationWithAccessor>|
+         -> crate::Result<()> {
+            let res = AggregationWithAccessor {
+                segment_ordinal,
+                accessor,
+                accessors: Default::default(),
+                value_accessors: Default::default(),
+                field_type: column_type,
+                sub_aggregation: get_aggs_with_segment_accessor_and_validate(
+                    sub_aggregation,
+                    reader,
+                    segment_ordinal,
+                    &limits,
+                )?,
+                agg: agg.clone(),
+                limits: limits.clone(),
+                missing_value_for_accessor: None,
+                str_dict_column: None,
+                column_block_accessor: Default::default(),
+            };
+            aggs.push(res);
+            Ok(())
+        };
+
+        let add_agg_with_accessors = |agg: &Aggregation,
+                                      accessors: Vec<(Column<u64>, ColumnType)>,
+                                      aggs: &mut Vec<AggregationWithAccessor>,
+                                      value_accessors: HashMap<String, Vec<DynamicColumn>>|
+         -> crate::Result<()> {
+            let (accessor, field_type) = accessors.first().expect("at least one accessor");
+            let limits = limits.clone();
+            let res = AggregationWithAccessor {
+                segment_ordinal,
+                // TODO: We should do away with the `accessor` field altogether
+                accessor: accessor.clone(),
+                value_accessors,
+                field_type: *field_type,
+                accessors,
+                sub_aggregation: get_aggs_with_segment_accessor_and_validate(
+                    sub_aggregation,
+                    reader,
+                    segment_ordinal,
+                    &limits,
+                )?,
+                agg: agg.clone(),
+                limits,
+                missing_value_for_accessor: None,
+                str_dict_column: None,
+                column_block_accessor: Default::default(),
+            };
+            aggs.push(res);
+            Ok(())
+        };
+
+        let mut res: Vec<AggregationWithAccessor> = Vec::new();
+        use AggregationVariants::*;
+
+        match agg.agg {
+            Range(RangeAggregation {
+                field: ref field_name,
+                ..
+            }) => {
+                let (accessor, column_type) =
+                    get_ff_reader(reader, field_name, Some(get_numeric_or_date_column_types()))?;
+                add_agg_with_accessor(&agg, accessor, column_type, &mut res)?;
+            }
+            Histogram(HistogramAggregation {
+                field: ref field_name,
+                ..
+            }) => {
+                let (accessor, column_type) =
+                    get_ff_reader(reader, field_name, Some(get_numeric_or_date_column_types()))?;
+                add_agg_with_accessor(&agg, accessor, column_type, &mut res)?;
+            }
+            DateHistogram(DateHistogramAggregationReq {
+                field: ref field_name,
+                ..
+            }) => {
+                let (accessor, column_type) =
+                    // Only DateTime is supported for DateHistogram
+                    get_ff_reader(reader, field_name, Some(&[ColumnType::DateTime]))?;
+                add_agg_with_accessor(&agg, accessor, column_type, &mut res)?;
+            }
+            Terms(TermsAggregation {
+                field: ref field_name,
+                ref missing,
+                ..
+            })
+            | Cardinality(CardinalityAggregationReq {
+                field: ref field_name,
+                ref missing,
+                ..
+            }) => {
+                let str_dict_column = reader.fast_fields().str(field_name)?;
+                let allowed_column_types = [
+                    ColumnType::I64,
+                    ColumnType::U64,
+                    ColumnType::F64,
+                    ColumnType::Str,
+                    ColumnType::DateTime,
+                    ColumnType::Bool,
+                    ColumnType::IpAddr,
+                    // ColumnType::Bytes Unsupported
+                ];
+
+                // In case the column is empty we want the shim column to match the missing type
+                let fallback_type = missing
+                    .as_ref()
+                    .map(|missing| match missing {
+                        Key::Str(_) => ColumnType::Str,
+                        Key::F64(_) => ColumnType::F64,
+                        Key::I64(_) => ColumnType::I64,
+                        Key::U64(_) => ColumnType::U64,
+                    })
+                    .unwrap_or(ColumnType::U64);
+                let column_and_types = get_all_ff_reader_or_empty(
+                    reader,
+                    field_name,
+                    Some(&allowed_column_types),
+                    fallback_type,
+                )?;
+                let missing_and_more_than_one_col = column_and_types.len() > 1 && missing.is_some();
+                let text_on_non_text_col = column_and_types.len() == 1
+                    && column_and_types[0].1.numerical_type().is_some()
+                    && missing
+                        .as_ref()
+                        .map(|m| matches!(m, Key::Str(_)))
+                        .unwrap_or(false);
+
+                // Actually we could convert the text to a number and have the fast path, if it is
+                // provided in Rfc3339 format. But this use case is probably not common
+                // enough to justify the effort.
+                let text_on_date_col = column_and_types.len() == 1
+                    && column_and_types[0].1 == ColumnType::DateTime
+                    && missing
+                        .as_ref()
+                        .map(|m| matches!(m, Key::Str(_)))
+                        .unwrap_or(false);
+
+                let use_special_missing_agg =
+                    missing_and_more_than_one_col || text_on_non_text_col || text_on_date_col;
+                if use_special_missing_agg {
+                    let column_and_types =
+                        get_all_ff_reader_or_empty(reader, field_name, None, fallback_type)?;
+
+                    let accessors = column_and_types
+                        .iter()
+                        .map(|c_t| (c_t.0.clone(), c_t.1))
+                        .collect();
+                    add_agg_with_accessors(&agg, accessors, &mut res, Default::default())?;
+                }
+
+                for (accessor, column_type) in column_and_types {
+                    let missing_value_term_agg = if use_special_missing_agg {
+                        None
+                    } else {
+                        missing.clone()
+                    };
+
+                    let missing_value_for_accessor =
+                        if let Some(missing) = missing_value_term_agg.as_ref() {
+                            get_missing_val_as_u64_lenient(
+                                column_type,
+                                missing,
+                                agg.agg.get_fast_field_names()[0],
+                            )?
+                        } else {
+                            None
+                        };
+
+                    let limits = limits.clone();
+                    let agg = AggregationWithAccessor {
+                        segment_ordinal,
+                        missing_value_for_accessor,
+                        accessor,
+                        accessors: Default::default(),
+                        value_accessors: Default::default(),
+                        field_type: column_type,
+                        sub_aggregation: get_aggs_with_segment_accessor_and_validate(
+                            sub_aggregation,
+                            reader,
+                            segment_ordinal,
+                            &limits,
+                        )?,
+                        agg: agg.clone(),
+                        str_dict_column: str_dict_column.clone(),
+                        limits,
+                        column_block_accessor: Default::default(),
+                    };
+                    res.push(agg);
+                }
+            }
+            Average(AverageAggregation {
+                field: ref field_name,
+                ..
+            })
+            | Count(CountAggregation {
+                field: ref field_name,
+                ..
+            })
+            | Max(MaxAggregation {
+                field: ref field_name,
+                ..
+            })
+            | Min(MinAggregation {
+                field: ref field_name,
+                ..
+            })
+            | Stats(StatsAggregation {
+                field: ref field_name,
+                ..
+            })
+            | ExtendedStats(ExtendedStatsAggregation {
+                field: ref field_name,
+                ..
+            })
+            | Sum(SumAggregation {
+                field: ref field_name,
+                ..
+            }) => {
+                let (accessor, column_type) =
+                    get_ff_reader(reader, field_name, Some(get_numeric_or_date_column_types()))?;
+                add_agg_with_accessor(&agg, accessor, column_type, &mut res)?;
+            }
+            Percentiles(ref percentiles) => {
+                let (accessor, column_type) = get_ff_reader(
+                    reader,
+                    percentiles.field_name(),
+                    Some(get_numeric_or_date_column_types()),
+                )?;
+                add_agg_with_accessor(&agg, accessor, column_type, &mut res)?;
+            }
+            TopHits(ref mut top_hits) => {
+                top_hits.validate_and_resolve_field_names(reader.fast_fields().columnar())?;
+                let accessors: Vec<(Column<u64>, ColumnType)> = top_hits
+                    .field_names()
+                    .iter()
+                    .map(|field| {
+                        get_ff_reader(reader, field, Some(get_numeric_or_date_column_types()))
+                    })
+                    .collect::<crate::Result<_>>()?;
+
+                let value_accessors = top_hits
+                    .value_field_names()
+                    .iter()
+                    .map(|field_name| {
+                        Ok((
+                            field_name.to_string(),
+                            get_dynamic_columns(reader, field_name)?,
+                        ))
+                    })
+                    .collect::<crate::Result<_>>()?;
+
+                add_agg_with_accessors(&agg, accessors, &mut res, value_accessors)?;
+            }
+        };
+
+        Ok(res)
+    }
+}
+
+/// Get the missing value as internal u64 representation
+///
+/// For terms we use u64::MAX as sentinel value
+/// For numerical data we convert the value into the representation
+/// we would get from the fast field, when we open it as u64_lenient_for_type.
+///
+/// That way we can use it the same way as if it came from the fast field.
+fn get_missing_val_as_u64_lenient(
+    column_type: ColumnType,
+    missing: &Key,
+    field_name: &str,
+) -> crate::Result<Option<u64>> {
+    let missing_val = match missing {
+        Key::Str(_) if column_type == ColumnType::Str => Some(u64::MAX),
+        // Allow fallback to number on text fields
+        Key::F64(_) if column_type == ColumnType::Str => Some(u64::MAX),
+        Key::U64(_) if column_type == ColumnType::Str => Some(u64::MAX),
+        Key::I64(_) if column_type == ColumnType::Str => Some(u64::MAX),
+        Key::F64(val) if column_type.numerical_type().is_some() => {
+            f64_to_fastfield_u64(*val, &column_type)
+        }
+        // NOTE: We may lose precision of the passed missing value by casting i64 and u64 to f64.
+        Key::I64(val) if column_type.numerical_type().is_some() => {
+            f64_to_fastfield_u64(*val as f64, &column_type)
+        }
+        Key::U64(val) if column_type.numerical_type().is_some() => {
+            f64_to_fastfield_u64(*val as f64, &column_type)
+        }
+        _ => {
+            return Err(crate::TantivyError::InvalidArgument(format!(
+                "Missing value {missing:?} for field {field_name} is not supported for column \
+                 type {column_type:?}"
+            )));
+        }
+    };
+    Ok(missing_val)
+}
+
+fn get_numeric_or_date_column_types() -> &'static [ColumnType] {
+    &[
+        ColumnType::F64,
+        ColumnType::U64,
+        ColumnType::I64,
+        ColumnType::DateTime,
+    ]
+}
+
+pub(crate) fn get_aggs_with_segment_accessor_and_validate(
+    aggs: &Aggregations,
+    reader: &SegmentReader,
+    segment_ordinal: SegmentOrdinal,
+    limits: &AggregationLimitsGuard,
+) -> crate::Result<AggregationsWithAccessor> {
+    let mut aggss = Vec::new();
+    for (key, agg) in aggs.iter() {
+        let aggs = AggregationWithAccessor::try_from_agg(
+            agg,
+            agg.sub_aggregation(),
+            reader,
+            segment_ordinal,
+            limits.clone(),
+        )?;
+        for agg in aggs {
+            aggss.push((key.to_string(), agg));
+        }
+    }
+    Ok(AggregationsWithAccessor::from_data(
+        VecWithNames::from_entries(aggss),
+    ))
+}
+
+/// Get fast field reader or empty as default.
+fn get_ff_reader(
+    reader: &SegmentReader,
+    field_name: &str,
+    allowed_column_types: Option<&[ColumnType]>,
+) -> crate::Result<(columnar::Column<u64>, ColumnType)> {
+    let ff_fields = reader.fast_fields();
+    let ff_field_with_type = ff_fields
+        .u64_lenient_for_type(allowed_column_types, field_name)?
+        .unwrap_or_else(|| {
+            (
+                Column::build_empty_column(reader.num_docs()),
+                ColumnType::U64,
+            )
+        });
+    Ok(ff_field_with_type)
+}
+
+fn get_dynamic_columns(
+    reader: &SegmentReader,
+    field_name: &str,
+) -> crate::Result<Vec<columnar::DynamicColumn>> {
+    let ff_fields = reader.fast_fields().dynamic_column_handles(field_name)?;
+    let cols = ff_fields
+        .iter()
+        .map(|h| h.open())
+        .collect::<io::Result<_>>()?;
+    assert!(!ff_fields.is_empty(), "field {field_name} not found");
+    Ok(cols)
+}
+
+/// Get all fast field readers, or an empty column as default.
+///
+/// Guaranteed to return at least one column.
+fn get_all_ff_reader_or_empty(
+    reader: &SegmentReader,
+    field_name: &str,
+    allowed_column_types: Option<&[ColumnType]>,
+    fallback_type: ColumnType,
+) -> crate::Result<Vec<(columnar::Column<u64>, ColumnType)>> {
+    let ff_fields = reader.fast_fields();
+    let mut ff_field_with_type =
+        ff_fields.u64_lenient_for_type_all(allowed_column_types, field_name)?;
+    if ff_field_with_type.is_empty() {
+        ff_field_with_type.push((Column::build_empty_column(reader.num_docs()), fallback_type));
+    }
+    Ok(ff_field_with_type)
+}
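
The NOTE in `get_missing_val_as_u64_lenient` warns that casting an `i64` or `u64` missing value to `f64` may lose precision. A minimal standalone sketch of that effect follows, using only the standard library; the helper names `i64_survives_f64_cast` and `u64_survives_f64_cast` are hypothetical and not part of this change. An `f64` has a 53-bit significand, so integers above 2^53 in magnitude are not all representable.

```rust
/// Hypothetical helper (not part of tantivy): returns true if `val` is
/// exactly representable as an f64, i.e. `val as f64` loses nothing.
/// The comparison is done in i128 so that the saturating float-to-int
/// cast cannot hide a rounding error at i64::MAX.
fn i64_survives_f64_cast(val: i64) -> bool {
    (val as f64) as i128 == val as i128
}

/// Same check for u64, compared in u128 for the same reason.
fn u64_survives_f64_cast(val: u64) -> bool {
    (val as f64) as u128 == val as u128
}

fn main() {
    let limit: i64 = 1 << 53; // 2^53, the last point where every integer is exact

    // Small values and 2^53 itself are exact.
    println!("{}", i64_survives_f64_cast(-42));
    println!("{}", i64_survives_f64_cast(limit));

    // 2^53 + 1 is rounded to a neighbouring representable value.
    println!("{}", i64_survives_f64_cast(limit + 1));

    // The extremes of the integer ranges are rounded up to a power of two.
    println!("{}", i64_survives_f64_cast(i64::MAX));
    println!("{}", u64_survives_f64_cast(u64::MAX));
}
```

In practice this means a `missing` value such as `Key::I64(9007199254740993)` could be mapped to a neighbouring number before it is converted to the fast field representation.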
--- a/src/aggregation/agg_result.rs
+++ b/src/aggregation/agg_result.rs
@@ -156,8 +156,6 @@ pub enum BucketResult {
        /// The upper bound error for the doc count of each term.
        doc_count_error_upper_bound: Option<u64>,
    },
-    /// This is the filter result - a single bucket with sub-aggregations
-    Filter(FilterBucketResult),
 }

 impl BucketResult {
@@ -174,11 +172,6 @@ impl BucketResult {
                sum_other_doc_count: _,
                doc_count_error_upper_bound: _,
            } => buckets.iter().map(|bucket| bucket.get_bucket_count()).sum(),
-            BucketResult::Filter(filter_result) => {
-                // Filter doesn't add to bucket count - it's not a user-facing bucket
-                // Only count sub-aggregation buckets
-                filter_result.sub_aggregations.get_bucket_count()
-            }
        }
    }
 }
@@ -315,25 +308,3 @@ impl RangeBucketEntry {
        1 + self.sub_aggregation.get_bucket_count()
    }
 }
-
-/// This is the filter bucket result, which contains the document count and sub-aggregations.
-///
-/// # JSON Format
-/// ```json
-/// {
-///   "electronics_only": {
-///     "doc_count": 2,
-///     "avg_price": {
-///       "value": 150.0
-///     }
-///   }
-/// }
-/// ```
-#[derive(Clone, Debug, PartialEq, Serialize, Deserialize)]
-pub struct FilterBucketResult {
-    /// Number of documents in the filter bucket
-    pub doc_count: u64,
-    /// Sub-aggregation results
-    #[serde(flatten)]
-    pub sub_aggregations: AggregationResults,
-}
--- a/src/aggregation/agg_tests.rs
+++ b/src/aggregation/agg_tests.rs
@@ -5,6 +5,7 @@ use crate::aggregation::agg_result::AggregationResults;
 use crate::aggregation::buf_collector::DOC_BLOCK_SIZE;
 use crate::aggregation::collector::AggregationCollector;
 use crate::aggregation::intermediate_agg_result::IntermediateAggregationResults;
+use crate::aggregation::segment_agg_result::AggregationLimitsGuard;
 use crate::aggregation::tests::{get_test_index_2_segments, get_test_index_from_values_and_terms};
 use crate::aggregation::DistributedAggregationCollector;
 use crate::query::{AllQuery, TermQuery};
@@ -127,8 +128,10 @@ fn test_aggregation_flushing(
            .unwrap();

    let agg_res: AggregationResults = if use_distributed_collector {
-        let collector =
-            DistributedAggregationCollector::from_aggs(agg_req.clone(), Default::default());
+        let collector = DistributedAggregationCollector::from_aggs(
+            agg_req.clone(),
+            AggregationLimitsGuard::default(),
+        );

        let searcher = reader.searcher();
        let intermediate_agg_result = searcher.search(&AllQuery, &collector).unwrap();
--- a/src/aggregation/bucket/filter.rs
+++ b/src/aggregation/bucket/filter.rs