update CHANGELOG for tantivy 0.26 release

Fix format (#2852 )
Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>
2026-03-21 12:40:43 +00:00 · 2026-03-18 19:14:45 +01:00 · 2026-03-16 10:43:39 +01:00 · 2026-03-12 16:47:11 +01:00 · 2026-03-12 16:44:38 +01:00 · 2026-02-20 16:56:56 +01:00
44 changed files with 171 additions and 7000 deletions
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,3 +1,47 @@
+Tantivy 0.26 (Unreleased)
+================================
+
+## Bugfixes
+- Align float query coercion during search with the columnar coercion rules [#2692](https://github.com/quickwit-oss/tantivy/pull/2692)(@fulmicoton)
+- Fix lenient elastic range queries with trailing closing parentheses [#2816](https://github.com/quickwit-oss/tantivy/pull/2816)(@evance-br)
+- Fix intersection `seek()` advancing below current doc id [#2812](https://github.com/quickwit-oss/tantivy/pull/2812)(@fulmicoton)
+- Fix phrase query prefixed with `*` [#2751](https://github.com/quickwit-oss/tantivy/pull/2751)(@Darkheir)
+- Fix `vint` buffer overflow during index creation [#2778](https://github.com/quickwit-oss/tantivy/pull/2778)(@rebasedming)
+- Fix integer overflow in `ExpUnrolledLinkedList` for large datasets [#2735](https://github.com/quickwit-oss/tantivy/pull/2735)(@mdashti)
+- Fix integer overflow in segment sorting and merge policy truncation [#2846](https://github.com/quickwit-oss/tantivy/pull/2846)(@anaslimem)
+- Fix merging of intermediate aggregation results [#2719](https://github.com/quickwit-oss/tantivy/pull/2719)(@PSeitz)
+
+## Features/Improvements
+- **Aggregation**
+    - Add filter aggregation [#2711](https://github.com/quickwit-oss/tantivy/pull/2711)(@mdashti)
+    - Add include/exclude filtering for term aggregations [#2717](https://github.com/quickwit-oss/tantivy/pull/2717)(@PSeitz)
+    - Add public accessors for intermediate aggregation results [#2829](https://github.com/quickwit-oss/tantivy/pull/2829)(@congx4)
+    - Replace HyperLogLog++ with Apache DataSketches HLL for cardinality aggregation [#2837](https://github.com/quickwit-oss/tantivy/pull/2837) [#2842](https://github.com/quickwit-oss/tantivy/pull/2842)(@congx4)
+- **Fast Fields**
+    - Add fast field fallback for `TermQuery` when the field is not indexed [#2693](https://github.com/quickwit-oss/tantivy/pull/2693)(@PSeitz-dd)
+    - Add fast field support for `Bytes` values [#2830](https://github.com/quickwit-oss/tantivy/pull/2830)(@mdashti)
+- Add support for regexes in the query grammar [#2677](https://github.com/quickwit-oss/tantivy/pull/2677) [#2818](https://github.com/quickwit-oss/tantivy/pull/2818)(@Darkheir)
+- Add erased `SortKeyComputer` for sorting on column types unknown until runtime [#2770](https://github.com/quickwit-oss/tantivy/pull/2770) [#2790](https://github.com/quickwit-oss/tantivy/pull/2790)(@stuhood @PSeitz)
+- Add natural-order-with-none-highest support in `TopDocs::order_by` [#2780](https://github.com/quickwit-oss/tantivy/pull/2780)(@stuhood)
+- Move stemming behing `stemmer` feature flag [#2791](https://github.com/quickwit-oss/tantivy/pull/2791)(@fulmicoton)
+- Make `DeleteMeta`, `advance_deletes`, `with_max_doc`, `serializer` module, and `delete_queue` public [#2765](https://github.com/quickwit-oss/tantivy/pull/2765) [#2766](https://github.com/quickwit-oss/tantivy/pull/2766) [#2835](https://github.com/quickwit-oss/tantivy/pull/2835)(@philippemnoel @PSeitz)
+- Make `Language` hashable [#2763](https://github.com/quickwit-oss/tantivy/pull/2763)(@philippemnoel)
+- Improve `space_usage` reporting for JSON fields and columnar data [#2761](https://github.com/quickwit-oss/tantivy/pull/2761)(@PSeitz-dd)
+
+## Performance
+- **Aggregation**
+    - Large speed up for nested high cardinality aggregations by using one collector per request instead of one per bucket, and add `PagedTermMap` for faster medium cardinality term aggregations [#2759](https://github.com/quickwit-oss/tantivy/pull/2759)(@PSeitz-dd)
+    - Optimize low-cardinality term aggregations by using a `Vec` instead of a `HashMap` [#2740](https://github.com/quickwit-oss/tantivy/pull/2740)(@fulmicoton-dd)
+- Optimize `ExistsQuery` for a high number of dynamic columns [#2694](https://github.com/quickwit-oss/tantivy/pull/2694)(@PSeitz-dd)
+- Add lazy scorers to stop score evaluation early when a doc won't reach the top-K threshold [#2726](https://github.com/quickwit-oss/tantivy/pull/2726) [#2777](https://github.com/quickwit-oss/tantivy/pull/2777)(@fulmicoton @stuhood)
+- Add `DocSet::cost()` and use it to order scorers in intersections [#2707](https://github.com/quickwit-oss/tantivy/pull/2707)(@PSeitz)
+- Add `collect_block` support for collector wrappers [#2727](https://github.com/quickwit-oss/tantivy/pull/2727)(@stuhood)
+- Optimize saturated posting lists by replacing them with `AllScorer` in boolean queries [#2745](https://github.com/quickwit-oss/tantivy/pull/2745) [#2760](https://github.com/quickwit-oss/tantivy/pull/2760) [#2774](https://github.com/quickwit-oss/tantivy/pull/2774)(@fulmicoton @mdashti @trinity-1686a)
+- Add `seek_danger` on `DocSet` for more efficient intersections [#2538](https://github.com/quickwit-oss/tantivy/pull/2538) [#2810](https://github.com/quickwit-oss/tantivy/pull/2810)(@PSeitz @stuhood @fulmicoton)
+- Skip column traversal in `RangeDocSet` when query range does not overlap with column bounds [#2783](https://github.com/quickwit-oss/tantivy/pull/2783)(@ChangRui-Ryan)
+- Speed up exclude queries by supporting multiple excluded `DocSet`s without intermediate union [#2825](https://github.com/quickwit-oss/tantivy/pull/2825)(@PSeitz)
+- Deduplicate queries in query parser [#2698](https://github.com/quickwit-oss/tantivy/pull/2698)(@PSeitz-dd)
+
 Tantivy 0.25
 ================================

--- a/Cargo.toml
+++ b/Cargo.toml
@@ -47,7 +47,7 @@ rustc-hash = "2.0.0"
 thiserror = "2.0.1"
 htmlescape = "0.3.1"
 fail = { version = "0.5.0", optional = true }
-time = { version = "0.3.35", features = ["serde-well-known"] }
+time = { version = "0.3.47", features = ["serde-well-known"] }
 smallvec = "1.8.0"
 rayon = "1.5.2"
 lru = "0.16.3"
@@ -64,7 +64,7 @@ query-grammar = { version = "0.25.0", path = "./query-grammar", package = "tanti
 tantivy-bitpacker = { version = "0.9", path = "./bitpacker" }
 common = { version = "0.10", path = "./common/", package = "tantivy-common" }
 tokenizer-api = { version = "0.6", path = "./tokenizer-api", package = "tantivy-tokenizer-api" }
-sketches-ddsketch = { path = "./sketches-ddsketch", features = ["use_serde"] }
+sketches-ddsketch = { git = "https://github.com/quickwit-oss/rust-sketches-ddsketch.git", rev = "555caf1", features = ["use_serde"] }
 datasketches = "0.2.0"
 futures-util = { version = "0.3.28", optional = true }
 futures-channel = { version = "0.3.28", optional = true }
@@ -86,7 +86,7 @@ futures = "0.3.21"
 paste = "1.0.11"
 more-asserts = "0.3.1"
 rand_distr = "0.5"
-time = { version = "0.3.10", features = ["serde-well-known", "macros"] }
+time = { version = "0.3.47", features = ["serde-well-known", "macros"] }
 postcard = { version = "1.0.4", features = [
    "use-std",
 ], default-features = false }
@@ -144,7 +144,6 @@ members = [
    "sstable",
    "tokenizer-api",
    "columnar",
-    "sketches-ddsketch",
 ]

 # Following the "fail" crate best practises, we isolate
@@ -202,4 +201,3 @@ harness = false
 [[bench]]
 name = "regex_all_terms"
 harness = false
-
--- a/benches/agg_bench.rs
+++ b/benches/agg_bench.rs
@@ -1,6 +1,5 @@
 use binggan::plugins::PeakMemAllocPlugin;
 use binggan::{black_box, InputGroup, PeakMemAlloc, INSTRUMENTED_SYSTEM};
-use common::DateTime;
 use rand::distr::weighted::WeightedIndex;
 use rand::rngs::StdRng;
 use rand::seq::IndexedRandom;
@@ -71,12 +70,6 @@ fn bench_agg(mut group: InputGroup<Index>) {

    register!(group, terms_many_json_mixed_type_with_avg_sub_agg);

-    register!(group, composite_term_many_page_1000);
-    register!(group, composite_term_many_page_1000_with_avg_sub_agg);
-    register!(group, composite_term_few);
-    register!(group, composite_histogram);
-    register!(group, composite_histogram_calendar);
-
    register!(group, cardinality_agg);
    register!(group, terms_status_with_cardinality_agg);

@@ -320,75 +313,6 @@ fn terms_many_json_mixed_type_with_avg_sub_agg(index: &Index) {
    });
    execute_agg(index, agg_req);
 }
-fn composite_term_few(index: &Index) {
-    let agg_req = json!({
-        "my_ctf": {
-            "composite": {
-                "sources": [
-                    { "text_few_terms": { "terms": { "field": "text_few_terms" } } }
-                ],
-                "size": 1000
-            }
-        },
-    });
-    execute_agg(index, agg_req);
-}
-fn composite_term_many_page_1000(index: &Index) {
-    let agg_req = json!({
-        "my_ctmp1000": {
-            "composite": {
-                "sources": [
-                    { "text_many_terms": { "terms": { "field": "text_many_terms" } } }
-                ],
-                "size": 1000
-            }
-        },
-    });
-    execute_agg(index, agg_req);
-}
-fn composite_term_many_page_1000_with_avg_sub_agg(index: &Index) {
-    let agg_req = json!({
-        "my_ctmp1000wasa": {
-            "composite": {
-                "sources": [
-                    { "text_many_terms": { "terms": { "field": "text_many_terms" } } }
-                ],
-                "size": 1000,
-
-            },
-            "aggs": {
-                "average_f64": { "avg": { "field": "score_f64" } }
-            }
-        },
-    });
-    execute_agg(index, agg_req);
-}
-fn composite_histogram(index: &Index) {
-    let agg_req = json!({
-        "my_ch": {
-            "composite": {
-                "sources": [
-                    { "f64_histogram": { "histogram": { "field": "score_f64", "interval": 1 } } }
-                ],
-                "size": 1000
-            }
-        },
-    });
-    execute_agg(index, agg_req);
-}
-fn composite_histogram_calendar(index: &Index) {
-    let agg_req = json!({
-        "my_chc": {
-            "composite": {
-                "sources": [
-                    { "time_histogram": { "date_histogram": { "field": "timestamp", "calendar_interval": "month" } } }
-                ],
-                "size": 1000
-            }
-        },
-    });
-    execute_agg(index, agg_req);
-}

 fn execute_agg(index: &Index, agg_req: serde_json::Value) {
    let agg_req: Aggregations = serde_json::from_value(agg_req).unwrap();
@@ -580,7 +504,6 @@ fn get_test_index_bench(cardinality: Cardinality) -> tantivy::Result<Index> {
    let score_field = schema_builder.add_u64_field("score", score_fieldtype.clone());
    let score_field_f64 = schema_builder.add_f64_field("score_f64", score_fieldtype.clone());
    let score_field_i64 = schema_builder.add_i64_field("score_i64", score_fieldtype);
-    let date_field = schema_builder.add_date_field("timestamp", FAST);
    // use tmp dir
    let index = if reuse_index {
        Index::create_in_dir("agg_bench", schema_builder.build())?
@@ -670,7 +593,6 @@ fn get_test_index_bench(cardinality: Cardinality) -> tantivy::Result<Index> {
                score_field => val as u64,
                score_field_f64 => lg_norm.sample(&mut rng),
                score_field_i64 => val as i64,
-                date_field => DateTime::from_timestamp_millis((val * 1_000_000.) as i64),
            ))?;
            if cardinality == Cardinality::OptionalSparse {
                for _ in 0..20 {
--- a/columnar/src/column_values/mod.rs
+++ b/columnar/src/column_values/mod.rs
@@ -31,7 +31,7 @@ pub use u64_based::{
    serialize_and_load_u64_based_column_values, serialize_u64_based_column_values,
 };
 pub use u128_based::{
-    CompactHit, CompactSpaceU64Accessor, open_u128_as_compact_u64, open_u128_mapped,
+    CompactSpaceU64Accessor, open_u128_as_compact_u64, open_u128_mapped,
    serialize_column_values_u128,
 };
 pub use vec_column::VecColumn;
--- a/columnar/src/column_values/u128_based/compact_space/mod.rs
+++ b/columnar/src/column_values/u128_based/compact_space/mod.rs
@@ -292,19 +292,6 @@ impl BinarySerializable for IPCodecParams {
    }
 }

-/// Represents the result of looking up a u128 value in the compact space.
-///
-/// If a value is outside the compact space, the next compact value is returned.
-#[derive(Debug, Clone, Copy, PartialEq, Eq)]
-pub enum CompactHit {
-    /// The value exists in the compact space
-    Exact(u32),
-    /// The value does not exist in the compact space, but the next higher value does
-    Next(u32),
-    /// The value is greater than the maximum compact value
-    AfterLast,
-}
-
 /// Exposes the compact space compressed values as u64.
 ///
 /// This allows faster access to the values, as u64 is faster to work with than u128.
@@ -322,11 +309,6 @@ impl CompactSpaceU64Accessor {
    pub fn compact_to_u128(&self, compact: u32) -> u128 {
        self.0.compact_to_u128(compact)
    }
-
-    /// Finds the next compact space value for a given u128 value.
-    pub fn u128_to_next_compact(&self, value: u128) -> CompactHit {
-        self.0.u128_to_next_compact(value)
-    }
 }

 impl ColumnValues<u64> for CompactSpaceU64Accessor {
@@ -448,26 +430,6 @@ impl CompactSpaceDecompressor {
        Ok(decompressor)
    }

-    /// Finds the next compact space value for a given u128 value
-    pub fn u128_to_next_compact(&self, value: u128) -> CompactHit {
-        // Try to convert to compact space
-        match self.u128_to_compact(value) {
-            // Value is in compact space, return its compact representation
-            Ok(compact) => CompactHit::Exact(compact),
-            // Value is not in compact space
-            Err(pos) => {
-                if pos >= self.params.compact_space.ranges_mapping.len() {
-                    // Value is beyond all ranges, no next value exists
-                    CompactHit::AfterLast
-                } else {
-                    // Get the next range and return its start compact value
-                    let next_range = &self.params.compact_space.ranges_mapping[pos];
-                    CompactHit::Next(next_range.compact_start)
-                }
-            }
-        }
-    }
-
    /// Converting to compact space for the decompressor is more complex, since we may get values
    /// which are outside the compact space. e.g. if we map
    /// 1000 => 5
@@ -861,41 +823,6 @@ mod tests {
        let _data = test_aux_vals(vals);
    }

-    #[test]
-    fn test_u128_to_next_compact() {
-        let vals = &[100u128, 200u128, 1_000_000_000u128, 1_000_000_100u128];
-        let mut data = test_aux_vals(vals);
-
-        let _header = U128Header::deserialize(&mut data);
-        let decomp = CompactSpaceDecompressor::open(data).unwrap();
-
-        // Test value that's already in a range
-        let compact_100 = decomp.u128_to_compact(100).unwrap();
-        assert_eq!(
-            decomp.u128_to_next_compact(100),
-            CompactHit::Exact(compact_100)
-        );
-
-        // Test value between two ranges
-        let compact_million = decomp.u128_to_compact(1_000_000_000).unwrap();
-        assert_eq!(
-            decomp.u128_to_next_compact(250),
-            CompactHit::Next(compact_million)
-        );
-
-        // Test value before the first range
-        assert_eq!(
-            decomp.u128_to_next_compact(50),
-            CompactHit::Next(compact_100)
-        );
-
-        // Test value after the last range
-        assert_eq!(
-            decomp.u128_to_next_compact(10_000_000_000),
-            CompactHit::AfterLast
-        );
-    }
-
    use proptest::prelude::*;

    fn num_strategy() -> impl Strategy<Value = u128> {
--- a/columnar/src/column_values/u128_based/mod.rs
+++ b/columnar/src/column_values/u128_based/mod.rs
@@ -7,7 +7,7 @@ mod compact_space;

 use common::{BinarySerializable, OwnedBytes, VInt};
 pub use compact_space::{
-    CompactHit, CompactSpaceCompressor, CompactSpaceDecompressor, CompactSpaceU64Accessor,
+    CompactSpaceCompressor, CompactSpaceDecompressor, CompactSpaceU64Accessor,
 };

 use crate::column_values::monotonic_map_column;
--- a/columnar/src/lib.rs
+++ b/columnar/src/lib.rs
@@ -59,7 +59,7 @@ pub struct RowAddr {
    pub row_id: RowId,
 }

-pub use sstable::{Dictionary, TermOrdHit};
+pub use sstable::Dictionary;
 pub type Streamer<'a> = sstable::Streamer<'a, VoidSSTable>;

 pub use common::DateTime;
--- a/common/Cargo.toml
+++ b/common/Cargo.toml
@@ -15,11 +15,10 @@ repository = "https://github.com/quickwit-oss/tantivy"
 byteorder = "1.4.3"
 ownedbytes = { version= "0.9", path="../ownedbytes" }
 async-trait = "0.1"
-time = { version = "0.3.10", features = ["serde-well-known"] }
+time = { version = "0.3.47", features = ["serde-well-known"] }
 serde = { version = "1.0.136", features = ["derive"] }

 [dev-dependencies]
 binggan = "0.14.0"
 proptest = "1.0.0"
 rand = "0.9"
-
--- a/common/src/writer.rs
+++ b/common/src/writer.rs
@@ -62,7 +62,9 @@ impl<W: TerminatingWrite> TerminatingWrite for CountingWriter<W> {
 pub struct AntiCallToken(());

 /// Trait used to indicate when no more write need to be done on a writer
-pub trait TerminatingWrite: Write + Send + Sync {
+///
+/// Thread-safety is enforced at the call sites that require it.
+pub trait TerminatingWrite: Write {
    /// Indicate that the writer will no longer be used. Internally call terminate_ref.
    fn terminate(mut self) -> io::Result<()>
    where Self: Sized {
--- a/sketches-ddsketch/Cargo.toml
+++ b/sketches-ddsketch/Cargo.toml
@@ -1,27 +0,0 @@
-[package]
-name = "sketches-ddsketch"
-version = "0.3.0"
-authors = ["Mike Heffner <mikeh@fesnel.com>"]
-edition = "2018"
-license = "Apache-2.0"
-readme = "README.md"
-repository = "https://github.com/mheffner/rust-sketches-ddsketch"
-homepage = "https://github.com/mheffner/rust-sketches-ddsketch"
-description = """
-A direct port of the Golang DDSketch implementation.
-"""
-exclude = [".gitignore"]
-
-# See more keys and their definitions at https://doc.rust-lang.org/cargo/reference/manifest.html
-
-[dependencies]
-serde = { package = "serde", version = "1.0", optional = true, features = ["derive", "serde_derive"] }
-
-[dev-dependencies]
-approx = "0.5.1"
-rand = "0.8.5"
-rand_distr = "0.4.3"
-
-[features]
-use_serde = ["serde", "serde/derive"]
-
--- a/sketches-ddsketch/LICENSE
+++ b/sketches-ddsketch/LICENSE
@@ -1,201 +0,0 @@
-                                 Apache License
-                           Version 2.0, January 2004
-                        http://www.apache.org/licenses/
-
-   TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
-
-   1. Definitions.
-
-      "License" shall mean the terms and conditions for use, reproduction,
-      and distribution as defined by Sections 1 through 9 of this document.
-
-      "Licensor" shall mean the copyright owner or entity authorized by
-      the copyright owner that is granting the License.
-
-      "Legal Entity" shall mean the union of the acting entity and all
-      other entities that control, are controlled by, or are under common
-      control with that entity. For the purposes of this definition,
-      "control" means (i) the power, direct or indirect, to cause the
-      direction or management of such entity, whether by contract or
-      otherwise, or (ii) ownership of fifty percent (50%) or more of the
-      outstanding shares, or (iii) beneficial ownership of such entity.
-
-      "You" (or "Your") shall mean an individual or Legal Entity
-      exercising permissions granted by this License.
-
-      "Source" form shall mean the preferred form for making modifications,
-      including but not limited to software source code, documentation
-      source, and configuration files.
-
-      "Object" form shall mean any form resulting from mechanical
-      transformation or translation of a Source form, including but
-      not limited to compiled object code, generated documentation,
-      and conversions to other media types.
-
-      "Work" shall mean the work of authorship, whether in Source or
-      Object form, made available under the License, as indicated by a
-      copyright notice that is included in or attached to the work
-      (an example is provided in the Appendix below).
-
-      "Derivative Works" shall mean any work, whether in Source or Object
-      form, that is based on (or derived from) the Work and for which the
-      editorial revisions, annotations, elaborations, or other modifications
-      represent, as a whole, an original work of authorship. For the purposes
-      of this License, Derivative Works shall not include works that remain
-      separable from, or merely link (or bind by name) to the interfaces of,
-      the Work and Derivative Works thereof.
-
-      "Contribution" shall mean any work of authorship, including
-      the original version of the Work and any modifications or additions
-      to that Work or Derivative Works thereof, that is intentionally
-      submitted to Licensor for inclusion in the Work by the copyright owner
-      or by an individual or Legal Entity authorized to submit on behalf of
-      the copyright owner. For the purposes of this definition, "submitted"
-      means any form of electronic, verbal, or written communication sent
-      to the Licensor or its representatives, including but not limited to
-      communication on electronic mailing lists, source code control systems,
-      and issue tracking systems that are managed by, or on behalf of, the
-      Licensor for the purpose of discussing and improving the Work, but
-      excluding communication that is conspicuously marked or otherwise
-      designated in writing by the copyright owner as "Not a Contribution."
-
-      "Contributor" shall mean Licensor and any individual or Legal Entity
-      on behalf of whom a Contribution has been received by Licensor and
-      subsequently incorporated within the Work.
-
-   2. Grant of Copyright License. Subject to the terms and conditions of
-      this License, each Contributor hereby grants to You a perpetual,
-      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
-      copyright license to reproduce, prepare Derivative Works of,
-      publicly display, publicly perform, sublicense, and distribute the
-      Work and such Derivative Works in Source or Object form.
-
-   3. Grant of Patent License. Subject to the terms and conditions of
-      this License, each Contributor hereby grants to You a perpetual,
-      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
-      (except as stated in this section) patent license to make, have made,
-      use, offer to sell, sell, import, and otherwise transfer the Work,
-      where such license applies only to those patent claims licensable
-      by such Contributor that are necessarily infringed by their
-      Contribution(s) alone or by combination of their Contribution(s)
-      with the Work to which such Contribution(s) was submitted. If You
-      institute patent litigation against any entity (including a
-      cross-claim or counterclaim in a lawsuit) alleging that the Work
-      or a Contribution incorporated within the Work constitutes direct
-      or contributory patent infringement, then any patent licenses
-      granted to You under this License for that Work shall terminate
-      as of the date such litigation is filed.
-
-   4. Redistribution. You may reproduce and distribute copies of the
-      Work or Derivative Works thereof in any medium, with or without
-      modifications, and in Source or Object form, provided that You
-      meet the following conditions:
-
-      (a) You must give any other recipients of the Work or
-          Derivative Works a copy of this License; and
-
-      (b) You must cause any modified files to carry prominent notices
-          stating that You changed the files; and
-
-      (c) You must retain, in the Source form of any Derivative Works
-          that You distribute, all copyright, patent, trademark, and
-          attribution notices from the Source form of the Work,
-          excluding those notices that do not pertain to any part of
-          the Derivative Works; and
-
-      (d) If the Work includes a "NOTICE" text file as part of its
-          distribution, then any Derivative Works that You distribute must
-          include a readable copy of the attribution notices contained
-          within such NOTICE file, excluding those notices that do not
-          pertain to any part of the Derivative Works, in at least one
-          of the following places: within a NOTICE text file distributed
-          as part of the Derivative Works; within the Source form or
-          documentation, if provided along with the Derivative Works; or,
-          within a display generated by the Derivative Works, if and
-          wherever such third-party notices normally appear. The contents
-          of the NOTICE file are for informational purposes only and
-          do not modify the License. You may add Your own attribution
-          notices within Derivative Works that You distribute, alongside
-          or as an addendum to the NOTICE text from the Work, provided
-          that such additional attribution notices cannot be construed
-          as modifying the License.
-
-      You may add Your own copyright statement to Your modifications and
-      may provide additional or different license terms and conditions
-      for use, reproduction, or distribution of Your modifications, or
-      for any such Derivative Works as a whole, provided Your use,
-      reproduction, and distribution of the Work otherwise complies with
-      the conditions stated in this License.
-
-   5. Submission of Contributions. Unless You explicitly state otherwise,
-      any Contribution intentionally submitted for inclusion in the Work
-      by You to the Licensor shall be under the terms and conditions of
-      this License, without any additional terms or conditions.
-      Notwithstanding the above, nothing herein shall supersede or modify
-      the terms of any separate license agreement you may have executed
-      with Licensor regarding such Contributions.
-
-   6. Trademarks. This License does not grant permission to use the trade
-      names, trademarks, service marks, or product names of the Licensor,
-      except as required for reasonable and customary use in describing the
-      origin of the Work and reproducing the content of the NOTICE file.
-
-   7. Disclaimer of Warranty. Unless required by applicable law or
-      agreed to in writing, Licensor provides the Work (and each
-      Contributor provides its Contributions) on an "AS IS" BASIS,
-      WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
-      implied, including, without limitation, any warranties or conditions
-      of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
-      PARTICULAR PURPOSE. You are solely responsible for determining the
-      appropriateness of using or redistributing the Work and assume any
-      risks associated with Your exercise of permissions under this License.
-
-   8. Limitation of Liability. In no event and under no legal theory,
-      whether in tort (including negligence), contract, or otherwise,
-      unless required by applicable law (such as deliberate and grossly
-      negligent acts) or agreed to in writing, shall any Contributor be
-      liable to You for damages, including any direct, indirect, special,
-      incidental, or consequential damages of any character arising as a
-      result of this License or out of the use or inability to use the
-      Work (including but not limited to damages for loss of goodwill,
-      work stoppage, computer failure or malfunction, or any and all
-      other commercial damages or losses), even if such Contributor
-      has been advised of the possibility of such damages.
-
-   9. Accepting Warranty or Additional Liability. While redistributing
-      the Work or Derivative Works thereof, You may choose to offer,
-      and charge a fee for, acceptance of support, warranty, indemnity,
-      or other liability obligations and/or rights consistent with this
-      License. However, in accepting such obligations, You may act only
-      on Your own behalf and on Your sole responsibility, not on behalf
-      of any other Contributor, and only if You agree to indemnify,
-      defend, and hold each Contributor harmless for any liability
-      incurred by, or claims asserted against, such Contributor by reason
-      of your accepting any such warranty or additional liability.
-
-   END OF TERMS AND CONDITIONS
-
-   APPENDIX: How to apply the Apache License to your work.
-
-      To apply the Apache License to your work, attach the following
-      boilerplate notice, with the fields enclosed by brackets "[]"
-      replaced with your own identifying information. (Don't include
-      the brackets!)  The text should be enclosed in the appropriate
-      comment syntax for the file format. We also recommend that a
-      file or class name and description of purpose be included on the
-      same "printed page" as the copyright notice for easier
-      identification within third-party archives.
-
-   Copyright [2019] [Mike Heffner]
-
-   Licensed under the Apache License, Version 2.0 (the "License");
-   you may not use this file except in compliance with the License.
-   You may obtain a copy of the License at
-
-       http://www.apache.org/licenses/LICENSE-2.0
-
-   Unless required by applicable law or agreed to in writing, software
-   distributed under the License is distributed on an "AS IS" BASIS,
-   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-   See the License for the specific language governing permissions and
-   limitations under the License.
--- a/sketches-ddsketch/Makefile
+++ b/sketches-ddsketch/Makefile
@@ -1,11 +0,0 @@
-clean:
-	cargo clean
-
-test:
-	cargo test
-
-test_logs:
-	cargo test -- --nocapture
-
-test_performance:
-	cargo test --release --jobs 1 test_performance -- --ignored --nocapture
--- a/sketches-ddsketch/README.md
+++ b/sketches-ddsketch/README.md
@@ -1,37 +0,0 @@
-# sketches-ddsketch
-
-This is a direct port of the [Golang](https://github.com/DataDog/sketches-go) 
-[DDSketch](https://arxiv.org/pdf/1908.10693.pdf) quantile sketch implementation 
-to Rust. DDSketch is a fully-mergeable quantile sketch with relative-error 
-guarantees and is extremely fast.
-
-# DDSketch
-
-* Sketch size automatically grows as needed, starting with 128 bins.
-* Extremely fast sample insertion and sketch merges.
-
-## Usage
-
-```rust
-use sketches_ddsketch::{Config, DDSketch};
-
-let config = Config::defaults();
-let mut sketch = DDSketch::new(c);
-
-sketch.add(1.0);
-sketch.add(1.0);
-sketch.add(1.0);
-
-// Get p=50%
-let quantile = sketch.quantile(0.5).unwrap();
-assert_eq!(quantile, Some(1.0));
-```
-
-## Performance
-
-No performance tuning has been done with this implementation of the port, so we
-would expect similar profiles to the original implementation.
-
-Out of the box we see can achieve over 70M sample inserts/sec and 350K sketch
-merges/sec. All tests run on a single core Intel i7 processor with 4.2Ghz max 
-clock.
--- a/sketches-ddsketch/src/config.rs
+++ b/sketches-ddsketch/src/config.rs
@@ -1,98 +0,0 @@
-#[cfg(feature = "use_serde")]
-use serde::{Deserialize, Serialize};
-
-const DEFAULT_MAX_BINS: u32 = 2048;
-const DEFAULT_ALPHA: f64 = 0.01;
-const DEFAULT_MIN_VALUE: f64 = 1.0e-9;
-
-/// The configuration struct for constructing a `DDSketch`
-#[derive(Copy, Clone, Debug, PartialEq)]
-#[cfg_attr(feature = "use_serde", derive(Serialize, Deserialize))]
-pub struct Config {
-    pub max_num_bins: u32,
-    pub gamma: f64,
-    pub(crate) gamma_ln: f64,
-    pub(crate) min_value: f64,
-    pub offset: i32,
-}
-
-fn log_gamma(value: f64, gamma_ln: f64) -> f64 {
-    value.ln() / gamma_ln
-}
-
-impl Config {
-    /// Construct a new `Config` struct with specific parameters. If you are unsure of how to
-    /// configure this, the `defaults` method constructs a `Config` with built-in defaults.
-    ///
-    /// `max_num_bins` is the max number of bins the DDSketch will grow to, in steps of 128 bins.
-    pub fn new(alpha: f64, max_num_bins: u32, min_value: f64) -> Self {
-        // Aligned with Java's LogarithmicMapping / LogLikeIndexMapping:
-        //   gamma = (1 + alpha) / (1 - alpha)  (correctingFactor=1 for LogarithmicMapping)
-        //   gamma_ln = gamma.ln()  (not ln_1p, to match Java's Math.log(gamma))
-        // See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/mapping/LogLikeIndexMapping.java  (gamma() static method)
-        // See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/mapping/LogarithmicMapping.java  (constructor, correctingFactor()=1)
-        let gamma = (1.0 + alpha) / (1.0 - alpha);
-        let gamma_ln = gamma.ln();
-
-        Config {
-            max_num_bins,
-            gamma,
-            gamma_ln,
-            min_value,
-            offset: 1 - (log_gamma(min_value, gamma_ln) as i32),
-        }
-    }
-
-    /// Return a `Config` using built-in default settings
-    pub fn defaults() -> Self {
-        Self::new(DEFAULT_ALPHA, DEFAULT_MAX_BINS, DEFAULT_MIN_VALUE)
-    }
-
-    pub fn key(&self, v: f64) -> i32 {
-        // Aligned with Java's LogLikeIndexMapping.index(): floor-based indexing.
-        // Java uses `(int) index` / `(int) index - 1` which is equivalent to floor().
-        // See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/mapping/LogLikeIndexMapping.java  (index() method)
-        self.log_gamma(v).floor() as i32
-    }
-
-    pub fn value(&self, key: i32) -> f64 {
-        // Aligned with Java's LogLikeIndexMapping.value():
-        //   lowerBound(index) * (1 + relativeAccuracy)
-        //   = logInverse((index - indexOffset) / multiplier) * (1 + relativeAccuracy)
-        //   = gamma^key * 2*gamma/(gamma+1)
-        // See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/mapping/LogLikeIndexMapping.java  (value() and lowerBound() methods)
-        self.pow_gamma(key) * (2.0 * self.gamma / (1.0 + self.gamma))
-    }
-
-    pub fn log_gamma(&self, value: f64) -> f64 {
-        log_gamma(value, self.gamma_ln)
-    }
-
-    pub fn pow_gamma(&self, key: i32) -> f64 {
-        ((key as f64) * self.gamma_ln).exp()
-    }
-
-    pub fn min_possible(&self) -> f64 {
-        self.min_value
-    }
-
-    /// Reconstruct a Config from a gamma value (as decoded from the binary format).
-    /// Uses default max_num_bins and min_value.
-    /// See Java: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/mapping/LogarithmicMapping.java  (LogarithmicMapping(double gamma, double indexOffset) constructor)
-    pub(crate) fn from_gamma(gamma: f64) -> Self {
-        let gamma_ln = gamma.ln();
-        Config {
-            max_num_bins: DEFAULT_MAX_BINS,
-            gamma,
-            gamma_ln,
-            min_value: DEFAULT_MIN_VALUE,
-            offset: 1 - (log_gamma(DEFAULT_MIN_VALUE, gamma_ln) as i32),
-        }
-    }
-}
-
-impl Default for Config {
-    fn default() -> Self {
-        Self::new(DEFAULT_ALPHA, DEFAULT_MAX_BINS, DEFAULT_MIN_VALUE)
-    }
-}
--- a/sketches-ddsketch/src/ddsketch.rs
+++ b/sketches-ddsketch/src/ddsketch.rs
@@ -1,385 +0,0 @@
-use std::{error, fmt};
-
-#[cfg(feature = "use_serde")]
-use serde::{Deserialize, Serialize};
-
-use crate::config::Config;
-use crate::store::Store;
-
-type Result<T> = std::result::Result<T, DDSketchError>;
-
-/// General error type for DDSketch, represents either an invalid quantile or an
-/// incompatible merge operation.
-#[derive(Debug, Clone)]
-pub enum DDSketchError {
-    Quantile,
-    Merge,
-}
-impl fmt::Display for DDSketchError {
-    fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result {
-        match self {
-            DDSketchError::Quantile => {
-                write!(f, "Invalid quantile, must be between 0 and 1 (inclusive)")
-            }
-            DDSketchError::Merge => write!(f, "Can not merge sketches with different configs"),
-        }
-    }
-}
-impl error::Error for DDSketchError {
-    fn source(&self) -> Option<&(dyn error::Error + 'static)> {
-        // Generic
-        None
-    }
-}
-
-/// This struct represents a [DDSketch](https://arxiv.org/pdf/1908.10693.pdf)
-#[derive(Clone)]
-#[cfg_attr(feature = "use_serde", derive(Serialize, Deserialize))]
-pub struct DDSketch {
-    pub(crate) config: Config,
-    pub(crate) store: Store,
-    pub(crate) negative_store: Store,
-    pub(crate) min: f64,
-    pub(crate) max: f64,
-    pub(crate) sum: f64,
-    pub(crate) zero_count: u64,
-}
-
-impl Default for DDSketch {
-    fn default() -> Self {
-        Self::new(Default::default())
-    }
-}
-
-// XXX: functions should return Option<> in the case of empty
-impl DDSketch {
-    /// Construct a `DDSketch`. Requires a `Config` specifying the parameters of the sketch
-    pub fn new(config: Config) -> Self {
-        DDSketch {
-            config,
-            store: Store::new(config.max_num_bins as usize),
-            negative_store: Store::new(config.max_num_bins as usize),
-            min: f64::INFINITY,
-            max: f64::NEG_INFINITY,
-            sum: 0.0,
-            zero_count: 0,
-        }
-    }
-
-    /// Add the sample to the sketch
-    pub fn add(&mut self, v: f64) {
-        if v > self.config.min_possible() {
-            let key = self.config.key(v);
-            self.store.add(key);
-        } else if v < -self.config.min_possible() {
-            let key = self.config.key(-v);
-            self.negative_store.add(key);
-        } else {
-            self.zero_count += 1;
-        }
-
-        if v < self.min {
-            self.min = v;
-        }
-        if self.max < v {
-            self.max = v;
-        }
-        self.sum += v;
-    }
-
-    /// Return the quantile value for quantiles between 0.0 and 1.0. Result is an error, represented
-    /// as DDSketchError::Quantile if the requested quantile is outside of that range.
-    ///
-    /// If the sketch is empty the result is None, else Some(v) for the quantile value.
-    pub fn quantile(&self, q: f64) -> Result<Option<f64>> {
-        if !(0.0..=1.0).contains(&q) {
-            return Err(DDSketchError::Quantile);
-        }
-
-        if self.empty() {
-            return Ok(None);
-        }
-
-        if q == 0.0 {
-            return Ok(Some(self.min));
-        } else if q == 1.0 {
-            return Ok(Some(self.max));
-        }
-
-        let rank = (q * (self.count() as f64 - 1.0)) as u64;
-        let quantile;
-        if rank < self.negative_store.count() {
-            let reversed_rank = self.negative_store.count() - rank - 1;
-            let key = self.negative_store.key_at_rank(reversed_rank);
-            quantile = -self.config.value(key);
-        } else if rank < self.zero_count + self.negative_store.count() {
-            quantile = 0.0;
-        } else {
-            let key = self
-                .store
-                .key_at_rank(rank - self.zero_count - self.negative_store.count());
-            quantile = self.config.value(key);
-        }
-
-        Ok(Some(quantile))
-    }
-
-    /// Returns the minimum value seen, or None if sketch is empty
-    pub fn min(&self) -> Option<f64> {
-        if self.empty() {
-            None
-        } else {
-            Some(self.min)
-        }
-    }
-
-    /// Returns the maximum value seen, or None if sketch is empty
-    pub fn max(&self) -> Option<f64> {
-        if self.empty() {
-            None
-        } else {
-            Some(self.max)
-        }
-    }
-
-    /// Returns the sum of values seen, or None if sketch is empty
-    pub fn sum(&self) -> Option<f64> {
-        if self.empty() {
-            None
-        } else {
-            Some(self.sum)
-        }
-    }
-
-    /// Returns the number of values added to the sketch
-    pub fn count(&self) -> usize {
-        (self.store.count() + self.zero_count + self.negative_store.count()) as usize
-    }
-
-    /// Returns the length of the underlying `Store`. This is mainly only useful for understanding
-    /// how much the sketch has grown given the inserted values.
-    pub fn length(&self) -> usize {
-        self.store.length() as usize + self.negative_store.length() as usize
-    }
-
-    /// Merge the contents of another sketch into this one. The sketch that is merged into this one
-    /// is unchanged after the merge.
-    pub fn merge(&mut self, o: &DDSketch) -> Result<()> {
-        if self.config != o.config {
-            return Err(DDSketchError::Merge);
-        }
-
-        let was_empty = self.store.count() == 0;
-
-        // Merge the stores
-        self.store.merge(&o.store);
-        self.negative_store.merge(&o.negative_store);
-        self.zero_count += o.zero_count;
-
-        // Need to ensure we don't override min/max with initializers
-        // if either store were empty
-        if was_empty {
-            self.min = o.min;
-            self.max = o.max;
-        } else if o.store.count() > 0 {
-            if o.min < self.min {
-                self.min = o.min
-            }
-            if o.max > self.max {
-                self.max = o.max;
-            }
-        }
-        self.sum += o.sum;
-
-        Ok(())
-    }
-
-    fn empty(&self) -> bool {
-        self.count() == 0
-    }
-
-    /// Encode this sketch into the Java-compatible binary format used by
-    /// `com.datadoghq.sketch.ddsketch.DDSketchWithExactSummaryStatistics`.
-    pub fn to_java_bytes(&self) -> Vec<u8> {
-        crate::encoding::encode_to_java_bytes(self)
-    }
-
-    /// Decode a sketch from the Java-compatible binary format.
-    /// Accepts bytes produced by Java's `DDSketchWithExactSummaryStatistics.encode()`
-    /// with or without the `0x02` version prefix.
-    pub fn from_java_bytes(
-        bytes: &[u8],
-    ) -> std::result::Result<Self, crate::encoding::DecodeError> {
-        crate::encoding::decode_from_java_bytes(bytes)
-    }
-}
-
-#[cfg(test)]
-mod tests {
-    use approx::assert_relative_eq;
-
-    use crate::{Config, DDSketch};
-
-    #[test]
-    fn test_add_zero() {
-        let alpha = 0.01;
-        let c = Config::new(alpha, 2048, 10e-9);
-        let mut dd = DDSketch::new(c);
-        dd.add(0.0);
-    }
-
-    #[test]
-    fn test_quartiles() {
-        let alpha = 0.01;
-        let c = Config::new(alpha, 2048, 10e-9);
-        let mut dd = DDSketch::new(c);
-
-        // Initialize sketch with {1.0, 2.0, 3.0, 4.0}
-        for i in 1..5 {
-            dd.add(i as f64);
-        }
-
-        // We expect the following mappings from quantile to value:
-        // [0,0.33]: 1.0, (0.34,0.66]: 2.0, (0.67,0.99]: 3.0, (0.99, 1.0]: 4.0
-        let test_cases = vec![
-            (0.0, 1.0),
-            (0.25, 1.0),
-            (0.33, 1.0),
-            (0.34, 2.0),
-            (0.5, 2.0),
-            (0.66, 2.0),
-            (0.67, 3.0),
-            (0.75, 3.0),
-            (0.99, 3.0),
-            (1.0, 4.0),
-        ];
-
-        for (q, val) in test_cases {
-            assert_relative_eq!(dd.quantile(q).unwrap().unwrap(), val, max_relative = alpha);
-        }
-    }
-
-    #[test]
-    fn test_neg_quartiles() {
-        let alpha = 0.01;
-        let c = Config::new(alpha, 2048, 10e-9);
-        let mut dd = DDSketch::new(c);
-
-        // Initialize sketch with {1.0, 2.0, 3.0, 4.0}
-        for i in 1..5 {
-            dd.add(-i as f64);
-        }
-
-        let test_cases = vec![
-            (0.0, -4.0),
-            (0.25, -4.0),
-            (0.5, -3.0),
-            (0.75, -2.0),
-            (1.0, -1.0),
-        ];
-
-        for (q, val) in test_cases {
-            assert_relative_eq!(dd.quantile(q).unwrap().unwrap(), val, max_relative = alpha);
-        }
-    }
-
-    #[test]
-    fn test_simple_quantile() {
-        let c = Config::defaults();
-        let mut dd = DDSketch::new(c);
-
-        for i in 1..101 {
-            dd.add(i as f64);
-        }
-
-        assert_eq!(dd.quantile(0.95).unwrap().unwrap().ceil(), 95.0);
-
-        assert!(dd.quantile(-1.01).is_err());
-        assert!(dd.quantile(1.01).is_err());
-    }
-
-    #[test]
-    fn test_empty_sketch() {
-        let c = Config::defaults();
-        let dd = DDSketch::new(c);
-
-        assert_eq!(dd.quantile(0.98).unwrap(), None);
-        assert_eq!(dd.max(), None);
-        assert_eq!(dd.min(), None);
-        assert_eq!(dd.sum(), None);
-        assert_eq!(dd.count(), 0);
-
-        assert!(dd.quantile(1.01).is_err());
-    }
-
-    #[test]
-    fn test_basic_histogram_data() {
-        let values = &[
-            0.754225035,
-            0.752900282,
-            0.752812246,
-            0.752602367,
-            0.754310155,
-            0.753525981,
-            0.752981082,
-            0.752715536,
-            0.751667941,
-            0.755079054,
-            0.753528150,
-            0.755188464,
-            0.752508723,
-            0.750064549,
-            0.753960428,
-            0.751139298,
-            0.752523560,
-            0.753253428,
-            0.753498342,
-            0.751858358,
-            0.752104636,
-            0.753841300,
-            0.754467374,
-            0.753814334,
-            0.750881719,
-            0.753182556,
-            0.752576884,
-            0.753945708,
-            0.753571911,
-            0.752314573,
-            0.752586651,
-        ];
-
-        let c = Config::defaults();
-        let mut dd = DDSketch::new(c);
-
-        for value in values {
-            dd.add(*value);
-        }
-
-        assert_eq!(dd.max(), Some(0.755188464));
-        assert_eq!(dd.min(), Some(0.750064549));
-        assert_eq!(dd.count(), 31);
-        assert_eq!(dd.sum(), Some(23.343630625000003));
-
-        assert!(dd.quantile(0.25).unwrap().is_some());
-        assert!(dd.quantile(0.5).unwrap().is_some());
-        assert!(dd.quantile(0.75).unwrap().is_some());
-    }
-
-    #[test]
-    fn test_length() {
-        let mut dd = DDSketch::default();
-        assert_eq!(dd.length(), 0);
-
-        dd.add(1.0);
-        assert_eq!(dd.length(), 128);
-        dd.add(2.0);
-        dd.add(3.0);
-        assert_eq!(dd.length(), 128);
-
-        dd.add(-1.0);
-        assert_eq!(dd.length(), 256);
-        dd.add(-2.0);
-        dd.add(-3.0);
-        assert_eq!(dd.length(), 256);
-    }
-}
--- a/sketches-ddsketch/src/encoding.rs
+++ b/sketches-ddsketch/src/encoding.rs
@@ -1,813 +0,0 @@
-//! Java-compatible binary encoding/decoding for DDSketch.
-//!
-//! This module implements the binary format used by the Java
-//! `com.datadoghq.sketch.ddsketch.DDSketchWithExactSummaryStatistics` class
-//! from the DataDog/sketches-java library. It enables cross-language
-//! serialization so that sketches produced in Rust can be deserialized
-//! and merged by Java consumers.
-
-use std::fmt;
-
-use crate::config::Config;
-use crate::ddsketch::DDSketch;
-use crate::store::Store;
-
-// ---------------------------------------------------------------------------
-// Flag byte layout
-//
-// Each flag byte packs a 2-bit type ordinal in the low bits and a 6-bit
-// subflag in the upper bits:  (subflag << 2) | type_ordinal
-// See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/encoding/Flag.java
-// ---------------------------------------------------------------------------
-
-/// The 2-bit type field occupying the low bits of every flag byte.
-#[repr(u8)]
-#[derive(Debug, Clone, Copy, PartialEq, Eq)]
-enum FlagType {
-    SketchFeatures = 0,
-    PositiveStore = 1,
-    IndexMapping = 2,
-    NegativeStore = 3,
-}
-
-impl FlagType {
-    fn from_byte(b: u8) -> Option<Self> {
-        match b & 0x03 {
-            0 => Some(Self::SketchFeatures),
-            1 => Some(Self::PositiveStore),
-            2 => Some(Self::IndexMapping),
-            3 => Some(Self::NegativeStore),
-            _ => None,
-        }
-    }
-}
-
-/// Construct a flag byte from a subflag and a type.
-const fn flag(subflag: u8, flag_type: FlagType) -> u8 {
-    (subflag << 2) | (flag_type as u8)
-}
-
-// Pre-computed flag bytes for the sketch features we encode/decode.
-const FLAG_INDEX_MAPPING_LOG: u8 = flag(0, FlagType::IndexMapping); // 0x02
-const FLAG_ZERO_COUNT: u8 = flag(1, FlagType::SketchFeatures); // 0x04
-const FLAG_COUNT: u8 = flag(0x28, FlagType::SketchFeatures); // 0xA0
-const FLAG_SUM: u8 = flag(0x21, FlagType::SketchFeatures); // 0x84
-const FLAG_MIN: u8 = flag(0x22, FlagType::SketchFeatures); // 0x88
-const FLAG_MAX: u8 = flag(0x23, FlagType::SketchFeatures); // 0x8C
-
-/// BinEncodingMode subflags for store flag bytes.
-/// See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/encoding/BinEncodingMode.java
-#[repr(u8)]
-#[derive(Debug, Clone, Copy, PartialEq, Eq)]
-enum BinEncodingMode {
-    IndexDeltasAndCounts = 1,
-    IndexDeltas = 2,
-    ContiguousCounts = 3,
-}
-
-impl BinEncodingMode {
-    fn from_subflag(subflag: u8) -> Option<Self> {
-        match subflag {
-            1 => Some(Self::IndexDeltasAndCounts),
-            2 => Some(Self::IndexDeltas),
-            3 => Some(Self::ContiguousCounts),
-            _ => None,
-        }
-    }
-}
-
-const VAR_DOUBLE_ROTATE_DISTANCE: u32 = 6;
-const MAX_VAR_LEN_64: usize = 9;
-
-const DEFAULT_MAX_BINS: u32 = 2048;
-
-// ---------------------------------------------------------------------------
-// Error type
-// ---------------------------------------------------------------------------
-
-#[derive(Debug, Clone)]
-pub enum DecodeError {
-    UnexpectedEof,
-    InvalidFlag(u8),
-    InvalidData(String),
-}
-
-impl fmt::Display for DecodeError {
-    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
-        match self {
-            Self::UnexpectedEof => write!(f, "unexpected end of input"),
-            Self::InvalidFlag(b) => write!(f, "invalid flag byte: 0x{b:02X}"),
-            Self::InvalidData(msg) => write!(f, "invalid data: {msg}"),
-        }
-    }
-}
-
-impl std::error::Error for DecodeError {}
-
-// ---------------------------------------------------------------------------
-// VarEncoding — bit-exact port of Java VarEncodingHelper
-// See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/encoding/VarEncodingHelper.java
-// ---------------------------------------------------------------------------
-
-fn encode_unsigned_var_long(out: &mut Vec<u8>, mut value: u64) {
-    let length = ((63 - value.leading_zeros() as i32) / 7).clamp(0, 8);
-    for _ in 0..length {
-        out.push((value as u8) | 0x80);
-        value >>= 7;
-    }
-    out.push(value as u8);
-}
-
-fn decode_unsigned_var_long(input: &mut &[u8]) -> Result<u64, DecodeError> {
-    let mut value: u64 = 0;
-    let mut shift: u32 = 0;
-    loop {
-        let next = read_byte(input)?;
-        if next < 0x80 || shift == 56 {
-            return Ok(value | (u64::from(next) << shift));
-        }
-        value |= (u64::from(next) & 0x7F) << shift;
-        shift += 7;
-    }
-}
-
-/// ZigZag encode then var-long encode.
-fn encode_signed_var_long(out: &mut Vec<u8>, value: i64) {
-    let encoded = ((value >> 63) ^ (value << 1)) as u64;
-    encode_unsigned_var_long(out, encoded);
-}
-
-fn decode_signed_var_long(input: &mut &[u8]) -> Result<i64, DecodeError> {
-    let encoded = decode_unsigned_var_long(input)?;
-    Ok(((encoded >> 1) as i64) ^ -((encoded & 1) as i64))
-}
-
-fn double_to_var_bits(value: f64) -> u64 {
-    let bits = f64::to_bits(value + 1.0).wrapping_sub(f64::to_bits(1.0));
-    bits.rotate_left(VAR_DOUBLE_ROTATE_DISTANCE)
-}
-
-fn var_bits_to_double(bits: u64) -> f64 {
-    f64::from_bits(
-        bits.rotate_right(VAR_DOUBLE_ROTATE_DISTANCE)
-            .wrapping_add(f64::to_bits(1.0)),
-    ) - 1.0
-}
-
-fn encode_var_double(out: &mut Vec<u8>, value: f64) {
-    let mut bits = double_to_var_bits(value);
-    for _ in 0..MAX_VAR_LEN_64 - 1 {
-        let next = (bits >> 57) as u8;
-        bits <<= 7;
-        if bits == 0 {
-            out.push(next);
-            return;
-        }
-        out.push(next | 0x80);
-    }
-    out.push((bits >> 56) as u8);
-}
-
-fn decode_var_double(input: &mut &[u8]) -> Result<f64, DecodeError> {
-    let mut bits: u64 = 0;
-    let mut shift: i32 = 57; // 8*8 - 7
-    loop {
-        let next = read_byte(input)?;
-        if shift == 1 {
-            bits |= u64::from(next);
-            break;
-        }
-        if next < 0x80 {
-            bits |= u64::from(next) << shift;
-            break;
-        }
-        bits |= (u64::from(next) & 0x7F) << shift;
-        shift -= 7;
-    }
-    Ok(var_bits_to_double(bits))
-}
-
-// ---------------------------------------------------------------------------
-// Byte-level helpers
-// ---------------------------------------------------------------------------
-
-fn read_byte(input: &mut &[u8]) -> Result<u8, DecodeError> {
-    match input.split_first() {
-        Some((&byte, rest)) => {
-            *input = rest;
-            Ok(byte)
-        }
-        None => Err(DecodeError::UnexpectedEof),
-    }
-}
-
-fn write_f64_le(out: &mut Vec<u8>, value: f64) {
-    out.extend_from_slice(&value.to_le_bytes());
-}
-
-fn read_f64_le(input: &mut &[u8]) -> Result<f64, DecodeError> {
-    if input.len() < 8 {
-        return Err(DecodeError::UnexpectedEof);
-    }
-    let (bytes, rest) = input.split_at(8);
-    *input = rest;
-    // bytes is guaranteed to be length 8 by the split_at above.
-    let arr = [
-        bytes[0], bytes[1], bytes[2], bytes[3], bytes[4], bytes[5], bytes[6], bytes[7],
-    ];
-    Ok(f64::from_le_bytes(arr))
-}
-
-// ---------------------------------------------------------------------------
-// Store encoding/decoding
-// See: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/store/DenseStore.java  (encode/decode methods)
-// ---------------------------------------------------------------------------
-
-/// Collect non-zero bins in the store as (absolute_index, count) pairs.
-///
-/// Allocation is acceptable here: this runs once per encode and the Vec
-/// has at most `max_num_bins` entries.
-fn collect_non_zero_bins(store: &Store) -> Vec<(i32, u64)> {
-    if store.count == 0 {
-        return Vec::new();
-    }
-    let start = (store.min_key - store.offset) as usize;
-    let end = ((store.max_key - store.offset + 1) as usize).min(store.bins.len());
-    store.bins[start..end]
-        .iter()
-        .enumerate()
-        .filter(|&(_, &count)| count > 0)
-        .map(|(i, &count)| (start as i32 + i as i32 + store.offset, count))
-        .collect()
-}
-
-fn encode_store(out: &mut Vec<u8>, store: &Store, flag_type: FlagType) {
-    let bins = collect_non_zero_bins(store);
-    if bins.is_empty() {
-        return;
-    }
-
-    out.push(flag(BinEncodingMode::IndexDeltasAndCounts as u8, flag_type));
-    encode_unsigned_var_long(out, bins.len() as u64);
-
-    let mut prev_index: i64 = 0;
-    for &(index, count) in &bins {
-        encode_signed_var_long(out, i64::from(index) - prev_index);
-        encode_var_double(out, count as f64);
-        prev_index = i64::from(index);
-    }
-}
-
-fn decode_store(input: &mut &[u8], subflag: u8, bin_limit: usize) -> Result<Store, DecodeError> {
-    let mode = BinEncodingMode::from_subflag(subflag).ok_or_else(|| {
-        DecodeError::InvalidData(format!("unknown bin encoding mode subflag: {subflag}"))
-    })?;
-    let num_bins = decode_unsigned_var_long(input)? as usize;
-    let mut store = Store::new(bin_limit);
-
-    match mode {
-        BinEncodingMode::IndexDeltasAndCounts => {
-            let mut index: i64 = 0;
-            for _ in 0..num_bins {
-                index += decode_signed_var_long(input)?;
-                let count = decode_var_double(input)?;
-                store.add_count(index as i32, count as u64);
-            }
-        }
-        BinEncodingMode::IndexDeltas => {
-            let mut index: i64 = 0;
-            for _ in 0..num_bins {
-                index += decode_signed_var_long(input)?;
-                store.add_count(index as i32, 1);
-            }
-        }
-        BinEncodingMode::ContiguousCounts => {
-            let start_index = decode_signed_var_long(input)?;
-            let index_delta = decode_signed_var_long(input)?;
-            let mut index = start_index;
-            for _ in 0..num_bins {
-                let count = decode_var_double(input)?;
-                store.add_count(index as i32, count as u64);
-                index += index_delta;
-            }
-        }
-    }
-
-    Ok(store)
-}
-
-// ---------------------------------------------------------------------------
-// Top-level encode / decode
-// ---------------------------------------------------------------------------
-
-/// Encode a DDSketch into the Java-compatible binary format.
-///
-/// The output follows the encoding order of
-/// `DDSketchWithExactSummaryStatistics.encode()` then `DDSketch.encode()`:
-///
-/// 1. Summary statistics: COUNT, MIN, MAX (if count > 0)
-/// 2. SUM (if sum != 0)
-/// 3. Index mapping (LOG layout): gamma, indexOffset
-/// 4. Zero count (if > 0)
-/// 5. Positive store bins
-/// 6. Negative store bins
-pub fn encode_to_java_bytes(sketch: &DDSketch) -> Vec<u8> {
-    let mut out = Vec::new();
-    let count = sketch.count() as f64;
-
-    // Summary statistics (DDSketchWithExactSummaryStatistics.encode)
-    if count != 0.0 {
-        out.push(FLAG_COUNT);
-        encode_var_double(&mut out, count);
-        out.push(FLAG_MIN);
-        write_f64_le(&mut out, sketch.min);
-        out.push(FLAG_MAX);
-        write_f64_le(&mut out, sketch.max);
-    }
-    if sketch.sum != 0.0 {
-        out.push(FLAG_SUM);
-        write_f64_le(&mut out, sketch.sum);
-    }
-
-    // DDSketch.encode: index mapping + zero count + stores
-    out.push(FLAG_INDEX_MAPPING_LOG);
-    write_f64_le(&mut out, sketch.config.gamma);
-    write_f64_le(&mut out, 0.0_f64);
-
-    if sketch.zero_count != 0 {
-        out.push(FLAG_ZERO_COUNT);
-        encode_var_double(&mut out, sketch.zero_count as f64);
-    }
-
-    encode_store(&mut out, &sketch.store, FlagType::PositiveStore);
-    encode_store(&mut out, &sketch.negative_store, FlagType::NegativeStore);
-
-    out
-}
-
-/// Decode a DDSketch from the Java-compatible binary format.
-///
-/// Accepts bytes with or without a `0x02` version prefix.
-pub fn decode_from_java_bytes(bytes: &[u8]) -> Result<DDSketch, DecodeError> {
-    if bytes.is_empty() {
-        return Err(DecodeError::UnexpectedEof);
-    }
-
-    let mut input = bytes;
-
-    // Skip optional version prefix (0x02 followed by a valid flag byte).
-    if input.len() >= 2 && input[0] == 0x02 && is_valid_flag_byte(input[1]) {
-        input = &input[1..];
-    }
-
-    let mut gamma: Option<f64> = None;
-    let mut zero_count: f64 = 0.0;
-    let mut sum: f64 = 0.0;
-    let mut min: f64 = f64::INFINITY;
-    let mut max: f64 = f64::NEG_INFINITY;
-    let mut positive_store: Option<Store> = None;
-    let mut negative_store: Option<Store> = None;
-
-    while !input.is_empty() {
-        let flag_byte = read_byte(&mut input)?;
-        let flag_type =
-            FlagType::from_byte(flag_byte).ok_or(DecodeError::InvalidFlag(flag_byte))?;
-        let subflag = flag_byte >> 2;
-
-        match flag_type {
-            FlagType::IndexMapping => {
-                gamma = Some(read_f64_le(&mut input)?);
-                let _index_offset = read_f64_le(&mut input)?;
-            }
-            FlagType::SketchFeatures => match flag_byte {
-                FLAG_ZERO_COUNT => zero_count += decode_var_double(&mut input)?,
-                FLAG_COUNT => {
-                    let _count = decode_var_double(&mut input)?;
-                }
-                FLAG_SUM => sum = read_f64_le(&mut input)?,
-                FLAG_MIN => min = read_f64_le(&mut input)?,
-                FLAG_MAX => max = read_f64_le(&mut input)?,
-                _ => return Err(DecodeError::InvalidFlag(flag_byte)),
-            },
-            FlagType::PositiveStore => {
-                positive_store = Some(decode_store(
-                    &mut input,
-                    subflag,
-                    DEFAULT_MAX_BINS as usize,
-                )?);
-            }
-            FlagType::NegativeStore => {
-                negative_store = Some(decode_store(
-                    &mut input,
-                    subflag,
-                    DEFAULT_MAX_BINS as usize,
-                )?);
-            }
-        }
-    }
-
-    let g = gamma.unwrap_or_else(|| Config::defaults().gamma);
-    let config = Config::from_gamma(g);
-    let store = positive_store.unwrap_or_else(|| Store::new(config.max_num_bins as usize));
-    let neg = negative_store.unwrap_or_else(|| Store::new(config.max_num_bins as usize));
-
-    Ok(DDSketch {
-        config,
-        store,
-        negative_store: neg,
-        min,
-        max,
-        sum,
-        zero_count: zero_count as u64,
-    })
-}
-
-/// Check whether a byte is a valid flag byte for the DDSketch binary format.
-fn is_valid_flag_byte(b: u8) -> bool {
-    // Known sketch-feature flags
-    if matches!(
-        b,
-        FLAG_ZERO_COUNT | FLAG_COUNT | FLAG_SUM | FLAG_MIN | FLAG_MAX | FLAG_INDEX_MAPPING_LOG
-    ) {
-        return true;
-    }
-    let Some(flag_type) = FlagType::from_byte(b) else {
-        return false;
-    };
-    let subflag = b >> 2;
-    match flag_type {
-        FlagType::PositiveStore | FlagType::NegativeStore => (1..=3).contains(&subflag),
-        FlagType::IndexMapping => subflag <= 4, // LOG=0, LOG_LINEAR=1 .. LOG_QUARTIC=4
-        _ => false,
-    }
-}
-
-// ---------------------------------------------------------------------------
-// Tests
-// ---------------------------------------------------------------------------
-
-#[cfg(test)]
-mod tests {
-    use super::*;
-    use crate::{Config, DDSketch};
-
-    // --- VarEncoding unit tests ---
-
-    #[test]
-    fn test_unsigned_var_long_zero() {
-        let mut buf = Vec::new();
-        encode_unsigned_var_long(&mut buf, 0);
-        assert_eq!(buf, [0x00]);
-
-        let mut input = buf.as_slice();
-        assert_eq!(decode_unsigned_var_long(&mut input).unwrap(), 0);
-        assert!(input.is_empty());
-    }
-
-    #[test]
-    fn test_unsigned_var_long_small() {
-        let mut buf = Vec::new();
-        encode_unsigned_var_long(&mut buf, 1);
-        assert_eq!(buf, [0x01]);
-
-        let mut input = buf.as_slice();
-        assert_eq!(decode_unsigned_var_long(&mut input).unwrap(), 1);
-    }
-
-    #[test]
-    fn test_unsigned_var_long_128() {
-        let mut buf = Vec::new();
-        encode_unsigned_var_long(&mut buf, 128);
-        assert_eq!(buf, [0x80, 0x01]);
-
-        let mut input = buf.as_slice();
-        assert_eq!(decode_unsigned_var_long(&mut input).unwrap(), 128);
-    }
-
-    #[test]
-    fn test_unsigned_var_long_roundtrip() {
-        for v in [0u64, 1, 127, 128, 255, 256, 16383, 16384, u64::MAX] {
-            let mut buf = Vec::new();
-            encode_unsigned_var_long(&mut buf, v);
-            let mut input = buf.as_slice();
-            let decoded = decode_unsigned_var_long(&mut input).unwrap();
-            assert_eq!(decoded, v, "roundtrip failed for {}", v);
-            assert!(input.is_empty());
-        }
-    }
-
-    #[test]
-    fn test_signed_var_long_roundtrip() {
-        for v in [0i64, 1, -1, 63, -64, 64, -65, i64::MAX, i64::MIN] {
-            let mut buf = Vec::new();
-            encode_signed_var_long(&mut buf, v);
-            let mut input = buf.as_slice();
-            let decoded = decode_signed_var_long(&mut input).unwrap();
-            assert_eq!(decoded, v, "roundtrip failed for {}", v);
-            assert!(input.is_empty());
-        }
-    }
-
-    #[test]
-    fn test_var_double_roundtrip() {
-        for v in [0.0, 1.0, 2.0, 5.0, 15.0, 42.0, 100.0, 1e-9, 1e15, 0.5, 7.77] {
-            let mut buf = Vec::new();
-            encode_var_double(&mut buf, v);
-            let mut input = buf.as_slice();
-            let decoded = decode_var_double(&mut input).unwrap();
-            assert!(
-                (decoded - v).abs() < 1e-15 || decoded == v,
-                "roundtrip failed for {}: got {}",
-                v,
-                decoded,
-            );
-            assert!(input.is_empty());
-        }
-    }
-
-    #[test]
-    fn test_var_double_small_integers() {
-        let mut buf = Vec::new();
-        encode_var_double(&mut buf, 1.0);
-        assert_eq!(buf.len(), 1, "VarDouble(1.0) should be 1 byte");
-
-        buf.clear();
-        encode_var_double(&mut buf, 5.0);
-        assert_eq!(buf.len(), 1, "VarDouble(5.0) should be 1 byte");
-    }
-
-    // --- DDSketch encode/decode roundtrip tests ---
-
-    #[test]
-    fn test_encode_empty_sketch() {
-        let sketch = DDSketch::new(Config::defaults());
-        let bytes = sketch.to_java_bytes();
-        assert!(!bytes.is_empty());
-
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-        assert_eq!(decoded.count(), 0);
-        assert_eq!(decoded.min(), None);
-        assert_eq!(decoded.max(), None);
-        assert_eq!(decoded.sum(), None);
-    }
-
-    #[test]
-    fn test_encode_simple_sketch() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [1.0, 2.0, 3.0, 4.0, 5.0] {
-            sketch.add(v);
-        }
-
-        let bytes = sketch.to_java_bytes();
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-
-        assert_eq!(decoded.count(), 5);
-        assert_eq!(decoded.min(), Some(1.0));
-        assert_eq!(decoded.max(), Some(5.0));
-        assert_eq!(decoded.sum(), Some(15.0));
-
-        assert_quantiles_match(&sketch, &decoded, &[0.5, 0.9, 0.95, 0.99]);
-    }
-
-    #[test]
-    fn test_encode_single_value() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        sketch.add(42.0);
-
-        let bytes = sketch.to_java_bytes();
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-
-        assert_eq!(decoded.count(), 1);
-        assert_eq!(decoded.min(), Some(42.0));
-        assert_eq!(decoded.max(), Some(42.0));
-        assert_eq!(decoded.sum(), Some(42.0));
-    }
-
-    #[test]
-    fn test_encode_negative_values() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [-3.0, -1.0, 2.0, 5.0] {
-            sketch.add(v);
-        }
-
-        let bytes = sketch.to_java_bytes();
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-
-        assert_eq!(decoded.count(), 4);
-        assert_eq!(decoded.min(), Some(-3.0));
-        assert_eq!(decoded.max(), Some(5.0));
-        assert_eq!(decoded.sum(), Some(3.0));
-
-        assert_quantiles_match(&sketch, &decoded, &[0.0, 0.25, 0.5, 0.75, 1.0]);
-    }
-
-    #[test]
-    fn test_encode_with_zero_value() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [0.0, 1.0, 2.0] {
-            sketch.add(v);
-        }
-
-        let bytes = sketch.to_java_bytes();
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-
-        assert_eq!(decoded.count(), 3);
-        assert_eq!(decoded.min(), Some(0.0));
-        assert_eq!(decoded.max(), Some(2.0));
-        assert_eq!(decoded.sum(), Some(3.0));
-        assert_eq!(decoded.zero_count, 1);
-    }
-
-    #[test]
-    fn test_encode_large_range() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        sketch.add(0.001);
-        sketch.add(1_000_000.0);
-
-        let bytes = sketch.to_java_bytes();
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-
-        assert_eq!(decoded.count(), 2);
-        assert_eq!(decoded.min(), Some(0.001));
-        assert_eq!(decoded.max(), Some(1_000_000.0));
-    }
-
-    #[test]
-    fn test_encode_with_version_prefix() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [1.0, 2.0, 3.0] {
-            sketch.add(v);
-        }
-
-        let bytes = sketch.to_java_bytes();
-
-        // Simulate Java's toByteArrayV2: prepend 0x02
-        let mut v2_bytes = vec![0x02];
-        v2_bytes.extend_from_slice(&bytes);
-
-        let decoded = DDSketch::from_java_bytes(&v2_bytes).unwrap();
-        assert_eq!(decoded.count(), 3);
-        assert_eq!(decoded.min(), Some(1.0));
-        assert_eq!(decoded.max(), Some(3.0));
-    }
-
-    #[test]
-    fn test_byte_level_encoding() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        sketch.add(1.0);
-
-        let bytes = sketch.to_java_bytes();
-
-        assert_eq!(bytes[0], FLAG_COUNT, "first byte should be COUNT flag");
-        assert!(
-            bytes.contains(&FLAG_INDEX_MAPPING_LOG),
-            "should contain index mapping flag"
-        );
-    }
-
-    // --- Cross-language golden byte tests ---
-    //
-    // Golden bytes generated by Java's DDSketchWithExactSummaryStatistics.encode()
-    // using LogarithmicMapping(0.01) + CollapsingLowestDenseStore(2048).
-
-    const GOLDEN_SIMPLE: &str = "a00588000000000000f03f8c0000000000001440840000000000002e4002fd4a815abf52f03f000000000000000005050002440228021e021602";
-    const GOLDEN_SINGLE: &str = "a0028800000000000045408c000000000000454084000000000000454002fd4a815abf52f03f00000000000000000501f40202";
-    const GOLDEN_NEGATIVE: &str = "a084408800000000000008c08c000000000000144084000000000000084002fd4a815abf52f03f0000000000000000050244025c02070200026c02";
-    const GOLDEN_ZERO: &str = "a0048800000000000000008c000000000000004084000000000000084002fd4a815abf52f03f00000000000000000402050200024402";
-    const GOLDEN_EMPTY: &str = "02fd4a815abf52f03f0000000000000000";
-    const GOLDEN_MANY: &str = "a08d1488000000000000f03f8c0000000000005940840000000000bab34002fd4a815abf52f03f000000000000000005550002440228021e021602120210020c020c020c0208020a020802060208020602060206020602040206020402040204020402040204020402040204020202040202020402020204020202020204020202020202020402020202020202020202020202020202020202020202020202020202020203020202020202020302020202020302020202020302020203020202030202020302030202020302030203020202030203020302030202";
-
-    fn hex_to_bytes(hex: &str) -> Vec<u8> {
-        (0..hex.len())
-            .step_by(2)
-            .map(|i| u8::from_str_radix(&hex[i..i + 2], 16).unwrap())
-            .collect()
-    }
-
-    fn bytes_to_hex(bytes: &[u8]) -> String {
-        bytes.iter().map(|b| format!("{b:02x}")).collect()
-    }
-
-    fn assert_golden(label: &str, sketch: &DDSketch, golden_hex: &str) {
-        let bytes = sketch.to_java_bytes();
-        let expected = hex_to_bytes(golden_hex);
-        assert_eq!(
-            bytes,
-            expected,
-            "Rust encoding doesn't match Java golden bytes for {}.\nRust: {}\nJava: {}",
-            label,
-            bytes_to_hex(&bytes),
-            golden_hex,
-        );
-    }
-
-    fn assert_quantiles_match(a: &DDSketch, b: &DDSketch, quantiles: &[f64]) {
-        for &q in quantiles {
-            let va = a.quantile(q).unwrap().unwrap();
-            let vb = b.quantile(q).unwrap().unwrap();
-            assert!(
-                (va - vb).abs() / va.abs().max(1e-15) < 1e-12,
-                "quantile({}) mismatch: {} vs {}",
-                q,
-                va,
-                vb,
-            );
-        }
-    }
-
-    #[test]
-    fn test_cross_language_simple() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [1.0, 2.0, 3.0, 4.0, 5.0] {
-            sketch.add(v);
-        }
-        assert_golden("SIMPLE", &sketch, GOLDEN_SIMPLE);
-    }
-
-    #[test]
-    fn test_cross_language_single() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        sketch.add(42.0);
-        assert_golden("SINGLE", &sketch, GOLDEN_SINGLE);
-    }
-
-    #[test]
-    fn test_cross_language_negative() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [-3.0, -1.0, 2.0, 5.0] {
-            sketch.add(v);
-        }
-        assert_golden("NEGATIVE", &sketch, GOLDEN_NEGATIVE);
-    }
-
-    #[test]
-    fn test_cross_language_zero() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for v in [0.0, 1.0, 2.0] {
-            sketch.add(v);
-        }
-        assert_golden("ZERO", &sketch, GOLDEN_ZERO);
-    }
-
-    #[test]
-    fn test_cross_language_empty() {
-        let sketch = DDSketch::new(Config::defaults());
-        assert_golden("EMPTY", &sketch, GOLDEN_EMPTY);
-    }
-
-    #[test]
-    fn test_cross_language_many() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for i in 1..=100 {
-            sketch.add(i as f64);
-        }
-        assert_golden("MANY", &sketch, GOLDEN_MANY);
-    }
-
-    #[test]
-    fn test_decode_java_golden_bytes() {
-        for (name, hex) in [
-            ("SIMPLE", GOLDEN_SIMPLE),
-            ("SINGLE", GOLDEN_SINGLE),
-            ("NEGATIVE", GOLDEN_NEGATIVE),
-            ("ZERO", GOLDEN_ZERO),
-            ("EMPTY", GOLDEN_EMPTY),
-            ("MANY", GOLDEN_MANY),
-        ] {
-            let bytes = hex_to_bytes(hex);
-            let result = DDSketch::from_java_bytes(&bytes);
-            assert!(
-                result.is_ok(),
-                "failed to decode {}: {:?}",
-                name,
-                result.err()
-            );
-        }
-    }
-
-    #[test]
-    fn test_encode_decode_many_values() {
-        let mut sketch = DDSketch::new(Config::defaults());
-        for i in 1..=100 {
-            sketch.add(i as f64);
-        }
-
-        let bytes = sketch.to_java_bytes();
-        let decoded = DDSketch::from_java_bytes(&bytes).unwrap();
-
-        assert_eq!(decoded.count(), 100);
-        assert_eq!(decoded.min(), Some(1.0));
-        assert_eq!(decoded.max(), Some(100.0));
-        assert_eq!(decoded.sum(), Some(5050.0));
-
-        let alpha = 0.01;
-        let orig_p95 = sketch.quantile(0.95).unwrap().unwrap();
-        let dec_p95 = decoded.quantile(0.95).unwrap().unwrap();
-        assert!(
-            (orig_p95 - dec_p95).abs() / orig_p95 < alpha,
-            "p95 mismatch: {} vs {}",
-            orig_p95,
-            dec_p95,
-        );
-    }
-}
--- a/sketches-ddsketch/src/lib.rs
+++ b/sketches-ddsketch/src/lib.rs
@@ -1,52 +0,0 @@
-//! This crate provides a direct port of the [Golang](https://github.com/DataDog/sketches-go)
-//! [DDSketch](https://arxiv.org/pdf/1908.10693.pdf) implementation to Rust. All efforts
-//! have been made to keep this as close to the original implementation as possible, with a few
-//! tweaks to get closer to idiomatic Rust.
-//!
-//! # Usage
-//!
-//! Add multiple samples to a DDSketch and invoke the `quantile` method to pull any quantile from
-//! 0.0* to *1.0*.
-//!
-//! ```rust
-//! use sketches_ddsketch::{Config, DDSketch};
-//!
-//! let c = Config::defaults();
-//! let mut d = DDSketch::new(c);
-//!
-//! d.add(1.0);
-//! d.add(1.0);
-//! d.add(1.0);
-//!
-//! let q = d.quantile(0.50).unwrap();
-//!
-//! assert!(q < Some(1.02));
-//! assert!(q > Some(0.98));
-//! ```
-//!
-//! Sketches can also be merged.
-//!
-//! ```rust
-//! use sketches_ddsketch::{Config, DDSketch};
-//!
-//! let c = Config::defaults();
-//! let mut d1 = DDSketch::new(c);
-//! let mut d2 = DDSketch::new(c);
-//!
-//! d1.add(1.0);
-//! d2.add(2.0);
-//! d2.add(2.0);
-//!
-//! d1.merge(&d2);
-//!
-//! assert_eq!(d1.count(), 3);
-//! ```
-
-pub use self::config::Config;
-pub use self::ddsketch::{DDSketch, DDSketchError};
-pub use self::encoding::DecodeError;
-
-mod config;
-mod ddsketch;
-pub mod encoding;
-mod store;
--- a/sketches-ddsketch/src/store.rs
+++ b/sketches-ddsketch/src/store.rs
@@ -1,252 +0,0 @@
-#[cfg(feature = "use_serde")]
-use serde::{Deserialize, Serialize};
-
-const CHUNK_SIZE: i32 = 128;
-
-// Divide the `dividend` by the `divisor`, rounding towards positive infinity.
-//
-// Similar to the nightly only `std::i32::div_ceil`.
-fn div_ceil(dividend: i32, divisor: i32) -> i32 {
-    (dividend + divisor - 1) / divisor
-}
-
-/// CollapsingLowestDenseStore
-#[derive(Clone, Debug)]
-#[cfg_attr(feature = "use_serde", derive(Serialize, Deserialize))]
-pub struct Store {
-    pub(crate) bins: Vec<u64>,
-    pub(crate) count: u64,
-    pub(crate) min_key: i32,
-    pub(crate) max_key: i32,
-    pub(crate) offset: i32,
-    pub(crate) bin_limit: usize,
-    is_collapsed: bool,
-}
-
-impl Store {
-    pub fn new(bin_limit: usize) -> Self {
-        Store {
-            bins: Vec::new(),
-            count: 0,
-            min_key: i32::MAX,
-            max_key: i32::MIN,
-            offset: 0,
-            bin_limit,
-            is_collapsed: false,
-        }
-    }
-
-    /// Return the number of bins.
-    pub fn length(&self) -> i32 {
-        self.bins.len() as i32
-    }
-
-    pub fn is_empty(&self) -> bool {
-        self.bins.is_empty()
-    }
-
-    pub fn add(&mut self, key: i32) {
-        let idx = self.get_index(key);
-        self.bins[idx] += 1;
-        self.count += 1;
-    }
-
-    /// See Java: https://github.com/DataDog/sketches-java/blob/master/src/main/java/com/datadoghq/sketch/ddsketch/store/DenseStore.java  (add(int index, double count) method)
-    pub(crate) fn add_count(&mut self, key: i32, count: u64) {
-        let idx = self.get_index(key);
-        self.bins[idx] += count;
-        self.count += count;
-    }
-
-    fn get_index(&mut self, key: i32) -> usize {
-        if key < self.min_key {
-            if self.is_collapsed {
-                return 0;
-            }
-
-            self.extend_range(key, None);
-            if self.is_collapsed {
-                return 0;
-            }
-        } else if key > self.max_key {
-            self.extend_range(key, None);
-        }
-
-        (key - self.offset) as usize
-    }
-
-    fn extend_range(&mut self, key: i32, second_key: Option<i32>) {
-        let second_key = second_key.unwrap_or(key);
-        let new_min_key = i32::min(key, i32::min(second_key, self.min_key));
-        let new_max_key = i32::max(key, i32::max(second_key, self.max_key));
-
-        if self.is_empty() {
-            let new_len = self.get_new_length(new_min_key, new_max_key);
-            self.bins.resize(new_len, 0);
-            self.offset = new_min_key;
-            self.adjust(new_min_key, new_max_key);
-        } else if new_min_key >= self.min_key && new_max_key < self.offset + self.length() {
-            self.min_key = new_min_key;
-            self.max_key = new_max_key;
-        } else {
-            // Grow bins
-            let new_length = self.get_new_length(new_min_key, new_max_key);
-            if new_length > self.length() as usize {
-                self.bins.resize(new_length, 0);
-            }
-            self.adjust(new_min_key, new_max_key);
-        }
-    }
-
-    fn get_new_length(&self, new_min_key: i32, new_max_key: i32) -> usize {
-        let desired_length = new_max_key - new_min_key + 1;
-        usize::min(
-            (CHUNK_SIZE * div_ceil(desired_length, CHUNK_SIZE)) as usize,
-            self.bin_limit,
-        )
-    }
-
-    fn adjust(&mut self, new_min_key: i32, new_max_key: i32) {
-        if new_max_key - new_min_key + 1 > self.length() {
-            let new_min_key = new_max_key - self.length() + 1;
-
-            if new_min_key >= self.max_key {
-                // Put everything in the first bin.
-                self.offset = new_min_key;
-                self.min_key = new_min_key;
-                self.bins.fill(0);
-                self.bins[0] = self.count;
-            } else {
-                let shift = self.offset - new_min_key;
-                if shift < 0 {
-                    let collapse_start_index = (self.min_key - self.offset) as usize;
-                    let collapse_end_index = (new_min_key - self.offset) as usize;
-                    let collapsed_count: u64 = self.bins[collapse_start_index..collapse_end_index]
-                        .iter()
-                        .sum();
-                    let zero_len = (new_min_key - self.min_key) as usize;
-                    self.bins.splice(
-                        collapse_start_index..collapse_end_index,
-                        std::iter::repeat_n(0, zero_len),
-                    );
-                    self.bins[collapse_end_index] += collapsed_count;
-                }
-                self.min_key = new_min_key;
-                self.shift_bins(shift);
-            }
-
-            self.max_key = new_max_key;
-            self.is_collapsed = true;
-        } else {
-            self.center_bins(new_min_key, new_max_key);
-            self.min_key = new_min_key;
-            self.max_key = new_max_key;
-        }
-    }
-
-    fn shift_bins(&mut self, shift: i32) {
-        if shift > 0 {
-            let shift = shift as usize;
-            self.bins.rotate_right(shift);
-            for idx in 0..shift {
-                self.bins[idx] = 0;
-            }
-        } else {
-            let shift = shift.unsigned_abs() as usize;
-            for idx in 0..shift {
-                self.bins[idx] = 0;
-            }
-            self.bins.rotate_left(shift);
-        }
-
-        self.offset -= shift;
-    }
-
-    fn center_bins(&mut self, new_min_key: i32, new_max_key: i32) {
-        let middle_key = new_min_key + (new_max_key - new_min_key + 1) / 2;
-        let shift = self.offset + self.length() / 2 - middle_key;
-        self.shift_bins(shift)
-    }
-
-    pub fn key_at_rank(&self, rank: u64) -> i32 {
-        let mut n = 0;
-        for (i, bin) in self.bins.iter().enumerate() {
-            n += *bin;
-            if n > rank {
-                return i as i32 + self.offset;
-            }
-        }
-
-        self.max_key
-    }
-
-    pub fn count(&self) -> u64 {
-        self.count
-    }
-
-    pub fn merge(&mut self, other: &Store) {
-        if other.count == 0 {
-            return;
-        }
-
-        if self.count == 0 {
-            self.copy(other);
-            return;
-        }
-
-        if other.min_key < self.min_key || other.max_key > self.max_key {
-            self.extend_range(other.min_key, Some(other.max_key));
-        }
-
-        let collapse_start_index = other.min_key - other.offset;
-        let mut collapse_end_index = i32::min(self.min_key, other.max_key + 1) - other.offset;
-        if collapse_end_index > collapse_start_index {
-            let collapsed_count: u64 = self.bins
-                [collapse_start_index as usize..collapse_end_index as usize]
-                .iter()
-                .sum();
-            self.bins[0] += collapsed_count;
-        } else {
-            collapse_end_index = collapse_start_index;
-        }
-
-        for key in (collapse_end_index + other.offset)..(other.max_key + 1) {
-            self.bins[(key - self.offset) as usize] += other.bins[(key - other.offset) as usize]
-        }
-
-        self.count += other.count;
-    }
-
-    fn copy(&mut self, o: &Store) {
-        self.bins = o.bins.clone();
-        self.count = o.count;
-        self.min_key = o.min_key;
-        self.max_key = o.max_key;
-        self.offset = o.offset;
-        self.bin_limit = o.bin_limit;
-        self.is_collapsed = o.is_collapsed;
-    }
-}
-
-#[cfg(test)]
-mod tests {
-    use crate::store::Store;
-
-    #[test]
-    fn test_simple_store() {
-        let mut s = Store::new(2048);
-
-        for i in 0..2048 {
-            s.add(i);
-        }
-    }
-
-    #[test]
-    fn test_simple_store_rev() {
-        let mut s = Store::new(2048);
-
-        for i in (0..2048).rev() {
-            s.add(i);
-        }
-    }
-}
--- a/sketches-ddsketch/tests/common/dataset.rs
+++ b/sketches-ddsketch/tests/common/dataset.rs
@@ -1,88 +0,0 @@
-use std::cmp::Ordering;
-use std::f64::NAN;
-
-pub struct Dataset {
-    values: Vec<f64>,
-    sum: f64,
-    sorted: bool,
-}
-
-fn cmp_f64(a: &f64, b: &f64) -> Ordering {
-    assert!(!a.is_nan() && !b.is_nan());
-
-    if a < b {
-        return Ordering::Less;
-    } else if a > b {
-        return Ordering::Greater;
-    } else {
-        return Ordering::Equal;
-    }
-}
-
-impl Dataset {
-    pub fn new() -> Self {
-        Dataset {
-            values: Vec::new(),
-            sum: 0.0,
-            sorted: false,
-        }
-    }
-
-    pub fn add(&mut self, value: f64) {
-        self.values.push(value);
-        self.sum += value;
-        self.sorted = false;
-    }
-
-    // pub fn quantile(&mut self, q: f64) -> f64 {
-    // self.lower_quantile(q)
-    // }
-
-    pub fn lower_quantile(&mut self, q: f64) -> f64 {
-        if q < 0.0 || q > 1.0 || self.values.len() == 0 {
-            return NAN;
-        }
-
-        self.sort();
-        let rank = q * (self.values.len() - 1) as f64;
-
-        self.values[rank.floor() as usize]
-    }
-
-    pub fn upper_quantile(&mut self, q: f64) -> f64 {
-        if q < 0.0 || q > 1.0 || self.values.len() == 0 {
-            return NAN;
-        }
-
-        self.sort();
-        let rank = q * (self.values.len() - 1) as f64;
-        self.values[rank.ceil() as usize]
-    }
-
-    pub fn min(&mut self) -> f64 {
-        self.sort();
-        self.values[0]
-    }
-
-    pub fn max(&mut self) -> f64 {
-        self.sort();
-        self.values[self.values.len() - 1]
-    }
-
-    pub fn sum(&self) -> f64 {
-        self.sum
-    }
-
-    pub fn count(&self) -> usize {
-        self.values.len()
-    }
-
-    fn sort(&mut self) {
-        if self.sorted {
-            return;
-        }
-
-        self.values.sort_by(cmp_f64);
-        self.sorted = true;
-    }
-}
--- a/sketches-ddsketch/tests/common/generator.rs
+++ b/sketches-ddsketch/tests/common/generator.rs
@@ -1,100 +0,0 @@
-extern crate rand;
-extern crate rand_distr;
-
-use rand::prelude::*;
-
-pub trait Generator {
-    fn generate(&mut self) -> f64;
-}
-
-// Constant generator
-//
-pub struct Constant {
-    value: f64,
-}
-impl Constant {
-    pub fn new(value: f64) -> Self {
-        Constant { value }
-    }
-}
-impl Generator for Constant {
-    fn generate(&mut self) -> f64 {
-        self.value
-    }
-}
-
-// Linear generator
-//
-pub struct Linear {
-    current_value: f64,
-    step: f64,
-}
-impl Linear {
-    pub fn new(start_value: f64, step: f64) -> Self {
-        Linear {
-            current_value: start_value,
-            step,
-        }
-    }
-}
-impl Generator for Linear {
-    fn generate(&mut self) -> f64 {
-        let value = self.current_value;
-        self.current_value += self.step;
-        value
-    }
-}
-
-// Normal distribution generator
-//
-pub struct Normal {
-    distr: rand_distr::Normal<f64>,
-}
-impl Normal {
-    pub fn new(mean: f64, stddev: f64) -> Self {
-        Normal {
-            distr: rand_distr::Normal::new(mean, stddev).unwrap(),
-        }
-    }
-}
-impl Generator for Normal {
-    fn generate(&mut self) -> f64 {
-        self.distr.sample(&mut rand::thread_rng())
-    }
-}
-
-// Lognormal distribution generator
-//
-pub struct Lognormal {
-    distr: rand_distr::LogNormal<f64>,
-}
-impl Lognormal {
-    pub fn new(mean: f64, stddev: f64) -> Self {
-        Lognormal {
-            distr: rand_distr::LogNormal::new(mean, stddev).unwrap(),
-        }
-    }
-}
-impl Generator for Lognormal {
-    fn generate(&mut self) -> f64 {
-        self.distr.sample(&mut rand::thread_rng())
-    }
-}
-
-// Exponential distribution generator
-//
-pub struct Exponential {
-    distr: rand_distr::Exp<f64>,
-}
-impl Exponential {
-    pub fn new(lambda: f64) -> Self {
-        Exponential {
-            distr: rand_distr::Exp::new(lambda).unwrap(),
-        }
-    }
-}
-impl Generator for Exponential {
-    fn generate(&mut self) -> f64 {
-        self.distr.sample(&mut rand::thread_rng())
-    }
-}
--- a/sketches-ddsketch/tests/common/mod.rs
+++ b/sketches-ddsketch/tests/common/mod.rs
@@ -1,2 +0,0 @@
-pub mod dataset;
-pub mod generator;
--- a/sketches-ddsketch/tests/test_ddsketch.rs
+++ b/sketches-ddsketch/tests/test_ddsketch.rs
@@ -1,316 +0,0 @@
-mod common;
-use std::time::Instant;
-
-use common::dataset::Dataset;
-use common::generator;
-use common::generator::Generator;
-use sketches_ddsketch::{Config, DDSketch};
-
-const TEST_ALPHA: f64 = 0.01;
-const TEST_MAX_BINS: u32 = 1024;
-const TEST_MIN_VALUE: f64 = 1.0e-9;
-
-// Used for float equality
-const TEST_ERROR_THRESH: f64 = 1.0e-9;
-
-const TEST_SIZES: [usize; 5] = [3, 5, 10, 100, 1000];
-const TEST_QUANTILES: [f64; 10] = [0.0, 0.1, 0.25, 0.5, 0.75, 0.9, 0.95, 0.99, 0.999, 1.0];
-
-#[test]
-fn test_constant() {
-    evaluate_sketches(|| Box::new(generator::Constant::new(42.0)));
-}
-
-#[test]
-fn test_linear() {
-    evaluate_sketches(|| Box::new(generator::Linear::new(0.0, 1.0)));
-}
-
-#[test]
-fn test_normal() {
-    evaluate_sketches(|| Box::new(generator::Normal::new(35.0, 1.0)));
-}
-
-#[test]
-fn test_lognormal() {
-    evaluate_sketches(|| Box::new(generator::Lognormal::new(0.0, 2.0)));
-}
-
-#[test]
-fn test_exponential() {
-    evaluate_sketches(|| Box::new(generator::Exponential::new(2.0)));
-}
-
-fn evaluate_test_sizes(f: impl Fn(usize)) {
-    for sz in &TEST_SIZES {
-        f(*sz);
-    }
-}
-
-fn evaluate_sketches(gen_factory: impl Fn() -> Box<dyn generator::Generator>) {
-    evaluate_test_sizes(|sz: usize| {
-        let mut generator = gen_factory();
-        evaluate_sketch(sz, &mut generator);
-    });
-}
-
-fn new_config() -> Config {
-    Config::new(TEST_ALPHA, TEST_MAX_BINS, TEST_MIN_VALUE)
-}
-
-fn assert_float_eq(a: f64, b: f64) {
-    assert!((a - b).abs() < TEST_ERROR_THRESH, "{} != {}", a, b);
-}
-
-fn evaluate_sketch(count: usize, generator: &mut Box<dyn generator::Generator>) {
-    let c = new_config();
-    let mut g = DDSketch::new(c);
-
-    let mut d = Dataset::new();
-
-    for _i in 0..count {
-        let value = generator.generate();
-
-        g.add(value);
-        d.add(value);
-    }
-
-    compare_sketches(&mut d, &g);
-}
-
-fn compare_sketches(d: &mut Dataset, g: &DDSketch) {
-    for q in &TEST_QUANTILES {
-        let lower = d.lower_quantile(*q);
-        let upper = d.upper_quantile(*q);
-
-        let min_expected;
-        if lower < 0.0 {
-            min_expected = lower * (1.0 + TEST_ALPHA);
-        } else {
-            min_expected = lower * (1.0 - TEST_ALPHA);
-        }
-
-        let max_expected;
-        if upper > 0.0 {
-            max_expected = upper * (1.0 + TEST_ALPHA);
-        } else {
-            max_expected = upper * (1.0 - TEST_ALPHA);
-        }
-
-        let quantile = g.quantile(*q).unwrap().unwrap();
-
-        assert!(
-            min_expected <= quantile,
-            "Lower than min, quantile: {}, wanted {} <= {}",
-            *q,
-            min_expected,
-            quantile
-        );
-        assert!(
-            quantile <= max_expected,
-            "Higher than max, quantile: {}, wanted {} <= {}",
-            *q,
-            quantile,
-            max_expected
-        );
-
-        // verify that calls do not modify result (not mut so not possible?)
-        let quantile2 = g.quantile(*q).unwrap().unwrap();
-        assert_eq!(quantile, quantile2);
-    }
-
-    assert_eq!(g.min().unwrap(), d.min());
-    assert_eq!(g.max().unwrap(), d.max());
-    assert_float_eq(g.sum().unwrap(), d.sum());
-    assert_eq!(g.count(), d.count());
-}
-
-#[test]
-fn test_merge_normal() {
-    evaluate_test_sizes(|sz: usize| {
-        let c = new_config();
-        let mut d = Dataset::new();
-        let mut g1 = DDSketch::new(c);
-
-        let mut generator1 = generator::Normal::new(35.0, 1.0);
-        for _ in (0..sz).step_by(3) {
-            let value = generator1.generate();
-            g1.add(value);
-            d.add(value);
-        }
-        let mut g2 = DDSketch::new(c);
-        let mut generator2 = generator::Normal::new(50.0, 2.0);
-        for _ in (1..sz).step_by(3) {
-            let value = generator2.generate();
-            g2.add(value);
-            d.add(value);
-        }
-        g1.merge(&g2).unwrap();
-
-        let mut g3 = DDSketch::new(c);
-        let mut generator3 = generator::Normal::new(40.0, 0.5);
-        for _ in (2..sz).step_by(3) {
-            let value = generator3.generate();
-            g3.add(value);
-            d.add(value);
-        }
-        g1.merge(&g3).unwrap();
-
-        compare_sketches(&mut d, &g1);
-    });
-}
-
-#[test]
-fn test_merge_empty() {
-    evaluate_test_sizes(|sz: usize| {
-        let c = new_config();
-
-        let mut d = Dataset::new();
-
-        let mut g1 = DDSketch::new(c);
-        let mut g2 = DDSketch::new(c);
-        let mut generator = generator::Exponential::new(5.0);
-
-        for _ in 0..sz {
-            let value = generator.generate();
-            g2.add(value);
-            d.add(value);
-        }
-        g1.merge(&g2).unwrap();
-        compare_sketches(&mut d, &g1);
-
-        let g3 = DDSketch::new(c);
-        g2.merge(&g3).unwrap();
-        compare_sketches(&mut d, &g2);
-    });
-}
-
-#[test]
-fn test_merge_mixed() {
-    evaluate_test_sizes(|sz: usize| {
-        let c = new_config();
-        let mut d = Dataset::new();
-        let mut g1 = DDSketch::new(c);
-
-        let mut generator1 = generator::Normal::new(100.0, 1.0);
-        for _ in (0..sz).step_by(3) {
-            let value = generator1.generate();
-            g1.add(value);
-            d.add(value);
-        }
-
-        let mut g2 = DDSketch::new(c);
-        let mut generator2 = generator::Exponential::new(5.0);
-        for _ in (1..sz).step_by(3) {
-            let value = generator2.generate();
-            g2.add(value);
-            d.add(value);
-        }
-        g1.merge(&g2).unwrap();
-
-        let mut g3 = DDSketch::new(c);
-        let mut generator3 = generator::Exponential::new(0.1);
-        for _ in (2..sz).step_by(3) {
-            let value = generator3.generate();
-            g3.add(value);
-            d.add(value);
-        }
-        g1.merge(&g3).unwrap();
-
-        compare_sketches(&mut d, &g1);
-    })
-}
-
-#[test]
-fn test_merge_incompatible() {
-    let c1 = Config::new(TEST_ALPHA, TEST_MAX_BINS, TEST_MIN_VALUE);
-    let c2 = Config::new(TEST_ALPHA * 2.0, TEST_MAX_BINS, TEST_MIN_VALUE);
-
-    let mut d1 = DDSketch::new(c1);
-    let d2 = DDSketch::new(c2);
-
-    assert!(d1.merge(&d2).is_err());
-
-    let c3 = Config::new(TEST_ALPHA, TEST_MAX_BINS, TEST_MIN_VALUE * 10.0);
-    let d3 = DDSketch::new(c3);
-
-    assert!(d1.merge(&d3).is_err());
-
-    let c4 = Config::new(TEST_ALPHA, TEST_MAX_BINS * 2, TEST_MIN_VALUE);
-    let d4 = DDSketch::new(c4);
-
-    assert!(d1.merge(&d4).is_err());
-
-    // the same should work
-    let c5 = Config::new(TEST_ALPHA, TEST_MAX_BINS, TEST_MIN_VALUE);
-    let dsame = DDSketch::new(c5);
-    assert!(d1.merge(&dsame).is_ok());
-}
-
-#[test]
-#[ignore]
-fn test_performance_insert() {
-    let c = Config::defaults();
-    let mut g = DDSketch::new(c);
-    let mut gen = generator::Normal::new(1000.0, 500.0);
-    let count = 300_000_000;
-
-    let mut values = Vec::new();
-    for _ in 0..count {
-        values.push(gen.generate());
-    }
-
-    let start_time = Instant::now();
-    for value in values {
-        g.add(value);
-    }
-
-    // This simply ensures the operations don't get optimzed out as ignored
-    let quantile = g.quantile(0.50).unwrap().unwrap();
-
-    let elapsed = start_time.elapsed().as_micros() as f64;
-    let elapsed = elapsed / 1_000_000.0;
-
-    println!(
-        "RESULT: p50={:.2} => Added {}M samples in {:2} secs ({:.2}M samples/sec)",
-        quantile,
-        count / 1_000_000,
-        elapsed,
-        (count as f64) / 1_000_000.0 / elapsed
-    );
-}
-
-#[test]
-#[ignore]
-fn test_performance_merge() {
-    let c = Config::defaults();
-    let mut gen = generator::Normal::new(1000.0, 500.0);
-    let merge_count = 500_000;
-    let sample_count = 1_000;
-    let mut sketches = Vec::new();
-
-    for _ in 0..merge_count {
-        let mut d = DDSketch::new(c);
-        for _ in 0..sample_count {
-            d.add(gen.generate());
-        }
-        sketches.push(d);
-    }
-
-    let mut base = DDSketch::new(c);
-
-    let start_time = Instant::now();
-    for sketch in &sketches {
-        base.merge(sketch).unwrap();
-    }
-
-    let elapsed = start_time.elapsed().as_micros() as f64;
-    let elapsed = elapsed / 1_000_000.0;
-
-    println!(
-        "RESULT: Merged {} sketches in {:2} secs ({:.2} merges/sec)",
-        merge_count,
-        elapsed,
-        (merge_count as f64) / elapsed
-    );
-}
--- a/src/aggregation/accessor_helpers.rs
+++ b/src/aggregation/accessor_helpers.rs
@@ -95,21 +95,11 @@ pub(crate) fn get_all_ff_reader_or_empty(
    allowed_column_types: Option<&[ColumnType]>,
    fallback_type: ColumnType,
 ) -> crate::Result<Vec<(columnar::Column<u64>, ColumnType)>> {
-    let mut ff_field_with_type = get_all_ff_readers(reader, field_name, allowed_column_types)?;
+    let ff_fields = reader.fast_fields();
+    let mut ff_field_with_type =
+        ff_fields.u64_lenient_for_type_all(allowed_column_types, field_name)?;
    if ff_field_with_type.is_empty() {
        ff_field_with_type.push((Column::build_empty_column(reader.num_docs()), fallback_type));
    }
    Ok(ff_field_with_type)
 }
-
-/// Get all fast field reader.
-pub(crate) fn get_all_ff_readers(
-    reader: &SegmentReader,
-    field_name: &str,
-    allowed_column_types: Option<&[ColumnType]>,
-) -> crate::Result<Vec<(columnar::Column<u64>, ColumnType)>> {
-    let ff_fields = reader.fast_fields();
-    let ff_field_with_type =
-        ff_fields.u64_lenient_for_type_all(allowed_column_types, field_name)?;
-    Ok(ff_field_with_type)
-}
--- a/src/aggregation/agg_data.rs
+++ b/src/aggregation/agg_data.rs
@@ -9,12 +9,11 @@ use crate::aggregation::accessor_helpers::{
    get_numeric_or_date_column_types,
 };
 use crate::aggregation::agg_req::{Aggregation, AggregationVariants, Aggregations};
-pub use crate::aggregation::bucket::{CompositeAggReqData, CompositeSourceAccessors};
 use crate::aggregation::bucket::{
-    build_segment_filter_collector, build_segment_range_collector, CompositeAggregation,
-    FilterAggReqData, HistogramAggReqData, HistogramBounds, IncludeExcludeParam,
-    MissingTermAggReqData, RangeAggReqData, SegmentCompositeCollector, SegmentHistogramCollector,
-    TermMissingAgg, TermsAggReqData, TermsAggregation, TermsAggregationInternal,
+    build_segment_filter_collector, build_segment_range_collector, FilterAggReqData,
+    HistogramAggReqData, HistogramBounds, IncludeExcludeParam, MissingTermAggReqData,
+    RangeAggReqData, SegmentHistogramCollector, TermMissingAgg, TermsAggReqData, TermsAggregation,
+    TermsAggregationInternal,
 };
 use crate::aggregation::metric::{
    build_segment_stats_collector, AverageAggregation, CardinalityAggReqData,
@@ -74,12 +73,6 @@ impl AggregationsSegmentCtx {
        self.per_request.filter_req_data.push(Some(Box::new(data)));
        self.per_request.filter_req_data.len() - 1
    }
-    pub(crate) fn push_composite_req_data(&mut self, data: CompositeAggReqData) -> usize {
-        self.per_request
-            .composite_req_data
-            .push(Some(Box::new(data)));
-        self.per_request.composite_req_data.len() - 1
-    }

    #[inline]
    pub(crate) fn get_term_req_data(&self, idx: usize) -> &TermsAggReqData {
@@ -115,12 +108,6 @@ impl AggregationsSegmentCtx {
            .as_deref()
            .expect("range_req_data slot is empty (taken)")
    }
-    #[inline]
-    pub(crate) fn get_composite_req_data(&self, idx: usize) -> &CompositeAggReqData {
-        self.per_request.composite_req_data[idx]
-            .as_deref()
-            .expect("composite_req_data slot is empty (taken)")
-    }

    // ---------- mutable getters ----------

@@ -143,14 +130,8 @@ impl AggregationsSegmentCtx {
            .as_deref_mut()
            .expect("histogram_req_data slot is empty (taken)")
    }
-    #[inline]
-    pub(crate) fn get_composite_req_data_mut(&mut self, idx: usize) -> &mut CompositeAggReqData {
-        self.per_request.composite_req_data[idx]
-            .as_deref_mut()
-            .expect("composite_req_data slot is empty (taken)")
-    }

-    // ---------- take / put (terms, histogram, range, composite) ----------
+    // ---------- take / put (terms, histogram, range) ----------

    /// Move out the boxed Histogram request at `idx`, leaving `None`.
    #[inline]
@@ -200,25 +181,6 @@ impl AggregationsSegmentCtx {
        debug_assert!(self.per_request.filter_req_data[idx].is_none());
        self.per_request.filter_req_data[idx] = Some(value);
    }
-
-    /// Move out the Composite request at `idx`.
-    #[inline]
-    pub(crate) fn take_composite_req_data(&mut self, idx: usize) -> Box<CompositeAggReqData> {
-        self.per_request.composite_req_data[idx]
-            .take()
-            .expect("composite_req_data slot is empty (taken)")
-    }
-
-    /// Put back a Composite request into an empty slot at `idx`.
-    #[inline]
-    pub(crate) fn put_back_composite_req_data(
-        &mut self,
-        idx: usize,
-        value: Box<CompositeAggReqData>,
-    ) {
-        debug_assert!(self.per_request.composite_req_data[idx].is_none());
-        self.per_request.composite_req_data[idx] = Some(value);
-    }
 }

 /// Each type of aggregation has its own request data struct. This struct holds
@@ -238,8 +200,6 @@ pub struct PerRequestAggSegCtx {
    pub range_req_data: Vec<Option<Box<RangeAggReqData>>>,
    /// FilterAggReqData contains the request data for a filter aggregation.
    pub filter_req_data: Vec<Option<Box<FilterAggReqData>>>,
-    /// CompositeAggReqData contains the request data for a composite aggregation.
-    pub composite_req_data: Vec<Option<Box<CompositeAggReqData>>>,
    /// Shared by avg, min, max, sum, stats, extended_stats, count
    pub stats_metric_req_data: Vec<MetricAggReqData>,
    /// CardinalityAggReqData contains the request data for a cardinality aggregation.
@@ -295,11 +255,6 @@ impl PerRequestAggSegCtx {
                .iter()
                .map(|t| t.get_memory_consumption())
                .sum::<usize>()
-            + self
-                .composite_req_data
-                .iter()
-                .map(|t| t.as_ref().unwrap().get_memory_consumption())
-                .sum::<usize>()
            + self.agg_tree.len() * std::mem::size_of::<AggRefNode>()
    }

@@ -336,11 +291,6 @@ impl PerRequestAggSegCtx {
                .expect("filter_req_data slot is empty (taken)")
                .name
                .as_str(),
-            AggKind::Composite => &self.composite_req_data[idx]
-                .as_deref()
-                .expect("composite_req_data slot is empty (taken)")
-                .name
-                .as_str(),
        }
    }

@@ -467,9 +417,6 @@ pub(crate) fn build_segment_agg_collector(
        )?)),
        AggKind::Range => Ok(build_segment_range_collector(req, node)?),
        AggKind::Filter => build_segment_filter_collector(req, node),
-        AggKind::Composite => Ok(Box::new(SegmentCompositeCollector::from_req_and_validate(
-            req, node,
-        )?)),
    }
 }

@@ -500,7 +447,6 @@ pub enum AggKind {
    DateHistogram,
    Range,
    Filter,
-    Composite,
 }

 impl AggKind {
@@ -516,7 +462,6 @@ impl AggKind {
            AggKind::DateHistogram => "DateHistogram",
            AggKind::Range => "Range",
            AggKind::Filter => "Filter",
-            AggKind::Composite => "Composite",
        }
    }
 }
@@ -795,14 +740,6 @@ fn build_nodes(
                children,
            }])
        }
-        AggregationVariants::Composite(composite_req) => Ok(vec![build_composite_node(
-            agg_name,
-            reader,
-            segment_ordinal,
-            data,
-            &req.sub_aggregation,
-            composite_req,
-        )?]),
    }
 }

@@ -998,35 +935,6 @@ fn build_terms_or_cardinality_nodes(
    Ok(nodes)
 }

-fn build_composite_node(
-    agg_name: &str,
-    reader: &SegmentReader,
-    segment_ordinal: SegmentOrdinal,
-    data: &mut AggregationsSegmentCtx,
-    sub_aggs: &Aggregations,
-    req: &CompositeAggregation,
-) -> crate::Result<AggRefNode> {
-    let mut composite_accessors = Vec::with_capacity(req.sources.len());
-    for source in &req.sources {
-        let source_after_key_opt = req.after.get(source.name()).map(|k| &k.0);
-        let source_accessor =
-            CompositeSourceAccessors::build_for_source(reader, source, source_after_key_opt)?;
-        composite_accessors.push(source_accessor);
-    }
-    let agg = CompositeAggReqData {
-        name: agg_name.to_string(),
-        req: req.clone(),
-        composite_accessors,
-    };
-    let idx = data.push_composite_req_data(agg);
-    let children = build_children(sub_aggs, reader, segment_ordinal, data)?;
-    Ok(AggRefNode {
-        kind: AggKind::Composite,
-        idx_in_req_data: idx,
-        children,
-    })
-}
-
 /// Builds a single BitSet of allowed term ordinals for a string dictionary column according to
 /// include/exclude parameters.
 fn build_allowed_term_ids_for_str(
--- a/src/aggregation/agg_req.rs
+++ b/src/aggregation/agg_req.rs
@@ -40,7 +40,6 @@ use super::metric::{
    MaxAggregation, MinAggregation, PercentilesAggregationReq, StatsAggregation, SumAggregation,
    TopHitsAggregationReq,
 };
-use crate::aggregation::bucket::CompositeAggregation;

 /// The top-level aggregation request structure, which contains [`Aggregation`] and their user
 /// defined names. It is also used in buckets aggregations to define sub-aggregations.
@@ -135,9 +134,6 @@ pub enum AggregationVariants {
    /// Filter documents into a single bucket.
    #[serde(rename = "filter")]
    Filter(FilterAggregation),
-    /// Put data into multi level paginated buckets.
-    #[serde(rename = "composite")]
-    Composite(CompositeAggregation),

    // Metric aggregation types
    /// Computes the average of the extracted values.
@@ -184,11 +180,6 @@ impl AggregationVariants {
            AggregationVariants::Histogram(histogram) => vec![histogram.field.as_str()],
            AggregationVariants::DateHistogram(histogram) => vec![histogram.field.as_str()],
            AggregationVariants::Filter(filter) => filter.get_fast_field_names(),
-            AggregationVariants::Composite(composite) => composite
-                .sources
-                .iter()
-                .map(|source_map| source_map.field())
-                .collect(),
            AggregationVariants::Average(avg) => vec![avg.field_name()],
            AggregationVariants::Count(count) => vec![count.field_name()],
            AggregationVariants::Max(max) => vec![max.field_name()],
@@ -223,12 +214,6 @@ impl AggregationVariants {
            _ => None,
        }
    }
-    pub(crate) fn as_composite(&self) -> Option<&CompositeAggregation> {
-        match &self {
-            AggregationVariants::Composite(composite) => Some(composite),
-            _ => None,
-        }
-    }
    pub(crate) fn as_percentile(&self) -> Option<&PercentilesAggregationReq> {
        match &self {
            AggregationVariants::Percentiles(percentile_req) => Some(percentile_req),
--- a/src/aggregation/agg_result.rs
+++ b/src/aggregation/agg_result.rs
@@ -13,8 +13,6 @@ use super::metric::{
    ExtendedStats, PercentilesMetricResult, SingleMetricResult, Stats, TopHitsMetricResult,
 };
 use super::{AggregationError, Key};
-use crate::aggregation::bucket::AfterKey;
-use crate::aggregation::intermediate_agg_result::CompositeIntermediateKey;
 use crate::TantivyError;

 #[derive(Clone, Default, Debug, PartialEq, Serialize, Deserialize)]
@@ -160,16 +158,6 @@ pub enum BucketResult {
    },
    /// This is the filter result - a single bucket with sub-aggregations
    Filter(FilterBucketResult),
-    /// This is the composite aggregation result
-    Composite {
-        /// The buckets
-        ///
-        /// See [`CompositeAggregation`](super::bucket::CompositeAggregation)
-        buckets: Vec<CompositeBucketEntry>,
-        /// The key to start after when paginating
-        #[serde(skip_serializing_if = "FxHashMap::is_empty")]
-        after_key: FxHashMap<String, AfterKey>,
-    },
 }

 impl BucketResult {
@@ -191,9 +179,6 @@ impl BucketResult {
                // Only count sub-aggregation buckets
                filter_result.sub_aggregations.get_bucket_count()
            }
-            BucketResult::Composite { buckets, .. } => {
-                buckets.iter().map(|bucket| bucket.get_bucket_count()).sum()
-            }
        }
    }
 }
@@ -352,130 +337,3 @@ pub struct FilterBucketResult {
    #[serde(flatten)]
    pub sub_aggregations: AggregationResults,
 }
-
-/// The JSON mappable key to identify a composite bucket.
-///
-/// This is similar to `Key`, but composite keys can also be boolean and null.
-///
-/// Note the type information loss compared to `CompositeIntermediateKey`.
-/// Pagination is performed using `AfterKey`, which encodes type information.
-#[derive(Clone, Debug, Serialize, Deserialize)]
-#[serde(untagged)]
-pub enum CompositeKey {
-    /// Boolean key
-    Bool(bool),
-    /// String key
-    Str(String),
-    /// `i64` key
-    I64(i64),
-    /// `u64` key
-    U64(u64),
-    /// `f64` key
-    F64(f64),
-    /// Null key
-    Null,
-}
-impl Eq for CompositeKey {}
-impl std::hash::Hash for CompositeKey {
-    fn hash<H: std::hash::Hasher>(&self, state: &mut H) {
-        core::mem::discriminant(self).hash(state);
-        match self {
-            Self::Bool(val) => val.hash(state),
-            Self::Str(text) => text.hash(state),
-            Self::F64(val) => val.to_bits().hash(state),
-            Self::U64(val) => val.hash(state),
-            Self::I64(val) => val.hash(state),
-            Self::Null => {}
-        }
-    }
-}
-impl PartialEq for CompositeKey {
-    fn eq(&self, other: &Self) -> bool {
-        match (self, other) {
-            (Self::Bool(l), Self::Bool(r)) => l == r,
-            (Self::Str(l), Self::Str(r)) => l == r,
-            (Self::F64(l), Self::F64(r)) => l.to_bits() == r.to_bits(),
-            (Self::I64(l), Self::I64(r)) => l == r,
-            (Self::U64(l), Self::U64(r)) => l == r,
-            (Self::Null, Self::Null) => true,
-            (
-                Self::Bool(_)
-                | Self::Str(_)
-                | Self::F64(_)
-                | Self::I64(_)
-                | Self::U64(_)
-                | Self::Null,
-                _,
-            ) => false,
-        }
-    }
-}
-impl From<CompositeIntermediateKey> for CompositeKey {
-    fn from(value: CompositeIntermediateKey) -> Self {
-        match value {
-            CompositeIntermediateKey::Str(s) => Self::Str(s),
-            CompositeIntermediateKey::IpAddr(s) => {
-                // Prefer to use the IPv4 representation if possible
-                if let Some(ip) = s.to_ipv4_mapped() {
-                    Self::Str(ip.to_string())
-                } else {
-                    Self::Str(s.to_string())
-                }
-            }
-            CompositeIntermediateKey::F64(f) => Self::F64(f),
-            CompositeIntermediateKey::Bool(f) => Self::Bool(f),
-            CompositeIntermediateKey::U64(f) => Self::U64(f),
-            CompositeIntermediateKey::I64(f) => Self::I64(f),
-            CompositeIntermediateKey::DateTime(f) => Self::I64(f / 1_000_000), // Convert ns to ms
-            CompositeIntermediateKey::Null => Self::Null,
-        }
-    }
-}
-
-/// This is the default entry for a bucket, which contains a composite key, count, and optionally
-/// sub-aggregations.
-///   ...
-///     "my_composite": {
-///       "buckets": [
-///         {
-///           "key": {
-///             "date": 1494201600000,
-///             "product": "rocky"
-///           },
-///           "doc_count": 5
-///         },
-///         {
-///           "key": {
-///             "date": 1494201600000,
-///             "product": "balboa"
-///           },
-///           "doc_count": 2
-///         },
-///         {
-///           "key": {
-///             "date": 1494201700000,
-///             "product": "john"
-///           },
-///           "doc_count": 3
-///         }
-///       ]
-///    }
-///    ...
-/// }
-/// ```
-#[derive(Clone, Debug, PartialEq, Serialize, Deserialize)]
-pub struct CompositeBucketEntry {
-    /// The identifier of the bucket.
-    pub key: FxHashMap<String, CompositeKey>,
-    /// Number of documents in the bucket.
-    pub doc_count: u64,
-    #[serde(flatten)]
-    /// Sub-aggregations in this bucket.
-    pub sub_aggregation: AggregationResults,
-}
-
-impl CompositeBucketEntry {
-    pub(crate) fn get_bucket_count(&self) -> u64 {
-        1 + self.sub_aggregation.get_bucket_count()
-    }
-}
--- a/src/aggregation/bucket/composite/accessors.rs
+++ b/src/aggregation/bucket/composite/accessors.rs
@@ -1,515 +0,0 @@
-use std::fmt::Debug;
-use std::net::Ipv6Addr;
-
-use columnar::column_values::{CompactHit, CompactSpaceU64Accessor};
-use columnar::{Column, ColumnType, MonotonicallyMappableToU64, StrColumn, TermOrdHit};
-
-use crate::aggregation::accessor_helpers::{get_all_ff_readers, get_numeric_or_date_column_types};
-use crate::aggregation::bucket::composite::numeric_types::num_proj;
-use crate::aggregation::bucket::composite::numeric_types::num_proj::ProjectedNumber;
-use crate::aggregation::bucket::composite::ToTypePaginationOrder;
-use crate::aggregation::bucket::{
-    parse_into_milliseconds, CalendarInterval, CompositeAggregation, CompositeAggregationSource,
-    MissingOrder, Order,
-};
-use crate::aggregation::intermediate_agg_result::CompositeIntermediateKey;
-use crate::{SegmentReader, TantivyError};
-
-/// Contains all information required by the SegmentCompositeCollector to perform the
-/// composite aggregation on a segment.
-pub struct CompositeAggReqData {
-    /// The name of the aggregation.
-    pub name: String,
-    /// The normalized term aggregation request.
-    pub req: CompositeAggregation,
-    /// Accessors for each source, each source can have multiple accessors (columns).
-    pub composite_accessors: Vec<CompositeSourceAccessors>,
-}
-
-impl CompositeAggReqData {
-    /// Estimate the memory consumption of this struct in bytes.
-    pub fn get_memory_consumption(&self) -> usize {
-        std::mem::size_of::<Self>()
-            + self.composite_accessors.len() * std::mem::size_of::<CompositeSourceAccessors>()
-    }
-}
-
-/// Accessors for a single column in a composite source.
-pub struct CompositeAccessor {
-    /// The fast field column
-    pub column: Column<u64>,
-    /// The column type
-    pub column_type: ColumnType,
-    /// Term dictionary if the column type is Str
-    ///
-    /// Only used by term sources
-    pub str_dict_column: Option<StrColumn>,
-    /// Parsed date interval for date histogram sources
-    pub date_histogram_interval: PrecomputedDateInterval,
-}
-
-/// Accessors to all the columns that belong to the field of a composite source.
-pub struct CompositeSourceAccessors {
-    /// The accessors for this source
-    pub accessors: Vec<CompositeAccessor>,
-    /// The key after which to start collecting results. Applies to the first
-    /// column of the source.
-    pub after_key: PrecomputedAfterKey,
-
-    /// The column index the after_key applies to. The after_key only applies to
-    /// one column. Columns before should be skipped. Columns after should be
-    /// kept without comparison to the after_key.
-    pub after_key_accessor_idx: usize,
-
-    /// Whether to skip missing values because of the after_key. Skipping only
-    /// applies if the value for previous columns were exactly equal to the
-    /// corresponding after keys (is_on_after_key).
-    pub skip_missing: bool,
-
-    /// The after key was set to null to indicate that the last collected key
-    /// was a missing value.
-    pub is_after_key_explicit_missing: bool,
-}
-
-impl CompositeSourceAccessors {
-    /// Creates a new set of accessors for the composite source.
-    ///
-    /// Precomputes some values to make collection faster.
-    pub fn build_for_source(
-        reader: &SegmentReader,
-        source: &CompositeAggregationSource,
-        // First option is None when no after key was set in the query, the
-        // second option is None when the after key was set but its value for
-        // this source was set to `null`
-        source_after_key_opt: Option<&CompositeIntermediateKey>,
-    ) -> crate::Result<Self> {
-        let is_after_key_explicit_missing = source_after_key_opt
-            .map(|after_key| matches!(after_key, CompositeIntermediateKey::Null))
-            .unwrap_or(false);
-        let mut skip_missing = false;
-        if let Some(CompositeIntermediateKey::Null) = source_after_key_opt {
-            if !source.missing_bucket() {
-                return Err(TantivyError::InvalidArgument(
-                    "the 'after' key for a source cannot be null when 'missing_bucket' is false"
-                        .to_string(),
-                ));
-            }
-        } else if source_after_key_opt.is_some() {
-            // if missing buckets come first and we have a non null after key, we skip missing
-            if MissingOrder::First == source.missing_order() {
-                skip_missing = true;
-            }
-            if MissingOrder::Default == source.missing_order() && Order::Asc == source.order() {
-                skip_missing = true;
-            }
-        };
-
-        match source {
-            CompositeAggregationSource::Terms(source) => {
-                let allowed_column_types = [
-                    ColumnType::I64,
-                    ColumnType::U64,
-                    ColumnType::F64,
-                    ColumnType::Str,
-                    ColumnType::DateTime,
-                    ColumnType::Bool,
-                    ColumnType::IpAddr,
-                    // ColumnType::Bytes Unsupported
-                ];
-                let mut columns_and_types =
-                    get_all_ff_readers(reader, &source.field, Some(&allowed_column_types))?;
-
-                // Sort columns by their pagination order and determine which to skip
-                columns_and_types.sort_by_key(|(_, col_type)| col_type.column_pagination_order());
-                if source.order == Order::Desc {
-                    columns_and_types.reverse();
-                }
-                let after_key_accessor_idx = find_first_column_to_collect(
-                    &columns_and_types,
-                    source_after_key_opt,
-                    source.missing_order,
-                    source.order,
-                )?;
-
-                let source_collectors: Vec<CompositeAccessor> = columns_and_types
-                    .into_iter()
-                    .map(|(column, column_type)| {
-                        Ok(CompositeAccessor {
-                            column,
-                            column_type,
-                            str_dict_column: reader.fast_fields().str(&source.field)?,
-                            date_histogram_interval: PrecomputedDateInterval::NotApplicable,
-                        })
-                    })
-                    .collect::<crate::Result<_>>()?;
-
-                let after_key = if let Some(first_col) =
-                    source_collectors.get(after_key_accessor_idx)
-                {
-                    match source_after_key_opt {
-                        Some(after_key) => PrecomputedAfterKey::precompute(
-                            &first_col,
-                            after_key,
-                            &source.field,
-                            source.missing_order,
-                            source.order,
-                        )?,
-                        None => {
-                            precompute_missing_after_key(false, source.missing_order, source.order)
-                        }
-                    }
-                } else {
-                    // if no columns, we don't care about the after_key
-                    PrecomputedAfterKey::Next(0)
-                };
-
-                Ok(CompositeSourceAccessors {
-                    accessors: source_collectors,
-                    is_after_key_explicit_missing,
-                    skip_missing,
-                    after_key,
-                    after_key_accessor_idx,
-                })
-            }
-            CompositeAggregationSource::Histogram(source) => {
-                let column_and_types: Vec<(Column, ColumnType)> = get_all_ff_readers(
-                    reader,
-                    &source.field,
-                    Some(get_numeric_or_date_column_types()),
-                )?;
-                let source_collectors: Vec<CompositeAccessor> = column_and_types
-                    .into_iter()
-                    .map(|(column, column_type)| {
-                        Ok(CompositeAccessor {
-                            column,
-                            column_type,
-                            str_dict_column: None,
-                            date_histogram_interval: PrecomputedDateInterval::NotApplicable,
-                        })
-                    })
-                    .collect::<crate::Result<_>>()?;
-                let after_key = match source_after_key_opt {
-                    Some(CompositeIntermediateKey::F64(key)) => {
-                        let normalized_key = *key / source.interval;
-                        num_proj::f64_to_i64(normalized_key).into()
-                    }
-                    Some(CompositeIntermediateKey::Null) => {
-                        precompute_missing_after_key(true, source.missing_order, source.order)
-                    }
-                    None => precompute_missing_after_key(true, source.missing_order, source.order),
-                    _ => {
-                        return Err(crate::TantivyError::InvalidArgument(
-                            "After key type invalid for interval composite source".to_string(),
-                        ));
-                    }
-                };
-                Ok(CompositeSourceAccessors {
-                    accessors: source_collectors,
-                    is_after_key_explicit_missing,
-                    skip_missing,
-                    after_key,
-                    after_key_accessor_idx: 0,
-                })
-            }
-            CompositeAggregationSource::DateHistogram(source) => {
-                let column_and_types =
-                    get_all_ff_readers(reader, &source.field, Some(&[ColumnType::DateTime]))?;
-                let date_histogram_interval =
-                    PrecomputedDateInterval::from_date_histogram_source_intervals(
-                        &source.fixed_interval,
-                        source.calendar_interval,
-                    )?;
-                let source_collectors: Vec<CompositeAccessor> = column_and_types
-                    .into_iter()
-                    .map(|(column, column_type)| {
-                        Ok(CompositeAccessor {
-                            column,
-                            column_type,
-                            str_dict_column: None,
-                            date_histogram_interval,
-                        })
-                    })
-                    .collect::<crate::Result<_>>()?;
-                let after_key = match source_after_key_opt {
-                    Some(CompositeIntermediateKey::DateTime(key)) => {
-                        PrecomputedAfterKey::Exact(key.to_u64())
-                    }
-                    Some(CompositeIntermediateKey::Null) => {
-                        precompute_missing_after_key(true, source.missing_order, source.order)
-                    }
-                    None => precompute_missing_after_key(true, source.missing_order, source.order),
-                    _ => {
-                        return Err(crate::TantivyError::InvalidArgument(
-                            "After key type invalid for interval composite source".to_string(),
-                        ));
-                    }
-                };
-                Ok(CompositeSourceAccessors {
-                    accessors: source_collectors,
-                    is_after_key_explicit_missing,
-                    skip_missing,
-                    after_key,
-                    after_key_accessor_idx: 0,
-                })
-            }
-        }
-    }
-}
-
-/// Finds the index of the first column we should start collecting from to
-/// resume the pagination from the after_key.
-fn find_first_column_to_collect<T>(
-    sorted_columns: &[(T, ColumnType)],
-    after_key_opt: Option<&CompositeIntermediateKey>,
-    missing_order: MissingOrder,
-    order: Order,
-) -> crate::Result<usize> {
-    let after_key = match after_key_opt {
-        None => return Ok(0), // No pagination, start from beginning
-        Some(key) => key,
-    };
-    // Handle null after_key (we were on a missing value last time)
-    if matches!(after_key, CompositeIntermediateKey::Null) {
-        return match (missing_order, order) {
-            // Missing values come first, so all columns remain
-            (MissingOrder::First, _) | (MissingOrder::Default, Order::Asc) => Ok(0),
-            // Missing values come last, so all columns are done
-            (MissingOrder::Last, _) | (MissingOrder::Default, Order::Desc) => {
-                Ok(sorted_columns.len())
-            }
-        };
-    }
-    // Find the first column whose type order matches or follows the after_key's
-    // type in the pagination sequence
-    let after_key_column_order = after_key.column_pagination_order();
-    for (idx, (_, col_type)) in sorted_columns.iter().enumerate() {
-        let col_order = col_type.column_pagination_order();
-        let is_first_to_collect = match order {
-            Order::Asc => col_order >= after_key_column_order,
-            Order::Desc => col_order <= after_key_column_order,
-        };
-        if is_first_to_collect {
-            return Ok(idx);
-        }
-    }
-    // All columns are before the after_key, nothing left to collect
-    Ok(sorted_columns.len())
-}
-
-fn precompute_missing_after_key(
-    is_after_key_explicit_missing: bool,
-    missing_order: MissingOrder,
-    order: Order,
-) -> PrecomputedAfterKey {
-    let after_last = PrecomputedAfterKey::AfterLast;
-    let before_first = PrecomputedAfterKey::Next(0);
-    match (is_after_key_explicit_missing, missing_order, order) {
-        (true, MissingOrder::First, Order::Asc) => before_first,
-        (true, MissingOrder::First, Order::Desc) => after_last,
-        (true, MissingOrder::Last, Order::Asc) => after_last,
-        (true, MissingOrder::Last, Order::Desc) => before_first,
-        (true, MissingOrder::Default, Order::Asc) => before_first,
-        (true, MissingOrder::Default, Order::Desc) => after_last,
-        (false, _, Order::Asc) => before_first,
-        (false, _, Order::Desc) => after_last,
-    }
-}
-
-/// A parsed representation of the date interval for date histogram sources
-#[derive(Clone, Copy, Debug)]
-pub enum PrecomputedDateInterval {
-    /// This is not a date histogram source
-    NotApplicable,
-    /// Source was configured with a fixed interval
-    FixedNanoseconds(i64),
-    /// Source was configured with a calendar interval
-    Calendar(CalendarInterval),
-}
-
-impl PrecomputedDateInterval {
-    /// Validates the date histogram source interval fields and parses a date interval from them.
-    pub fn from_date_histogram_source_intervals(
-        fixed_interval: &Option<String>,
-        calendar_interval: Option<CalendarInterval>,
-    ) -> crate::Result<Self> {
-        match (fixed_interval, calendar_interval) {
-            (Some(_), Some(_)) | (None, None) => Err(TantivyError::InvalidArgument(
-                "date histogram source must one and only one of fixed_interval or \
-                 calendar_interval set"
-                    .to_string(),
-            )),
-            (Some(fixed_interval), None) => {
-                let fixed_interval_ms = parse_into_milliseconds(&fixed_interval)?;
-                Ok(PrecomputedDateInterval::FixedNanoseconds(
-                    fixed_interval_ms * 1_000_000,
-                ))
-            }
-            (None, Some(calendar_interval)) => {
-                Ok(PrecomputedDateInterval::Calendar(calendar_interval))
-            }
-        }
-    }
-}
-
-/// The after key projected to the u64 column space
-///
-/// Some column types (term, IP) might not have an exact representation of the
-/// specified after key
-#[derive(Debug)]
-pub enum PrecomputedAfterKey {
-    /// The after key could be exactly represented in the column space.
-    Exact(u64),
-    /// The after key could not be exactly represented exactly represented, so
-    /// this is the next closest one.
-    Next(u64),
-    /// The after key could not be represented in the column space, it is
-    /// greater than all value
-    AfterLast,
-}
-
-impl From<TermOrdHit> for PrecomputedAfterKey {
-    fn from(hit: TermOrdHit) -> Self {
-        match hit {
-            TermOrdHit::Exact(ord) => PrecomputedAfterKey::Exact(ord),
-            // TermOrdHit represents AfterLast as Next(u64::MAX), we keep it as is
-            TermOrdHit::Next(ord) => PrecomputedAfterKey::Next(ord),
-        }
-    }
-}
-
-impl From<CompactHit> for PrecomputedAfterKey {
-    fn from(hit: CompactHit) -> Self {
-        match hit {
-            CompactHit::Exact(ord) => PrecomputedAfterKey::Exact(ord as u64),
-            CompactHit::Next(ord) => PrecomputedAfterKey::Next(ord as u64),
-            CompactHit::AfterLast => PrecomputedAfterKey::AfterLast,
-        }
-    }
-}
-
-impl<T: MonotonicallyMappableToU64> From<ProjectedNumber<T>> for PrecomputedAfterKey {
-    fn from(num: ProjectedNumber<T>) -> Self {
-        match num {
-            ProjectedNumber::Exact(number) => PrecomputedAfterKey::Exact(number.to_u64()),
-            ProjectedNumber::Next(number) => PrecomputedAfterKey::Next(number.to_u64()),
-            ProjectedNumber::AfterLast => PrecomputedAfterKey::AfterLast,
-        }
-    }
-}
-
-// /!\ These operators only makes sense if both values are in the same column space
-impl PrecomputedAfterKey {
-    pub fn equals(&self, column_value: u64) -> bool {
-        match self {
-            PrecomputedAfterKey::Exact(v) => *v == column_value,
-            PrecomputedAfterKey::Next(_) => false,
-            PrecomputedAfterKey::AfterLast => false,
-        }
-    }
-
-    pub fn gt(&self, column_value: u64) -> bool {
-        match self {
-            PrecomputedAfterKey::Exact(v) => *v > column_value,
-            PrecomputedAfterKey::Next(v) => *v > column_value,
-            PrecomputedAfterKey::AfterLast => true,
-        }
-    }
-
-    pub fn lt(&self, column_value: u64) -> bool {
-        match self {
-            PrecomputedAfterKey::Exact(v) => *v < column_value,
-            // a value equal to the next is greater than the after key
-            PrecomputedAfterKey::Next(v) => *v <= column_value,
-            PrecomputedAfterKey::AfterLast => false,
-        }
-    }
-
-    fn precompute_ip_addr(column: &Column<u64>, key: &Ipv6Addr) -> crate::Result<Self> {
-        let compact_space_accessor = column
-            .values
-            .clone()
-            .downcast_arc::<CompactSpaceU64Accessor>()
-            .map_err(|_| {
-                TantivyError::AggregationError(crate::aggregation::AggregationError::InternalError(
-                    "type mismatch: could not downcast to CompactSpaceU64Accessor".to_string(),
-                ))
-            })?;
-        let ip_u128 = key.to_bits();
-        let ip_next_compact = compact_space_accessor.u128_to_next_compact(ip_u128);
-        Ok(ip_next_compact.into())
-    }
-
-    fn precompute_term_ord(
-        str_dict_column: &Option<StrColumn>,
-        key: &str,
-        field: &str,
-    ) -> crate::Result<Self> {
-        let dict = str_dict_column
-            .as_ref()
-            .expect("dictionary missing for str accessor")
-            .dictionary();
-        let next_ord = dict.term_ord_or_next(key).map_err(|_| {
-            TantivyError::InvalidArgument(format!(
-                "failed to lookup after_key '{}' for field '{}'",
-                key, field
-            ))
-        })?;
-        Ok(next_ord.into())
-    }
-
-    /// Projects the after key into the column space of the given accessor.
-    ///
-    /// The computed after key will not take care of skipping entire columns
-    /// when the after key type is ordered after the accessor's type, that
-    /// should be performed earlier.
-    pub fn precompute(
-        composite_accessor: &CompositeAccessor,
-        source_after_key: &CompositeIntermediateKey,
-        field: &str,
-        missing_order: MissingOrder,
-        order: Order,
-    ) -> crate::Result<Self> {
-        use CompositeIntermediateKey as CIKey;
-        let precomputed_key = match (composite_accessor.column_type, source_after_key) {
-            (ColumnType::Bytes, _) => panic!("unsupported"),
-            // null after key
-            (_, CIKey::Null) => precompute_missing_after_key(false, missing_order, order),
-            // numerical
-            (ColumnType::I64, CIKey::I64(k)) => PrecomputedAfterKey::Exact(k.to_u64()),
-            (ColumnType::I64, CIKey::U64(k)) => num_proj::u64_to_i64(*k).into(),
-            (ColumnType::I64, CIKey::F64(k)) => num_proj::f64_to_i64(*k).into(),
-            (ColumnType::U64, CIKey::I64(k)) => num_proj::i64_to_u64(*k).into(),
-            (ColumnType::U64, CIKey::U64(k)) => PrecomputedAfterKey::Exact(*k),
-            (ColumnType::U64, CIKey::F64(k)) => num_proj::f64_to_u64(*k).into(),
-            (ColumnType::F64, CIKey::I64(k)) => num_proj::i64_to_f64(*k).into(),
-            (ColumnType::F64, CIKey::U64(k)) => num_proj::u64_to_f64(*k).into(),
-            (ColumnType::F64, CIKey::F64(k)) => PrecomputedAfterKey::Exact(k.to_u64()),
-            // boolean
-            (ColumnType::Bool, CIKey::Bool(key)) => PrecomputedAfterKey::Exact(key.to_u64()),
-            // string
-            (ColumnType::Str, CIKey::Str(key)) => PrecomputedAfterKey::precompute_term_ord(
-                &composite_accessor.str_dict_column,
-                key,
-                field,
-            )?,
-            // date time
-            (ColumnType::DateTime, CIKey::DateTime(key)) => {
-                PrecomputedAfterKey::Exact(key.to_u64())
-            }
-            // ip address
-            (ColumnType::IpAddr, CIKey::IpAddr(key)) => {
-                PrecomputedAfterKey::precompute_ip_addr(&composite_accessor.column, key)?
-            }
-            // assume the column's type is ordered after the after_key's type
-            _ => PrecomputedAfterKey::keep_all(order),
-        };
-        Ok(precomputed_key)
-    }
-
-    fn keep_all(order: Order) -> Self {
-        match order {
-            Order::Asc => PrecomputedAfterKey::Next(0),
-            Order::Desc => PrecomputedAfterKey::Next(u64::MAX),
-        }
-    }
-}
--- a/src/aggregation/bucket/composite/calendar_interval.rs
+++ b/src/aggregation/bucket/composite/calendar_interval.rs
@@ -1,140 +0,0 @@
-use time::convert::{Day, Nanosecond};
-use time::{Time, UtcDateTime};
-
-const NS_IN_DAY: i64 = Nanosecond::per_t::<i128>(Day) as i64;
-
-/// Computes the timestamp in nanoseconds corresponding to the beginning of the
-/// year (January 1st at midnight UTC).
-pub(super) fn try_year_bucket(timestamp_ns: i64) -> crate::Result<i64> {
-    year_bucket_using_time_crate(timestamp_ns).map_err(|e| {
-        crate::TantivyError::InvalidArgument(format!(
-            "Failed to compute year bucket for timestamp {}: {}",
-            timestamp_ns,
-            e.to_string()
-        ))
-    })
-}
-
-/// Computes the timestamp in nanoseconds corresponding to the beginning of the
-/// month (1st at midnight UTC).
-pub(super) fn try_month_bucket(timestamp_ns: i64) -> crate::Result<i64> {
-    month_bucket_using_time_crate(timestamp_ns).map_err(|e| {
-        crate::TantivyError::InvalidArgument(format!(
-            "Failed to compute month bucket for timestamp {}: {}",
-            timestamp_ns,
-            e.to_string()
-        ))
-    })
-}
-
-/// Computes the timestamp in nanoseconds corresponding to the beginning of the
-/// week (Monday at midnight UTC).
-pub(super) fn week_bucket(timestamp_ns: i64) -> i64 {
-    // 1970-01-01 was a Thursday (weekday = 4)
-    let days_since_epoch = timestamp_ns.div_euclid(NS_IN_DAY);
-    // Find the weekday: 0=Monday, ..., 6=Sunday
-    let weekday = (days_since_epoch + 3).rem_euclid(7);
-    let monday_days_since_epoch = days_since_epoch - weekday;
-    monday_days_since_epoch * NS_IN_DAY
-}
-
-fn year_bucket_using_time_crate(timestamp_ns: i64) -> Result<i64, time::Error> {
-    let timestamp_ns = UtcDateTime::from_unix_timestamp_nanos(timestamp_ns as i128)?
-        .replace_ordinal(1)?
-        .replace_time(Time::MIDNIGHT)
-        .unix_timestamp_nanos();
-    Ok(timestamp_ns as i64)
-}
-
-fn month_bucket_using_time_crate(timestamp_ns: i64) -> Result<i64, time::Error> {
-    let timestamp_ns = UtcDateTime::from_unix_timestamp_nanos(timestamp_ns as i128)?
-        .replace_day(1)?
-        .replace_time(Time::MIDNIGHT)
-        .unix_timestamp_nanos();
-    Ok(timestamp_ns as i64)
-}
-
-#[cfg(test)]
-mod tests {
-    use std::i64;
-
-    use time::format_description::well_known::Iso8601;
-    use time::UtcDateTime;
-
-    use super::*;
-
-    fn ts_ns(iso: &str) -> i64 {
-        UtcDateTime::parse(iso, &Iso8601::DEFAULT)
-            .unwrap()
-            .unix_timestamp_nanos() as i64
-    }
-
-    #[test]
-    fn test_year_bucket() {
-        let ts = ts_ns("1970-01-01T00:00:00Z");
-        let res = try_year_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("1970-01-01T00:00:00Z"));
-
-        let ts = ts_ns("1970-06-01T10:00:01.010Z");
-        let res = try_year_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("1970-01-01T00:00:00Z"));
-
-        let ts = ts_ns("2008-12-31T23:59:59.999999999Z"); // leap year
-        let res = try_year_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("2008-01-01T00:00:00Z"));
-
-        let ts = ts_ns("2008-01-01T00:00:00Z"); // leap year
-        let res = try_year_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("2008-01-01T00:00:00Z"));
-
-        let ts = ts_ns("2010-12-31T23:59:59.999999999Z");
-        let res = try_year_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("2010-01-01T00:00:00Z"));
-
-        let ts = ts_ns("1972-06-01T00:10:00Z");
-        let res = try_year_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("1972-01-01T00:00:00Z"));
-    }
-
-    #[test]
-    fn test_month_bucket() {
-        let ts = ts_ns("1970-01-15T00:00:00Z");
-        let res = try_month_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("1970-01-01T00:00:00Z"));
-
-        let ts = ts_ns("1970-02-01T00:00:00Z");
-        let res = try_month_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("1970-02-01T00:00:00Z"));
-
-        let ts = ts_ns("2000-01-31T23:59:59.999999999Z");
-        let res = try_month_bucket(ts).unwrap();
-        assert_eq!(res, ts_ns("2000-01-01T00:00:00Z"));
-    }
-
-    #[test]
-    fn test_week_bucket() {
-        let ts = ts_ns("1970-01-05T00:00:00Z"); // Monday
-        let res = week_bucket(ts);
-        assert_eq!(res, ts_ns("1970-01-05T00:00:00Z"));
-
-        let ts = ts_ns("1970-01-05T23:59:59Z"); // Monday
-        let res = week_bucket(ts);
-        assert_eq!(res, ts_ns("1970-01-05T00:00:00Z"));
-
-        let ts = ts_ns("1970-01-07T01:13:00Z"); // Wednesday
-        let res = week_bucket(ts);
-        assert_eq!(res, ts_ns("1970-01-05T00:00:00Z"));
-
-        let ts = ts_ns("1970-01-11T23:59:59.999999999Z"); // Sunday
-        let res = week_bucket(ts);
-        assert_eq!(res, ts_ns("1970-01-05T00:00:00Z"));
-
-        let ts = ts_ns("2025-10-16T10:41:59.010Z"); // Thursday
-        let res = week_bucket(ts);
-        assert_eq!(res, ts_ns("2025-10-13T00:00:00Z"));
-
-        let ts = ts_ns("1970-01-01T00:00:00Z"); // Thursday
-        let res = week_bucket(ts);
-        assert_eq!(res, ts_ns("1969-12-29T00:00:00Z")); // Negative
-    }
-}
--- a/src/aggregation/bucket/composite/collector.rs
+++ b/src/aggregation/bucket/composite/collector.rs
@@ -1,595 +0,0 @@
-use std::fmt::Debug;
-use std::net::Ipv6Addr;
-
-use columnar::column_values::CompactSpaceU64Accessor;
-use columnar::{
-    Column, ColumnType, Dictionary, MonotonicallyMappableToU128, MonotonicallyMappableToU64,
-    NumericalValue, StrColumn,
-};
-use rustc_hash::FxHashMap;
-use smallvec::SmallVec;
-
-use crate::aggregation::agg_data::{
-    build_segment_agg_collectors, AggRefNode, AggregationsSegmentCtx,
-};
-use crate::aggregation::bucket::composite::accessors::{
-    CompositeAccessor, CompositeAggReqData, PrecomputedDateInterval,
-};
-use crate::aggregation::bucket::composite::calendar_interval;
-use crate::aggregation::bucket::composite::map::{DynArrayHeapMap, MAX_DYN_ARRAY_SIZE};
-use crate::aggregation::bucket::{
-    CalendarInterval, CompositeAggregationSource, MissingOrder, Order,
-};
-use crate::aggregation::intermediate_agg_result::{
-    CompositeIntermediateKey, IntermediateAggregationResult, IntermediateAggregationResults,
-    IntermediateBucketResult, IntermediateCompositeBucketEntry, IntermediateCompositeBucketResult,
-};
-use crate::aggregation::segment_agg_result::SegmentAggregationCollector;
-use crate::aggregation::BucketId;
-use crate::TantivyError;
-
-#[derive(Debug)]
-struct CompositeBucketCollector {
-    count: u32,
-}
-
-impl CompositeBucketCollector {
-    fn new() -> Self {
-        CompositeBucketCollector { count: 0 }
-    }
-    #[inline]
-    fn collect(&mut self) {
-        self.count += 1;
-    }
-}
-
-/// The value is represented as a tuple of:
-/// - the column index or missing value sentinel
-///   - if the value is present, store the accessor index + 1
-///   - if the value is missing, store 0 (for missing first) or u8::MAX (for missing last)
-/// - the fast field value u64 representation
-///   - 0 if the field is missing
-///   - regular u64 repr if the ordering is ascending
-///   - bitwise NOT of the u64 repr if the ordering is descending
-#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord, Default, Hash)]
-struct InternalValueRepr(u8, u64);
-
-impl InternalValueRepr {
-    #[inline]
-    fn new_term(raw: u64, accessor_idx: u8, order: Order) -> Self {
-        match order {
-            Order::Asc => InternalValueRepr(accessor_idx + 1, raw),
-            Order::Desc => InternalValueRepr(accessor_idx + 1, !raw),
-        }
-    }
-    /// For histogram, the source column does not matter
-    #[inline]
-    fn new_histogram(raw: u64, order: Order) -> Self {
-        match order {
-            Order::Asc => InternalValueRepr(1, raw),
-            Order::Desc => InternalValueRepr(1, !raw),
-        }
-    }
-    #[inline]
-    fn new_missing(order: Order, missing_order: MissingOrder) -> Self {
-        let column_idx = match (missing_order, order) {
-            (MissingOrder::First, _) => 0,
-            (MissingOrder::Last, _) => u8::MAX,
-            (MissingOrder::Default, Order::Asc) => 0,
-            (MissingOrder::Default, Order::Desc) => u8::MAX,
-        };
-        InternalValueRepr(column_idx, 0)
-    }
-    #[inline]
-    fn decode(self, order: Order) -> Option<(u8, u64)> {
-        if self.0 == u8::MAX || self.0 == 0 {
-            return None;
-        }
-        match order {
-            Order::Asc => Some((self.0 - 1, self.1)),
-            Order::Desc => Some((self.0 - 1, !self.1)),
-        }
-    }
-}
-
-/// The collector puts values from the fast field into the correct buckets and
-/// does a conversion to the correct datatype.
-#[derive(Debug)]
-pub struct SegmentCompositeCollector {
-    buckets: DynArrayHeapMap<InternalValueRepr, CompositeBucketCollector>,
-    accessor_idx: usize,
-}
-
-impl SegmentAggregationCollector for SegmentCompositeCollector {
-    fn add_intermediate_aggregation_result(
-        &mut self,
-        agg_data: &AggregationsSegmentCtx,
-        results: &mut IntermediateAggregationResults,
-        _parent_bucket_id: BucketId,
-    ) -> crate::Result<()> {
-        let name = agg_data
-            .get_composite_req_data(self.accessor_idx)
-            .name
-            .clone();
-
-        let buckets = self.into_intermediate_bucket_result(agg_data)?;
-        results.push(
-            name,
-            IntermediateAggregationResult::Bucket(IntermediateBucketResult::Composite { buckets }),
-        )?;
-
-        Ok(())
-    }
-
-    #[inline]
-    fn collect(
-        &mut self,
-        _parent_bucket_id: BucketId,
-        docs: &[crate::DocId],
-        agg_data: &mut AggregationsSegmentCtx,
-    ) -> crate::Result<()> {
-        let mem_pre = self.get_memory_consumption();
-        let composite_agg_data = agg_data.take_composite_req_data(self.accessor_idx);
-
-        for doc in docs {
-            let mut sub_level_values = SmallVec::new();
-            recursive_key_visitor(
-                *doc,
-                agg_data,
-                &composite_agg_data,
-                0,
-                &mut sub_level_values,
-                &mut self.buckets,
-                true,
-            )?;
-        }
-        agg_data.put_back_composite_req_data(self.accessor_idx, composite_agg_data);
-
-        let mem_delta = self.get_memory_consumption() - mem_pre;
-        if mem_delta > 0 {
-            agg_data.context.limits.add_memory_consumed(mem_delta)?;
-        }
-
-        Ok(())
-    }
-
-    fn prepare_max_bucket(
-        &mut self,
-        _max_bucket: BucketId,
-        _agg_data: &AggregationsSegmentCtx,
-    ) -> crate::Result<()> {
-        Ok(())
-    }
-
-    fn flush(&mut self, _agg_data: &mut AggregationsSegmentCtx) -> crate::Result<()> {
-        Ok(())
-    }
-}
-
-impl SegmentCompositeCollector {
-    fn get_memory_consumption(&self) -> u64 {
-        self.buckets.memory_consumption()
-    }
-
-    pub(crate) fn from_req_and_validate(
-        req_data: &mut AggregationsSegmentCtx,
-        node: &AggRefNode,
-    ) -> crate::Result<Self> {
-        validate_req(req_data, node.idx_in_req_data)?;
-
-        if !node.children.is_empty() {
-            let _sub_aggregation = build_segment_agg_collectors(req_data, &node.children)?;
-        }
-
-        let composite_req_data = req_data.get_composite_req_data(node.idx_in_req_data);
-        Ok(SegmentCompositeCollector {
-            buckets: DynArrayHeapMap::try_new(composite_req_data.req.sources.len())?,
-            accessor_idx: node.idx_in_req_data,
-        })
-    }
-
-    #[inline]
-    pub(crate) fn into_intermediate_bucket_result(
-        &mut self,
-        agg_data: &AggregationsSegmentCtx,
-    ) -> crate::Result<IntermediateCompositeBucketResult> {
-        let mut dict: FxHashMap<Vec<CompositeIntermediateKey>, IntermediateCompositeBucketEntry> =
-            Default::default();
-        dict.reserve(self.buckets.size());
-        let composite_data = agg_data.get_composite_req_data(self.accessor_idx);
-        let buckets = std::mem::replace(
-            &mut self.buckets,
-            DynArrayHeapMap::try_new(composite_data.req.sources.len())
-                .expect("already validated source count"),
-        );
-        for (key_internal_repr, agg) in buckets.into_iter() {
-            let key = resolve_key(&key_internal_repr, composite_data)?;
-            let sub_aggregation_res = IntermediateAggregationResults::default();
-
-            dict.insert(
-                key,
-                IntermediateCompositeBucketEntry {
-                    doc_count: agg.count,
-                    sub_aggregation: sub_aggregation_res,
-                },
-            );
-        }
-
-        Ok(IntermediateCompositeBucketResult {
-            entries: dict,
-            target_size: composite_data.req.size,
-            orders: composite_data
-                .req
-                .sources
-                .iter()
-                .map(|source| match source {
-                    CompositeAggregationSource::Terms(t) => (t.order, t.missing_order),
-                    CompositeAggregationSource::Histogram(h) => (h.order, h.missing_order),
-                    CompositeAggregationSource::DateHistogram(d) => (d.order, d.missing_order),
-                })
-                .collect(),
-        })
-    }
-}
-
-fn validate_req(req_data: &mut AggregationsSegmentCtx, accessor_idx: usize) -> crate::Result<()> {
-    let composite_data = req_data.get_composite_req_data(accessor_idx);
-    let req = &composite_data.req;
-    if req.sources.is_empty() {
-        return Err(TantivyError::InvalidArgument(
-            "composite aggregation must have at least one source".to_string(),
-        ));
-    }
-    if req.size == 0 {
-        return Err(TantivyError::InvalidArgument(
-            "composite aggregation 'size' must be > 0".to_string(),
-        ));
-    }
-    let column_types_for_sources = composite_data.composite_accessors.iter().map(|item| {
-        item.accessors
-            .iter()
-            .map(|a| a.column_type)
-            .collect::<Vec<_>>()
-    });
-
-    for column_types in column_types_for_sources {
-        if column_types.len() > MAX_DYN_ARRAY_SIZE {
-            return Err(TantivyError::InvalidArgument(format!(
-                "composite aggregation source supports maximum {MAX_DYN_ARRAY_SIZE} sources",
-            )));
-        }
-        if column_types.contains(&ColumnType::Bytes) {
-            return Err(TantivyError::InvalidArgument(
-                "composite aggregation does not support 'bytes' field type".to_string(),
-            ));
-        }
-    }
-    Ok(())
-}
-
-fn collect_bucket_with_limit(
-    agg_data: &mut AggregationsSegmentCtx,
-    composite_agg_data: &CompositeAggReqData,
-    buckets: &mut DynArrayHeapMap<InternalValueRepr, CompositeBucketCollector>,
-    key: &[InternalValueRepr],
-) -> crate::Result<()> {
-    if (buckets.size() as u32) < composite_agg_data.req.size {
-        buckets
-            .get_or_insert_with(key, CompositeBucketCollector::new)
-            .collect();
-        return Ok(());
-    }
-
-    if let Some(entry) = buckets.get_mut(key) {
-        entry.collect();
-        return Ok(());
-    }
-
-    if let Some(highest_key) = buckets.peek_highest() {
-        if key < highest_key {
-            buckets.evict_highest();
-            buckets
-                .get_or_insert_with(key, CompositeBucketCollector::new)
-                .collect();
-        }
-    }
-
-    let _ = agg_data;
-    Ok(())
-}
-
-/// Converts the composite key from its internal column space representation
-/// (segment specific) into its intermediate form.
-fn resolve_key(
-    internal_key: &[InternalValueRepr],
-    agg_data: &CompositeAggReqData,
-) -> crate::Result<Vec<CompositeIntermediateKey>> {
-    internal_key
-        .into_iter()
-        .enumerate()
-        .map(|(idx, val)| {
-            resolve_internal_value_repr(
-                *val,
-                &agg_data.req.sources[idx],
-                &agg_data.composite_accessors[idx].accessors,
-            )
-        })
-        .collect()
-}
-
-fn resolve_internal_value_repr(
-    internal_value_repr: InternalValueRepr,
-    source: &CompositeAggregationSource,
-    composite_accessors: &[CompositeAccessor],
-) -> crate::Result<CompositeIntermediateKey> {
-    let decoded_value_opt = match source {
-        CompositeAggregationSource::Terms(source) => internal_value_repr.decode(source.order),
-        CompositeAggregationSource::Histogram(source) => internal_value_repr.decode(source.order),
-        CompositeAggregationSource::DateHistogram(source) => {
-            internal_value_repr.decode(source.order)
-        }
-    };
-    let Some((decoded_accessor_idx, val)) = decoded_value_opt else {
-        return Ok(CompositeIntermediateKey::Null);
-    };
-    let key = match source {
-        CompositeAggregationSource::Terms(_) => {
-            let CompositeAccessor {
-                column_type,
-                str_dict_column,
-                column,
-                ..
-            } = &composite_accessors[decoded_accessor_idx as usize];
-            resolve_term(val, column_type, str_dict_column, column)?
-        }
-        CompositeAggregationSource::Histogram(source) => {
-            CompositeIntermediateKey::F64(i64::from_u64(val) as f64 * source.interval)
-        }
-        CompositeAggregationSource::DateHistogram(_) => {
-            CompositeIntermediateKey::DateTime(i64::from_u64(val))
-        }
-    };
-
-    Ok(key)
-}
-
-fn resolve_term(
-    val: u64,
-    column_type: &ColumnType,
-    str_dict_column: &Option<StrColumn>,
-    column: &Column,
-) -> crate::Result<CompositeIntermediateKey> {
-    let key = if *column_type == ColumnType::Str {
-        let fallback_dict = Dictionary::empty();
-        let term_dict = str_dict_column
-            .as_ref()
-            .map(|el| el.dictionary())
-            .unwrap_or_else(|| &fallback_dict);
-
-        let mut buffer = Vec::new();
-        term_dict.ord_to_term(val, &mut buffer)?;
-        CompositeIntermediateKey::Str(
-            String::from_utf8(buffer.to_vec()).expect("could not convert to String"),
-        )
-    } else if *column_type == ColumnType::DateTime {
-        let val = i64::from_u64(val);
-        CompositeIntermediateKey::DateTime(val)
-    } else if *column_type == ColumnType::Bool {
-        let val = bool::from_u64(val);
-        CompositeIntermediateKey::Bool(val)
-    } else if *column_type == ColumnType::IpAddr {
-        let compact_space_accessor = column
-            .values
-            .clone()
-            .downcast_arc::<CompactSpaceU64Accessor>()
-            .map_err(|_| {
-                TantivyError::AggregationError(crate::aggregation::AggregationError::InternalError(
-                    "Type mismatch: Could not downcast to CompactSpaceU64Accessor".to_string(),
-                ))
-            })?;
-        let val: u128 = compact_space_accessor.compact_to_u128(val as u32);
-        let val = Ipv6Addr::from_u128(val);
-        CompositeIntermediateKey::IpAddr(val)
-    } else {
-        if *column_type == ColumnType::U64 {
-            CompositeIntermediateKey::U64(val)
-        } else if *column_type == ColumnType::I64 {
-            CompositeIntermediateKey::I64(i64::from_u64(val))
-        } else {
-            let val = f64::from_u64(val);
-            let val: NumericalValue = val.into();
-
-            match val.normalize() {
-                NumericalValue::U64(val) => CompositeIntermediateKey::U64(val),
-                NumericalValue::I64(val) => CompositeIntermediateKey::I64(val),
-                NumericalValue::F64(val) => CompositeIntermediateKey::F64(val),
-            }
-        }
-    };
-    Ok(key)
-}
-
-/// Depth-first walk of the accessors to build the composite key combinations
-/// and update the buckets.
-fn recursive_key_visitor(
-    doc_id: crate::DocId,
-    agg_data: &mut AggregationsSegmentCtx,
-    composite_agg_data: &CompositeAggReqData,
-    source_idx_for_recursion: usize,
-    sub_level_values: &mut SmallVec<[InternalValueRepr; MAX_DYN_ARRAY_SIZE]>,
-    buckets: &mut DynArrayHeapMap<InternalValueRepr, CompositeBucketCollector>,
-    is_on_after_key: bool,
-) -> crate::Result<()> {
-    if source_idx_for_recursion == composite_agg_data.req.sources.len() {
-        if !is_on_after_key {
-            collect_bucket_with_limit(
-                agg_data,
-                composite_agg_data,
-                buckets,
-                sub_level_values,
-            )?;
-        }
-        return Ok(());
-    }
-
-    let current_level_accessors = &composite_agg_data.composite_accessors[source_idx_for_recursion];
-    let current_level_source = &composite_agg_data.req.sources[source_idx_for_recursion];
-    let mut missing = true;
-    for (accessor_idx, accessor) in current_level_accessors.accessors.iter().enumerate() {
-        let values = accessor.column.values_for_doc(doc_id);
-        for value in values {
-            missing = false;
-            match current_level_source {
-                CompositeAggregationSource::Terms(_) => {
-                    let preceeds_after_key_type =
-                        accessor_idx < current_level_accessors.after_key_accessor_idx;
-                    if is_on_after_key && preceeds_after_key_type {
-                        break;
-                    }
-                    let matches_after_key_type =
-                        accessor_idx == current_level_accessors.after_key_accessor_idx;
-
-                    if matches_after_key_type && is_on_after_key {
-                        let should_skip = match current_level_source.order() {
-                            Order::Asc => current_level_accessors.after_key.gt(value),
-                            Order::Desc => current_level_accessors.after_key.lt(value),
-                        };
-                        if should_skip {
-                            continue;
-                        }
-                    }
-                    sub_level_values.push(InternalValueRepr::new_term(
-                        value,
-                        accessor_idx as u8,
-                        current_level_source.order(),
-                    ));
-                    let still_on_after_key =
-                        matches_after_key_type && current_level_accessors.after_key.equals(value);
-                    recursive_key_visitor(
-                        doc_id,
-                        agg_data,
-                        composite_agg_data,
-                        source_idx_for_recursion + 1,
-                        sub_level_values,
-                        buckets,
-                        is_on_after_key && still_on_after_key,
-                    )?;
-                    sub_level_values.pop();
-                }
-                CompositeAggregationSource::Histogram(source) => {
-                    let float_value = match accessor.column_type {
-                        ColumnType::U64 => value as f64,
-                        ColumnType::I64 => i64::from_u64(value) as f64,
-                        ColumnType::DateTime => i64::from_u64(value) as f64 / 1_000_000.,
-                        ColumnType::F64 => f64::from_u64(value),
-                        _ => {
-                            panic!(
-                                "unexpected type {:?}. This should not happen",
-                                accessor.column_type
-                            )
-                        }
-                    };
-                    let bucket_index = (float_value / source.interval).floor() as i64;
-                    let bucket_value = i64::to_u64(bucket_index);
-                    if is_on_after_key {
-                        let should_skip = match current_level_source.order() {
-                            Order::Asc => current_level_accessors.after_key.gt(bucket_value),
-                            Order::Desc => current_level_accessors.after_key.lt(bucket_value),
-                        };
-                        if should_skip {
-                            continue;
-                        }
-                    }
-                    sub_level_values.push(InternalValueRepr::new_histogram(
-                        bucket_value,
-                        current_level_source.order(),
-                    ));
-                    let still_on_after_key = current_level_accessors.after_key.equals(bucket_value);
-                    recursive_key_visitor(
-                        doc_id,
-                        agg_data,
-                        composite_agg_data,
-                        source_idx_for_recursion + 1,
-                        sub_level_values,
-                        buckets,
-                        is_on_after_key && still_on_after_key,
-                    )?;
-                    sub_level_values.pop();
-                }
-                CompositeAggregationSource::DateHistogram(_) => {
-                    let value_ns = match accessor.column_type {
-                        ColumnType::DateTime => i64::from_u64(value),
-                        _ => {
-                            panic!(
-                                "unexpected type {:?}. This should not happen",
-                                accessor.column_type
-                            )
-                        }
-                    };
-                    let bucket_index = match accessor.date_histogram_interval {
-                        PrecomputedDateInterval::FixedNanoseconds(fixed_interval_ns) => {
-                            (value_ns / fixed_interval_ns) * fixed_interval_ns
-                        }
-                        PrecomputedDateInterval::Calendar(CalendarInterval::Year) => {
-                            calendar_interval::try_year_bucket(value_ns)?
-                        }
-                        PrecomputedDateInterval::Calendar(CalendarInterval::Month) => {
-                            calendar_interval::try_month_bucket(value_ns)?
-                        }
-                        PrecomputedDateInterval::Calendar(CalendarInterval::Week) => {
-                            calendar_interval::week_bucket(value_ns)
-                        }
-                        PrecomputedDateInterval::NotApplicable => {
-                            panic!("interval not precomputed for date histogram source")
-                        }
-                    };
-                    let bucket_value = i64::to_u64(bucket_index);
-                    if is_on_after_key {
-                        let should_skip = match current_level_source.order() {
-                            Order::Asc => current_level_accessors.after_key.gt(bucket_value),
-                            Order::Desc => current_level_accessors.after_key.lt(bucket_value),
-                        };
-                        if should_skip {
-                            continue;
-                        }
-                    }
-                    sub_level_values.push(InternalValueRepr::new_histogram(
-                        bucket_value,
-                        current_level_source.order(),
-                    ));
-                    let still_on_after_key = current_level_accessors.after_key.equals(bucket_value);
-                    recursive_key_visitor(
-                        doc_id,
-                        agg_data,
-                        composite_agg_data,
-                        source_idx_for_recursion + 1,
-                        sub_level_values,
-                        buckets,
-                        is_on_after_key && still_on_after_key,
-                    )?;
-                    sub_level_values.pop();
-                }
-            };
-        }
-    }
-    if missing && current_level_source.missing_bucket() {
-        if is_on_after_key && current_level_accessors.skip_missing {
-            return Ok(());
-        }
-        sub_level_values.push(InternalValueRepr::new_missing(
-            current_level_source.order(),
-            current_level_source.missing_order(),
-        ));
-        recursive_key_visitor(
-            doc_id,
-            agg_data,
-            composite_agg_data,
-            source_idx_for_recursion + 1,
-            sub_level_values,
-            buckets,
-            is_on_after_key && current_level_accessors.is_after_key_explicit_missing,
-        )?;
-        sub_level_values.pop();
-    }
-    Ok(())
-}
--- a/src/aggregation/bucket/composite/map.rs
+++ b/src/aggregation/bucket/composite/map.rs
@@ -1,364 +0,0 @@
-use std::collections::BinaryHeap;
-use std::fmt::Debug;
-use std::hash::Hash;
-
-use rustc_hash::FxHashMap;
-use smallvec::SmallVec;
-
-use crate::TantivyError;
-
-/// Map backed by a hash map for fast access and a binary heap to track the
-/// highest key. The key is an array of fixed size S.
-#[derive(Clone, Debug)]
-struct ArrayHeapMap<K: Ord, V, const S: usize> {
-    pub(crate) buckets: FxHashMap<[K; S], V>,
-    pub(crate) heap: BinaryHeap<[K; S]>,
-}
-
-impl<K: Ord, V, const S: usize> Default for ArrayHeapMap<K, V, S> {
-    fn default() -> Self {
-        ArrayHeapMap {
-            buckets: FxHashMap::default(),
-            heap: BinaryHeap::default(),
-        }
-    }
-}
-
-impl<K: Eq + Hash + Clone + Ord, V, const S: usize> ArrayHeapMap<K, V, S> {
-    /// Panics if the length of `key` is not S.
-    fn get_or_insert_with<F: FnOnce() -> V>(&mut self, key: &[K], f: F) -> &mut V {
-        let key_array: &[K; S] = key.try_into().expect("Key length mismatch");
-        self.buckets.entry(key_array.clone()).or_insert_with(|| {
-            self.heap.push(key_array.clone());
-            f()
-        })
-    }
-
-    /// Panics if the length of `key` is not S.
-    fn get_mut(&mut self, key: &[K]) -> Option<&mut V> {
-        let key_array: &[K; S] = key.try_into().expect("Key length mismatch");
-        self.buckets.get_mut(key_array)
-    }
-
-    fn peek_highest(&self) -> Option<&[K]> {
-        self.heap.peek().map(|k_array| k_array.as_slice())
-    }
-
-    fn evict_highest(&mut self) {
-        if let Some(highest) = self.heap.pop() {
-            self.buckets.remove(&highest);
-        }
-    }
-
-    fn memory_consumption(&self) -> u64 {
-        let key_size = std::mem::size_of::<[K; S]>();
-        let map_size = (key_size + std::mem::size_of::<V>()) * self.buckets.capacity();
-        let heap_size = key_size * self.heap.capacity();
-        (map_size + heap_size) as u64
-    }
-}
-
-impl<K: Copy + Ord + Clone + 'static, V: 'static, const S: usize> ArrayHeapMap<K, V, S> {
-    fn into_iter(self) -> Box<dyn Iterator<Item = (SmallVec<[K; MAX_DYN_ARRAY_SIZE]>, V)>> {
-        Box::new(
-            self.buckets
-                .into_iter()
-                .map(|(k, v)| (SmallVec::from_slice(&k), v)),
-        )
-    }
-
-    fn values_mut<'a>(&'a mut self) -> Box<dyn Iterator<Item = &'a mut V> + 'a> {
-        Box::new(self.buckets.values_mut())
-    }
-}
-
-pub(super) const MAX_DYN_ARRAY_SIZE: usize = 16;
-const MAX_DYN_ARRAY_SIZE_PLUS_ONE: usize = MAX_DYN_ARRAY_SIZE + 1;
-
-/// A map optimized for memory footprint, fast access and efficient eviction of
-/// the highest key.
-///
-/// Keys are inlined arrays of size 1 to [MAX_DYN_ARRAY_SIZE] but for a given
-/// instance the key size is fixed. This allows to avoid heap allocations for the
-/// keys.
-#[derive(Clone, Debug)]
-pub(super) struct DynArrayHeapMap<K: Ord, V>(DynArrayHeapMapInner<K, V>);
-
-/// Wrapper around ArrayHeapMap to dynamically dispatch on the array size.
-#[derive(Clone, Debug)]
-enum DynArrayHeapMapInner<K: Ord, V> {
-    Dim1(ArrayHeapMap<K, V, 1>),
-    Dim2(ArrayHeapMap<K, V, 2>),
-    Dim3(ArrayHeapMap<K, V, 3>),
-    Dim4(ArrayHeapMap<K, V, 4>),
-    Dim5(ArrayHeapMap<K, V, 5>),
-    Dim6(ArrayHeapMap<K, V, 6>),
-    Dim7(ArrayHeapMap<K, V, 7>),
-    Dim8(ArrayHeapMap<K, V, 8>),
-    Dim9(ArrayHeapMap<K, V, 9>),
-    Dim10(ArrayHeapMap<K, V, 10>),
-    Dim11(ArrayHeapMap<K, V, 11>),
-    Dim12(ArrayHeapMap<K, V, 12>),
-    Dim13(ArrayHeapMap<K, V, 13>),
-    Dim14(ArrayHeapMap<K, V, 14>),
-    Dim15(ArrayHeapMap<K, V, 15>),
-    Dim16(ArrayHeapMap<K, V, 16>),
-}
-
-impl<K: Ord, V> DynArrayHeapMap<K, V> {
-    /// Creates a new heap map with dynamic array keys of size `key_dimension`.
-    pub(super) fn try_new(key_dimension: usize) -> crate::Result<Self> {
-        let inner = match key_dimension {
-            0 => {
-                return Err(TantivyError::InvalidArgument(
-                    "DynArrayHeapMap dimension must be at least 1".to_string(),
-                ))
-            }
-            1 => DynArrayHeapMapInner::Dim1(ArrayHeapMap::default()),
-            2 => DynArrayHeapMapInner::Dim2(ArrayHeapMap::default()),
-            3 => DynArrayHeapMapInner::Dim3(ArrayHeapMap::default()),
-            4 => DynArrayHeapMapInner::Dim4(ArrayHeapMap::default()),
-            5 => DynArrayHeapMapInner::Dim5(ArrayHeapMap::default()),
-            6 => DynArrayHeapMapInner::Dim6(ArrayHeapMap::default()),
-            7 => DynArrayHeapMapInner::Dim7(ArrayHeapMap::default()),
-            8 => DynArrayHeapMapInner::Dim8(ArrayHeapMap::default()),
-            9 => DynArrayHeapMapInner::Dim9(ArrayHeapMap::default()),
-            10 => DynArrayHeapMapInner::Dim10(ArrayHeapMap::default()),
-            11 => DynArrayHeapMapInner::Dim11(ArrayHeapMap::default()),
-            12 => DynArrayHeapMapInner::Dim12(ArrayHeapMap::default()),
-            13 => DynArrayHeapMapInner::Dim13(ArrayHeapMap::default()),
-            14 => DynArrayHeapMapInner::Dim14(ArrayHeapMap::default()),
-            15 => DynArrayHeapMapInner::Dim15(ArrayHeapMap::default()),
-            16 => DynArrayHeapMapInner::Dim16(ArrayHeapMap::default()),
-            MAX_DYN_ARRAY_SIZE_PLUS_ONE.. => {
-                return Err(TantivyError::InvalidArgument(format!(
-                    "DynArrayHeapMap supports maximum {MAX_DYN_ARRAY_SIZE} dimensions, got \
-                     {key_dimension}",
-                )))
-            }
-        };
-        Ok(DynArrayHeapMap(inner))
-    }
-
-    /// Number of elements in the map. This is not the dimension of the keys.
-    pub(super) fn size(&self) -> usize {
-        match &self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim2(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim3(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim4(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim5(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim6(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim7(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim8(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim9(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim10(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim11(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim12(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim13(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim14(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim15(map) => map.buckets.len(),
-            DynArrayHeapMapInner::Dim16(map) => map.buckets.len(),
-        }
-    }
-}
-
-impl<K: Ord + Hash + Clone, V> DynArrayHeapMap<K, V> {
-    /// Get a mutable reference to the value corresponding to `key` or inserts a new
-    /// value created by calling `f`.
-    ///
-    /// Panics if the length of `key` does not match the key dimension of the map.
-    pub(super) fn get_or_insert_with<F: FnOnce() -> V>(&mut self, key: &[K], f: F) -> &mut V {
-        match &mut self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim2(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim3(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim4(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim5(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim6(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim7(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim8(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim9(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim10(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim11(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim12(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim13(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim14(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim15(map) => map.get_or_insert_with(key, f),
-            DynArrayHeapMapInner::Dim16(map) => map.get_or_insert_with(key, f),
-        }
-    }
-
-    /// Returns a mutable reference to the value corresponding to `key`.
-    ///
-    /// Panics if the length of `key` does not match the key dimension of the map.
-    pub fn get_mut(&mut self, key: &[K]) -> Option<&mut V> {
-        match &mut self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim2(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim3(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim4(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim5(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim6(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim7(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim8(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim9(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim10(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim11(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim12(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim13(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim14(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim15(map) => map.get_mut(key),
-            DynArrayHeapMapInner::Dim16(map) => map.get_mut(key),
-        }
-    }
-
-    /// Returns a reference to the highest key in the map.
-    pub(super) fn peek_highest(&self) -> Option<&[K]> {
-        match &self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim2(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim3(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim4(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim5(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim6(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim7(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim8(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim9(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim10(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim11(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim12(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim13(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim14(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim15(map) => map.peek_highest(),
-            DynArrayHeapMapInner::Dim16(map) => map.peek_highest(),
-        }
-    }
-
-    /// Removes the entry with the highest key from the map.
-    pub(super) fn evict_highest(&mut self) {
-        match &mut self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim2(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim3(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim4(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim5(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim6(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim7(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim8(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim9(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim10(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim11(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim12(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim13(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim14(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim15(map) => map.evict_highest(),
-            DynArrayHeapMapInner::Dim16(map) => map.evict_highest(),
-        }
-    }
-
-    pub(crate) fn memory_consumption(&self) -> u64 {
-        match &self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim2(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim3(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim4(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim5(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim6(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim7(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim8(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim9(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim10(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim11(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim12(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim13(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim14(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim15(map) => map.memory_consumption(),
-            DynArrayHeapMapInner::Dim16(map) => map.memory_consumption(),
-        }
-    }
-}
-
-impl<K: Ord + Clone + Copy + 'static, V: 'static> DynArrayHeapMap<K, V> {
-    /// Turns this map into an iterator over key-value pairs.
-    pub fn into_iter(self) -> impl Iterator<Item = (SmallVec<[K; MAX_DYN_ARRAY_SIZE]>, V)> {
-        match self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim2(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim3(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim4(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim5(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim6(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim7(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim8(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim9(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim10(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim11(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim12(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim13(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim14(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim15(map) => map.into_iter(),
-            DynArrayHeapMapInner::Dim16(map) => map.into_iter(),
-        }
-    }
-
-    /// Returns an iterator over mutable references to the values in the map.
-    pub(super) fn values_mut(&mut self) -> impl Iterator<Item = &mut V> {
-        match &mut self.0 {
-            DynArrayHeapMapInner::Dim1(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim2(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim3(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim4(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim5(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim6(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim7(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim8(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim9(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim10(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim11(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim12(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim13(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim14(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim15(map) => map.values_mut(),
-            DynArrayHeapMapInner::Dim16(map) => map.values_mut(),
-        }
-    }
-}
-
-#[cfg(test)]
-mod tests {
-    use super::*;
-
-    #[test]
-    fn test_dyn_array_heap_map() {
-        let mut map = DynArrayHeapMap::<u32, &str>::try_new(2).unwrap();
-        // insert
-        let key1 = [1u32, 2u32];
-        let key2 = [2u32, 1u32];
-        map.get_or_insert_with(&key1, || "a");
-        map.get_or_insert_with(&key2, || "b");
-        assert_eq!(map.size(), 2);
-
-        // evict highest
-        assert_eq!(map.peek_highest(), Some(&key2[..]));
-        map.evict_highest();
-        assert_eq!(map.size(), 1);
-        assert_eq!(map.peek_highest(), Some(&key1[..]));
-
-        // mutable iterator
-        {
-            let mut mut_iter = map.values_mut();
-            let v = mut_iter.next().unwrap();
-            assert_eq!(*v, "a");
-            *v = "c";
-            assert_eq!(mut_iter.next(), None);
-        }
-
-        // into_iter
-        let mut iter = map.into_iter();
-        let (k, v) = iter.next().unwrap();
-        assert_eq!(k.as_slice(), &key1);
-        assert_eq!(v, "c");
-        assert_eq!(iter.next(), None);
-    }
-}
--- a/src/aggregation/bucket/composite/mod.rs
+++ b/src/aggregation/bucket/composite/mod.rs
--- a/src/aggregation/bucket/composite/numeric_types.rs
+++ b/src/aggregation/bucket/composite/numeric_types.rs
@@ -1,460 +0,0 @@
-/// This modules helps comparing numerical values of different types (i64, u64
-/// and f64).
-pub(super) mod num_cmp {
-    use std::cmp::Ordering;
-
-    use crate::TantivyError;
-
-    pub fn cmp_i64_f64(left_i: i64, right_f: f64) -> crate::Result<Ordering> {
-        if right_f.is_nan() {
-            return Err(TantivyError::InvalidArgument(
-                "NaN comparison is not supported".to_string(),
-            ));
-        }
-
-        // If right_f is < i64::MIN then left_i > right_f (i64::MIN=-2^63 can be
-        // exactly represented as f64)
-        if right_f < i64::MIN as f64 {
-            return Ok(Ordering::Greater);
-        }
-        // If right_f is >= i64::MAX then left_i < right_f (i64::MAX=2^63-1 cannot
-        // be exactly represented as f64)
-        if right_f >= i64::MAX as f64 {
-            return Ok(Ordering::Less);
-        }
-
-        // Now right_f is in (i64::MIN, i64::MAX), so `right_f as i64` is
-        // well-defined (truncation toward 0)
-        let right_as_i = right_f as i64;
-
-        let result = match left_i.cmp(&right_as_i) {
-            Ordering::Less => Ordering::Less,
-            Ordering::Greater => Ordering::Greater,
-            Ordering::Equal => {
-                // they have the same integer part, compare the fraction
-                let rem = right_f - (right_as_i as f64);
-                if rem == 0.0 {
-                    Ordering::Equal
-                } else if right_f > 0.0 {
-                    Ordering::Less
-                } else {
-                    Ordering::Greater
-                }
-            }
-        };
-        Ok(result)
-    }
-
-    pub fn cmp_u64_f64(left_u: u64, right_f: f64) -> crate::Result<Ordering> {
-        if right_f.is_nan() {
-            return Err(TantivyError::InvalidArgument(
-                "NaN comparison is not supported".to_string(),
-            ));
-        }
-
-        // Negative floats are always less than any u64 >= 0
-        if right_f < 0.0 {
-            return Ok(Ordering::Greater);
-        }
-
-        // If right_f is >= u64::MAX then left_u < right_f (u64::MAX=2^64-1 cannot be exactly)
-        let max_as_f = u64::MAX as f64;
-        if right_f > max_as_f {
-            return Ok(Ordering::Less);
-        }
-
-        // Now right_f is in (0, u64::MAX), so `right_f as u64` is well-defined
-        // (truncation toward 0)
-        let right_as_u = right_f as u64;
-
-        let result = match left_u.cmp(&right_as_u) {
-            Ordering::Less => Ordering::Less,
-            Ordering::Greater => Ordering::Greater,
-            Ordering::Equal => {
-                // they have the same integer part, compare the fraction
-                let rem = right_f - (right_as_u as f64);
-                if rem == 0.0 {
-                    Ordering::Equal
-                } else {
-                    Ordering::Less
-                }
-            }
-        };
-        Ok(result)
-    }
-
-    pub fn cmp_i64_u64(left_i: i64, right_u: u64) -> Ordering {
-        if left_i < 0 {
-            Ordering::Less
-        } else {
-            let left_as_u = left_i as u64;
-            left_as_u.cmp(&right_u)
-        }
-    }
-}
-
-/// This modules helps projecting numerical values to other numerical types.
-/// When the target value space cannot exactly represent the source value, the
-/// next representable value is returned (or AfterLast if the source value is
-/// larger than the largest representable value).
-///
-/// All functions in this module assume that f64 values are not NaN.
-pub(super) mod num_proj {
-    #[derive(Debug, PartialEq)]
-    pub enum ProjectedNumber<T> {
-        Exact(T),
-        Next(T),
-        AfterLast,
-    }
-
-    pub fn i64_to_u64(value: i64) -> ProjectedNumber<u64> {
-        if value < 0 {
-            ProjectedNumber::Next(0)
-        } else {
-            ProjectedNumber::Exact(value as u64)
-        }
-    }
-
-    pub fn u64_to_i64(value: u64) -> ProjectedNumber<i64> {
-        if value > i64::MAX as u64 {
-            ProjectedNumber::AfterLast
-        } else {
-            ProjectedNumber::Exact(value as i64)
-        }
-    }
-
-    pub fn f64_to_u64(value: f64) -> ProjectedNumber<u64> {
-        if value < 0.0 {
-            ProjectedNumber::Next(0)
-        } else if value > u64::MAX as f64 {
-            ProjectedNumber::AfterLast
-        } else if value.fract() == 0.0 {
-            ProjectedNumber::Exact(value as u64)
-        } else {
-            // casting f64 to u64 truncates toward zero
-            ProjectedNumber::Next(value as u64 + 1)
-        }
-    }
-
-    pub fn f64_to_i64(value: f64) -> ProjectedNumber<i64> {
-        if value < (i64::MIN as f64) {
-            return ProjectedNumber::Next(i64::MIN);
-        } else if value >= (i64::MAX as f64) {
-            return ProjectedNumber::AfterLast;
-        } else if value.fract() == 0.0 {
-            ProjectedNumber::Exact(value as i64)
-        } else if value > 0.0 {
-            // casting f64 to i64 truncates toward zero
-            ProjectedNumber::Next(value as i64 + 1)
-        } else {
-            ProjectedNumber::Next(value as i64)
-        }
-    }
-
-    pub fn i64_to_f64(value: i64) -> ProjectedNumber<f64> {
-        let value_f = value as f64;
-        let k_roundtrip = value_f as i64;
-        if k_roundtrip == value {
-            // between -2^53 and 2^53 all i64 are exactly represented as f64
-            ProjectedNumber::Exact(value_f)
-        } else {
-            // for very large/small i64 values, it is approximated to the closest f64
-            if k_roundtrip > value {
-                ProjectedNumber::Next(value_f)
-            } else {
-                ProjectedNumber::Next(value_f.next_up())
-            }
-        }
-    }
-
-    pub fn u64_to_f64(value: u64) -> ProjectedNumber<f64> {
-        let value_f = value as f64;
-        let k_roundtrip = value_f as u64;
-        if k_roundtrip == value {
-            // between 0 and 2^53 all u64 are exactly represented as f64
-            ProjectedNumber::Exact(value_f)
-        } else if k_roundtrip > value {
-            ProjectedNumber::Next(value_f)
-        } else {
-            ProjectedNumber::Next(value_f.next_up())
-        }
-    }
-}
-
-#[cfg(test)]
-mod num_cmp_tests {
-    use std::cmp::Ordering;
-
-    use super::num_cmp::*;
-
-    #[test]
-    fn test_cmp_u64_f64() {
-        // Basic comparisons
-        assert_eq!(cmp_u64_f64(5, 5.0).unwrap(), Ordering::Equal);
-        assert_eq!(cmp_u64_f64(5, 6.0).unwrap(), Ordering::Less);
-        assert_eq!(cmp_u64_f64(6, 5.0).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_u64_f64(0, 0.0).unwrap(), Ordering::Equal);
-        assert_eq!(cmp_u64_f64(0, 0.1).unwrap(), Ordering::Less);
-
-        // Negative float values should always be less than any u64
-        assert_eq!(cmp_u64_f64(0, -0.1).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_u64_f64(5, -5.0).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_u64_f64(u64::MAX, -1e20).unwrap(), Ordering::Greater);
-
-        // Tests with extreme values
-        assert_eq!(cmp_u64_f64(u64::MAX, 1e20).unwrap(), Ordering::Less);
-
-        // Precision edge cases: large u64 that loses precision when converted to f64
-        // => 2^54, exactly represented as f64
-        let large_f64 = 18_014_398_509_481_984.0;
-        let large_u64 = 18_014_398_509_481_984;
-        // prove that large_u64 is exactly represented as f64
-        assert_eq!(large_u64 as f64, large_f64);
-        assert_eq!(cmp_u64_f64(large_u64, large_f64).unwrap(), Ordering::Equal);
-        // => (2^54 + 1) cannot be exactly represented in f64
-        let large_u64_plus_1 = 18_014_398_509_481_985;
-        // prove that it is represented as f64 by large_f64
-        assert_eq!(large_u64_plus_1 as f64, large_f64);
-        assert_eq!(
-            cmp_u64_f64(large_u64_plus_1, large_f64).unwrap(),
-            Ordering::Greater
-        );
-        // => (2^54 - 1) cannot be exactly represented in f64
-        let large_u64_minus_1 = 18_014_398_509_481_983;
-        // prove that it is also represented as f64 by large_f64
-        assert_eq!(large_u64_minus_1 as f64, large_f64);
-        assert_eq!(
-            cmp_u64_f64(large_u64_minus_1, large_f64).unwrap(),
-            Ordering::Less
-        );
-
-        // NaN comparison results in an error
-        assert!(cmp_u64_f64(0, f64::NAN).is_err());
-    }
-
-    #[test]
-    fn test_cmp_i64_f64() {
-        // Basic comparisons
-        assert_eq!(cmp_i64_f64(5, 5.0).unwrap(), Ordering::Equal);
-        assert_eq!(cmp_i64_f64(5, 6.0).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(6, 5.0).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_i64_f64(-5, -5.0).unwrap(), Ordering::Equal);
-        assert_eq!(cmp_i64_f64(-5, -4.0).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(-4, -5.0).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_i64_f64(-5, 5.0).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(5, -5.0).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_i64_f64(0, -0.1).unwrap(), Ordering::Greater);
-        assert_eq!(cmp_i64_f64(0, 0.1).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(-1, -0.5).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(-1, 0.0).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(0, 0.0).unwrap(), Ordering::Equal);
-
-        // Tests with extreme values
-        assert_eq!(cmp_i64_f64(i64::MAX, 1e20).unwrap(), Ordering::Less);
-        assert_eq!(cmp_i64_f64(i64::MIN, -1e20).unwrap(), Ordering::Greater);
-
-        // Precision edge cases: large i64 that loses precision when converted to f64
-        // => 2^54, exactly represented as f64
-        let large_f64 = 18_014_398_509_481_984.0;
-        let large_i64 = 18_014_398_509_481_984;
-        // prove that large_i64 is exactly represented as f64
-        assert_eq!(large_i64 as f64, large_f64);
-        assert_eq!(cmp_i64_f64(large_i64, large_f64).unwrap(), Ordering::Equal);
-        // => (1_i64 << 54) + 1 cannot be exactly represented in f64
-        let large_i64_plus_1 = 18_014_398_509_481_985;
-        // prove that it is represented as f64 by large_f64
-        assert_eq!(large_i64_plus_1 as f64, large_f64);
-        assert_eq!(
-            cmp_i64_f64(large_i64_plus_1, large_f64).unwrap(),
-            Ordering::Greater
-        );
-        // => (1_i64 << 54) - 1 cannot be exactly represented in f64
-        let large_i64_minus_1 = 18_014_398_509_481_983;
-        // prove that it is also represented as f64 by large_f64
-        assert_eq!(large_i64_minus_1 as f64, large_f64);
-        assert_eq!(
-            cmp_i64_f64(large_i64_minus_1, large_f64).unwrap(),
-            Ordering::Less
-        );
-
-        // Same precision edge case but with negative values
-        // => -2^54, exactly represented as f64
-        let large_neg_f64 = -18_014_398_509_481_984.0;
-        let large_neg_i64 = -18_014_398_509_481_984;
-        // prove that large_neg_i64 is exactly represented as f64
-        assert_eq!(large_neg_i64 as f64, large_neg_f64);
-        assert_eq!(
-            cmp_i64_f64(large_neg_i64, large_neg_f64).unwrap(),
-            Ordering::Equal
-        );
-        // => (-2^54 + 1) cannot be exactly represented in f64
-        let large_neg_i64_plus_1 = -18_014_398_509_481_985;
-        // prove that it is represented as f64 by large_neg_f64
-        assert_eq!(large_neg_i64_plus_1 as f64, large_neg_f64);
-        assert_eq!(
-            cmp_i64_f64(large_neg_i64_plus_1, large_neg_f64).unwrap(),
-            Ordering::Less
-        );
-        // => (-2^54 - 1) cannot be exactly represented in f64
-        let large_neg_i64_minus_1 = -18_014_398_509_481_983;
-        // prove that it is also represented as f64 by large_neg_f64
-        assert_eq!(large_neg_i64_minus_1 as f64, large_neg_f64);
-        assert_eq!(
-            cmp_i64_f64(large_neg_i64_minus_1, large_neg_f64).unwrap(),
-            Ordering::Greater
-        );
-
-        // NaN comparison results in an error
-        assert!(cmp_i64_f64(0, f64::NAN).is_err());
-    }
-
-    #[test]
-    fn test_cmp_i64_u64() {
-        // Test with negative i64 values (should always be less than any u64)
-        assert_eq!(cmp_i64_u64(-1, 0), Ordering::Less);
-        assert_eq!(cmp_i64_u64(i64::MIN, 0), Ordering::Less);
-        assert_eq!(cmp_i64_u64(i64::MIN, u64::MAX), Ordering::Less);
-
-        // Test with positive i64 values
-        assert_eq!(cmp_i64_u64(0, 0), Ordering::Equal);
-        assert_eq!(cmp_i64_u64(1, 0), Ordering::Greater);
-        assert_eq!(cmp_i64_u64(1, 1), Ordering::Equal);
-        assert_eq!(cmp_i64_u64(0, 1), Ordering::Less);
-        assert_eq!(cmp_i64_u64(5, 10), Ordering::Less);
-        assert_eq!(cmp_i64_u64(10, 5), Ordering::Greater);
-
-        // Test with values near i64::MAX and u64 conversion
-        assert_eq!(cmp_i64_u64(i64::MAX, i64::MAX as u64), Ordering::Equal);
-        assert_eq!(cmp_i64_u64(i64::MAX, (i64::MAX as u64) + 1), Ordering::Less);
-        assert_eq!(cmp_i64_u64(i64::MAX, u64::MAX), Ordering::Less);
-    }
-}
-
-#[cfg(test)]
-mod num_proj_tests {
-    use super::num_proj::{self, ProjectedNumber};
-
-    #[test]
-    fn test_i64_to_u64() {
-        assert_eq!(num_proj::i64_to_u64(-1), ProjectedNumber::Next(0));
-        assert_eq!(num_proj::i64_to_u64(i64::MIN), ProjectedNumber::Next(0));
-        assert_eq!(num_proj::i64_to_u64(0), ProjectedNumber::Exact(0));
-        assert_eq!(num_proj::i64_to_u64(42), ProjectedNumber::Exact(42));
-        assert_eq!(
-            num_proj::i64_to_u64(i64::MAX),
-            ProjectedNumber::Exact(i64::MAX as u64)
-        );
-    }
-
-    #[test]
-    fn test_u64_to_i64() {
-        assert_eq!(num_proj::u64_to_i64(0), ProjectedNumber::Exact(0));
-        assert_eq!(num_proj::u64_to_i64(42), ProjectedNumber::Exact(42));
-        assert_eq!(
-            num_proj::u64_to_i64(i64::MAX as u64),
-            ProjectedNumber::Exact(i64::MAX)
-        );
-        assert_eq!(
-            num_proj::u64_to_i64((i64::MAX as u64) + 1),
-            ProjectedNumber::AfterLast
-        );
-        assert_eq!(num_proj::u64_to_i64(u64::MAX), ProjectedNumber::AfterLast);
-    }
-
-    #[test]
-    fn test_f64_to_u64() {
-        assert_eq!(num_proj::f64_to_u64(-1e25), ProjectedNumber::Next(0));
-        assert_eq!(num_proj::f64_to_u64(-0.1), ProjectedNumber::Next(0));
-        assert_eq!(num_proj::f64_to_u64(1e20), ProjectedNumber::AfterLast);
-        assert_eq!(
-            num_proj::f64_to_u64(f64::INFINITY),
-            ProjectedNumber::AfterLast
-        );
-        assert_eq!(num_proj::f64_to_u64(0.0), ProjectedNumber::Exact(0));
-        assert_eq!(num_proj::f64_to_u64(42.0), ProjectedNumber::Exact(42));
-        assert_eq!(num_proj::f64_to_u64(0.5), ProjectedNumber::Next(1));
-        assert_eq!(num_proj::f64_to_u64(42.1), ProjectedNumber::Next(43));
-    }
-
-    #[test]
-    fn test_f64_to_i64() {
-        assert_eq!(num_proj::f64_to_i64(-1e20), ProjectedNumber::Next(i64::MIN));
-        assert_eq!(
-            num_proj::f64_to_i64(f64::NEG_INFINITY),
-            ProjectedNumber::Next(i64::MIN)
-        );
-        assert_eq!(num_proj::f64_to_i64(1e20), ProjectedNumber::AfterLast);
-        assert_eq!(
-            num_proj::f64_to_i64(f64::INFINITY),
-            ProjectedNumber::AfterLast
-        );
-        assert_eq!(num_proj::f64_to_i64(0.0), ProjectedNumber::Exact(0));
-        assert_eq!(num_proj::f64_to_i64(42.0), ProjectedNumber::Exact(42));
-        assert_eq!(num_proj::f64_to_i64(-42.0), ProjectedNumber::Exact(-42));
-        assert_eq!(num_proj::f64_to_i64(0.5), ProjectedNumber::Next(1));
-        assert_eq!(num_proj::f64_to_i64(42.1), ProjectedNumber::Next(43));
-        assert_eq!(num_proj::f64_to_i64(-0.5), ProjectedNumber::Next(0));
-        assert_eq!(num_proj::f64_to_i64(-42.1), ProjectedNumber::Next(-42));
-    }
-
-    #[test]
-    fn test_i64_to_f64() {
-        assert_eq!(num_proj::i64_to_f64(0), ProjectedNumber::Exact(0.0));
-        assert_eq!(num_proj::i64_to_f64(42), ProjectedNumber::Exact(42.0));
-        assert_eq!(num_proj::i64_to_f64(-42), ProjectedNumber::Exact(-42.0));
-
-        let max_exact = 9_007_199_254_740_992; // 2^53
-        assert_eq!(
-            num_proj::i64_to_f64(max_exact),
-            ProjectedNumber::Exact(max_exact as f64)
-        );
-
-        // Test values that cannot be exactly represented as f64 (integers above 2^53)
-        let large_i64 = 9_007_199_254_740_993; // 2^53 + 1
-        let closest_f64 = 9_007_199_254_740_992.0;
-        assert_eq!(large_i64 as f64, closest_f64);
-        if let ProjectedNumber::Next(val) = num_proj::i64_to_f64(large_i64) {
-            // Verify that the returned float is different from the direct cast
-            assert!(val > closest_f64);
-            assert!(val - closest_f64 < 2. * f64::EPSILON * closest_f64);
-        } else {
-            panic!("Expected ProjectedNumber::Next for large_i64");
-        }
-
-        // Test with very large negative value
-        let large_neg_i64 = -9_007_199_254_740_993; // -(2^53 + 1)
-        let closest_neg_f64 = -9_007_199_254_740_992.0;
-        assert_eq!(large_neg_i64 as f64, closest_neg_f64);
-        if let ProjectedNumber::Next(val) = num_proj::i64_to_f64(large_neg_i64) {
-            // Verify that the returned float is the closest representable f64
-            assert_eq!(val, closest_neg_f64);
-        } else {
-            panic!("Expected ProjectedNumber::Next for large_neg_i64");
-        }
-    }
-
-    #[test]
-    fn test_u64_to_f64() {
-        assert_eq!(num_proj::u64_to_f64(0), ProjectedNumber::Exact(0.0));
-        assert_eq!(num_proj::u64_to_f64(42), ProjectedNumber::Exact(42.0));
-
-        // Test the largest u64 value that can be exactly represented as f64 (2^53)
-        let max_exact = 9_007_199_254_740_992; // 2^53
-        assert_eq!(
-            num_proj::u64_to_f64(max_exact),
-            ProjectedNumber::Exact(max_exact as f64)
-        );
-
-        // Test values that cannot be exactly represented as f64 (integers above 2^53)
-        let large_u64 = 9_007_199_254_740_993; // 2^53 + 1
-        let closest_f64 = 9_007_199_254_740_992.0;
-        assert_eq!(large_u64 as f64, closest_f64);
-        if let ProjectedNumber::Next(val) = num_proj::u64_to_f64(large_u64) {
-            // Verify that the returned float is different from the direct cast
-            assert!(val > closest_f64);
-            assert!(val - closest_f64 < 2. * f64::EPSILON * closest_f64);
-        } else {
-            panic!("Expected ProjectedNumber::Next for large_u64");
-        }
-    }
-}
--- a/src/aggregation/bucket/histogram/date_histogram.rs
+++ b/src/aggregation/bucket/histogram/date_histogram.rs
@@ -207,7 +207,7 @@ fn parse_offset_into_milliseconds(input: &str) -> Result<i64, AggregationError>
    }
 }

-pub(crate) fn parse_into_milliseconds(input: &str) -> Result<i64, AggregationError> {
+fn parse_into_milliseconds(input: &str) -> Result<i64, AggregationError> {
    let split_boundary = input
        .as_bytes()
        .iter()
--- a/src/aggregation/bucket/mod.rs
+++ b/src/aggregation/bucket/mod.rs
@@ -22,7 +22,6 @@
 //! - [Range](RangeAggregation)
 //! - [Terms](TermsAggregation)

-mod composite;
 mod filter;
 mod histogram;
 mod range;
@@ -32,7 +31,6 @@ mod term_missing_agg;
 use std::collections::HashMap;
 use std::fmt;

-pub use composite::*;
 pub use filter::*;
 pub use histogram::*;
 pub use range::*;
--- a/src/aggregation/intermediate_agg_result.rs
+++ b/src/aggregation/intermediate_agg_result.rs
@@ -25,12 +25,9 @@ use super::metric::{
 use super::segment_agg_result::AggregationLimitsGuard;
 use super::{format_date, AggregationError, Key, SerializedKey};
 use crate::aggregation::agg_result::{
-    AggregationResults, BucketEntries, BucketEntry, CompositeBucketEntry, FilterBucketResult,
-};
-use crate::aggregation::bucket::{
-    composite_intermediate_key_ordering, CompositeAggregation, MissingOrder,
-    TermsAggregationInternal,
+    AggregationResults, BucketEntries, BucketEntry, FilterBucketResult,
 };
+use crate::aggregation::bucket::TermsAggregationInternal;
 use crate::aggregation::metric::CardinalityCollector;
 use crate::TantivyError;

@@ -249,11 +246,6 @@ pub(crate) fn empty_from_req(req: &Aggregation) -> IntermediateAggregationResult
                is_date_agg: true,
            })
        }
-        Composite(_) => {
-            IntermediateAggregationResult::Bucket(IntermediateBucketResult::Composite {
-                buckets: Default::default(),
-            })
-        }
        Average(_) => IntermediateAggregationResult::Metric(IntermediateMetricResult::Average(
            IntermediateAverage::default(),
        )),
@@ -481,11 +473,6 @@ pub enum IntermediateBucketResult {
        /// Sub-aggregation results
        sub_aggregations: IntermediateAggregationResults,
    },
-    /// Composite aggregation
-    Composite {
-        /// The composite buckets
-        buckets: IntermediateCompositeBucketResult,
-    },
 }

 impl IntermediateBucketResult {
@@ -581,13 +568,6 @@ impl IntermediateBucketResult {
                    sub_aggregations: final_sub_aggregations,
                }))
            }
-            IntermediateBucketResult::Composite { buckets } => buckets.into_final_result(
-                req.agg
-                    .as_composite()
-                    .expect("unexpected aggregation, expected composite aggregation"),
-                req.sub_aggregation(),
-                limits,
-            ),
        }
    }

@@ -654,16 +634,6 @@ impl IntermediateBucketResult {
                *doc_count_left += doc_count_right;
                sub_aggs_left.merge_fruits(sub_aggs_right)?;
            }
-            (
-                IntermediateBucketResult::Composite {
-                    buckets: buckets_left,
-                },
-                IntermediateBucketResult::Composite {
-                    buckets: buckets_right,
-                },
-            ) => {
-                buckets_left.merge_fruits(buckets_right)?;
-            }
            (IntermediateBucketResult::Range(_), _) => {
                panic!("try merge on different types")
            }
@@ -676,9 +646,6 @@ impl IntermediateBucketResult {
            (IntermediateBucketResult::Filter { .. }, _) => {
                panic!("try merge on different types")
            }
-            (IntermediateBucketResult::Composite { .. }, _) => {
-                panic!("try merge on different types")
-            }
        }
        Ok(())
    }
@@ -922,212 +889,6 @@ pub struct IntermediateTermBucketEntry {
    pub sub_aggregation: IntermediateAggregationResults,
 }

-/// Entry for the composite bucket.
-pub type IntermediateCompositeBucketEntry = IntermediateTermBucketEntry;
-
-/// The fully typed key for composite aggregation
-#[derive(Clone, Debug, PartialEq, Serialize, Deserialize)]
-pub enum CompositeIntermediateKey {
-    /// Bool key
-    Bool(bool),
-    /// String key
-    Str(String),
-    /// Float key
-    F64(f64),
-    /// Signed integer key
-    I64(i64),
-    /// Unsigned integer key
-    U64(u64),
-    /// DateTime key, nanoseconds since epoch
-    DateTime(i64),
-    /// IP Address key
-    IpAddr(Ipv6Addr),
-    /// Missing value key
-    Null,
-}
-
-impl Eq for CompositeIntermediateKey {}
-
-impl std::hash::Hash for CompositeIntermediateKey {
-    fn hash<H: std::hash::Hasher>(&self, state: &mut H) {
-        core::mem::discriminant(self).hash(state);
-        match self {
-            CompositeIntermediateKey::Bool(val) => val.hash(state),
-            CompositeIntermediateKey::Str(text) => text.hash(state),
-            CompositeIntermediateKey::F64(val) => val.to_bits().hash(state),
-            CompositeIntermediateKey::U64(val) => val.hash(state),
-            CompositeIntermediateKey::I64(val) => val.hash(state),
-            CompositeIntermediateKey::DateTime(val) => val.hash(state),
-            CompositeIntermediateKey::IpAddr(val) => val.hash(state),
-            CompositeIntermediateKey::Null => {}
-        }
-    }
-}
-
-/// Composite aggregation page.
-#[derive(Default, Clone, Debug, PartialEq, Serialize, Deserialize)]
-pub struct IntermediateCompositeBucketResult {
-    #[serde(
-        serialize_with = "serialize_composite_entries",
-        deserialize_with = "deserialize_composite_entries"
-    )]
-    pub(crate) entries: FxHashMap<Vec<CompositeIntermediateKey>, IntermediateCompositeBucketEntry>,
-    pub(crate) target_size: u32,
-    pub(crate) orders: Vec<(Order, MissingOrder)>,
-}
-
-fn serialize_composite_entries<S>(
-    entries: &FxHashMap<Vec<CompositeIntermediateKey>, IntermediateCompositeBucketEntry>,
-    serializer: S,
-) -> Result<S::Ok, S::Error>
-where
-    S: serde::Serializer,
-{
-    use serde::ser::SerializeSeq;
-    let mut seq = serializer.serialize_seq(Some(entries.len()))?;
-    for (k, v) in entries {
-        seq.serialize_element(&(k, v))?;
-    }
-    seq.end()
-}
-
-fn deserialize_composite_entries<'de, D>(
-    deserializer: D,
-) -> Result<FxHashMap<Vec<CompositeIntermediateKey>, IntermediateCompositeBucketEntry>, D::Error>
-where
-    D: serde::Deserializer<'de>,
-{
-    let vec: Vec<(Vec<CompositeIntermediateKey>, IntermediateCompositeBucketEntry)> =
-        serde::Deserialize::deserialize(deserializer)?;
-    Ok(vec.into_iter().collect())
-}
-
-impl IntermediateCompositeBucketResult {
-    pub(crate) fn into_final_result(
-        self,
-        req: &CompositeAggregation,
-        sub_aggregation_req: &Aggregations,
-        limits: &mut AggregationLimitsGuard,
-    ) -> crate::Result<BucketResult> {
-        let trimmed_entry_vec =
-            trim_composite_buckets(self.entries, &self.orders, self.target_size)?;
-        let after_key = if trimmed_entry_vec.len() == req.size as usize {
-            trimmed_entry_vec
-                .last()
-                .map(|bucket| {
-                    let (intermediate_key, _entry) = bucket;
-                    intermediate_key
-                        .iter()
-                        .enumerate()
-                        .map(|(idx, intermediate_key)| {
-                            let source = &req.sources[idx];
-                            (source.name().to_string(), intermediate_key.clone().into())
-                        })
-                        .collect()
-                })
-                .unwrap()
-        } else {
-            FxHashMap::default()
-        };
-
-        let buckets = trimmed_entry_vec
-            .into_iter()
-            .map(|(intermediate_key, entry)| {
-                let key = intermediate_key
-                    .into_iter()
-                    .enumerate()
-                    .map(|(idx, intermediate_key)| {
-                        let source = &req.sources[idx];
-                        (source.name().to_string(), intermediate_key.into())
-                    })
-                    .collect();
-                Ok(CompositeBucketEntry {
-                    key,
-                    doc_count: entry.doc_count as u64,
-                    sub_aggregation: entry
-                        .sub_aggregation
-                        .into_final_result_internal(sub_aggregation_req, limits)?,
-                })
-            })
-            .collect::<crate::Result<Vec<_>>>()?;
-
-        Ok(BucketResult::Composite { after_key, buckets })
-    }
-
-    fn merge_fruits(&mut self, other: IntermediateCompositeBucketResult) -> crate::Result<()> {
-        merge_maps(&mut self.entries, other.entries)?;
-        if self.entries.len() as u32 > 2 * self.target_size {
-            // 2x factor used to avoid trimming too often (expensive operation)
-            // an optimal threshold could probably be figured out
-            self.trim()?;
-        }
-        Ok(())
-    }
-
-    /// Trim the composite buckets to the target size, according to the ordering.
-    ///
-    /// Returns an error if the ordering comparison fails.
-    pub(crate) fn trim(&mut self) -> crate::Result<()> {
-        if self.entries.len() as u32 <= self.target_size {
-            return Ok(());
-        }
-
-        let sorted_entries = trim_composite_buckets(
-            std::mem::take(&mut self.entries),
-            &self.orders,
-            self.target_size,
-        )?;
-
-        self.entries = sorted_entries.into_iter().collect();
-        Ok(())
-    }
-}
-
-fn trim_composite_buckets(
-    entries: FxHashMap<Vec<CompositeIntermediateKey>, IntermediateCompositeBucketEntry>,
-    orders: &[(Order, MissingOrder)],
-    target_size: u32,
-) -> crate::Result<
-    Vec<(
-        Vec<CompositeIntermediateKey>,
-        IntermediateCompositeBucketEntry,
-    )>,
-> {
-    let mut entries: Vec<_> = entries.into_iter().collect();
-    let mut sort_error: Option<TantivyError> = None;
-    entries.sort_by(|(left_key, _), (right_key, _)| {
-        // Only attempt sorting if we haven't encountered an error yet
-        if sort_error.is_some() {
-            return Ordering::Equal; // Return a default, we'll handle the error after sorting
-        }
-
-        for i in 0..orders.len() {
-            match composite_intermediate_key_ordering(
-                &left_key[i],
-                &right_key[i],
-                orders[i].0,
-                orders[i].1,
-            ) {
-                Ok(ordering) if ordering != Ordering::Equal => return ordering,
-                Ok(_) => continue, // Equal, try next key
-                Err(err) => {
-                    sort_error = Some(err);
-                    break;
-                }
-            }
-        }
-        Ordering::Equal
-    });
-
-    // If we encountered an error during sorting, return it now
-    if let Some(err) = sort_error {
-        return Err(err);
-    }
-
-    entries.truncate(target_size as usize);
-    Ok(entries)
-}
-
 impl MergeFruits for IntermediateTermBucketEntry {
    fn merge_fruits(&mut self, other: IntermediateTermBucketEntry) -> crate::Result<()> {
        self.doc_count += other.doc_count;
--- a/src/aggregation/metric/percentiles.rs
+++ b/src/aggregation/metric/percentiles.rs
@@ -331,7 +331,7 @@ mod tests {
    use crate::aggregation::AggregationCollector;
    use crate::query::AllQuery;
    use crate::schema::{Schema, FAST};
-    use crate::Index;
+    use crate::{assert_nearly_equals, Index};

    #[test]
    fn test_aggregation_percentiles_empty_index() -> crate::Result<()> {
@@ -614,13 +614,17 @@ mod tests {
        let res = exec_request_with_query(agg_req, &index, None)?;
        assert_eq!(res["range_with_stats"]["buckets"][0]["doc_count"], 3);

-        assert_eq!(
-            res["range_with_stats"]["buckets"][0]["percentiles"]["values"]["1.0"],
-            5.002829575110705
+        assert_nearly_equals!(
+            res["range_with_stats"]["buckets"][0]["percentiles"]["values"]["1.0"]
+                .as_f64()
+                .unwrap(),
+            5.0028295751107414
        );
-        assert_eq!(
-            res["range_with_stats"]["buckets"][0]["percentiles"]["values"]["99.0"],
-            10.07469668951133
+        assert_nearly_equals!(
+            res["range_with_stats"]["buckets"][0]["percentiles"]["values"]["99.0"]
+                .as_f64()
+                .unwrap(),
+            10.07469668951144
        );

        Ok(())
@@ -665,8 +669,14 @@ mod tests {

        let res = exec_request_with_query(agg_req, &index, None)?;

-        assert_eq!(res["percentiles"]["values"]["1.0"], 5.002829575110705);
-        assert_eq!(res["percentiles"]["values"]["99.0"], 10.07469668951133);
+        assert_nearly_equals!(
+            res["percentiles"]["values"]["1.0"].as_f64().unwrap(),
+            5.0028295751107414
+        );
+        assert_nearly_equals!(
+            res["percentiles"]["values"]["99.0"].as_f64().unwrap(),
+            10.07469668951144
+        );

        Ok(())
    }
--- a/src/directory/mod.rs
+++ b/src/directory/mod.rs
@@ -21,7 +21,7 @@ use std::path::PathBuf;
 pub use common::file_slice::{FileHandle, FileSlice};
 pub use common::{AntiCallToken, OwnedBytes, TerminatingWrite};

-pub(crate) use self::composite_file::{CompositeFile, CompositeWrite};
+pub use self::composite_file::{CompositeFile, CompositeWrite};
 pub use self::directory::{Directory, DirectoryClone, DirectoryLock};
 pub use self::directory_lock::{Lock, INDEX_WRITER_LOCK, META_LOCK};
 pub use self::ram_directory::RamDirectory;
@@ -52,7 +52,7 @@ pub use self::mmap_directory::MmapDirectory;
 ///
 /// `WritePtr` are required to implement both Write
 /// and Seek.
-pub type WritePtr = BufWriter<Box<dyn TerminatingWrite>>;
+pub type WritePtr = BufWriter<Box<dyn TerminatingWrite + Send + Sync>>;

 #[cfg(test)]
 mod tests;
--- a/src/indexer/log_merge_policy.rs
+++ b/src/indexer/log_merge_policy.rs
@@ -94,7 +94,7 @@ impl MergePolicy for LogMergePolicy {
    fn compute_merge_candidates(&self, segments: &[SegmentMeta]) -> Vec<MergeCandidate> {
        let size_sorted_segments = segments
            .iter()
-            .filter(|seg| seg.num_docs() <= (self.max_docs_before_merge as u32))
+            .filter(|seg| (seg.num_docs() as usize) <= self.max_docs_before_merge)
            .sorted_by_key(|seg| std::cmp::Reverse(seg.max_doc()))
            .collect::<Vec<&SegmentMeta>>();

@@ -372,4 +372,21 @@ mod tests {
        assert_eq!(merge_candidates[0].0.len(), 1);
        assert_eq!(merge_candidates[0].0[0], test_input[1].id());
    }
+
+    #[test]
+    fn test_max_docs_before_merge_large_value() {
+        // Regression test: (max_docs_before_merge as u32) truncates values > u32::MAX.
+        // Casting num_docs() to usize instead avoids the truncation.
+        let mut policy = LogMergePolicy::default();
+        policy.set_min_num_segments(2);
+        policy.set_max_docs_before_merge(5_000_000_000usize);
+        let test_input = vec![
+            create_random_segment_meta(100_000),
+            create_random_segment_meta(100_000),
+        ];
+        let result = policy.compute_merge_candidates(&test_input);
+        // Both segments should be eligible (100_000 < 5_000_000_000)
+        assert_eq!(result.len(), 1);
+        assert_eq!(result[0].0.len(), 2);
+    }
 }
--- a/src/indexer/segment_updater.rs
+++ b/src/indexer/segment_updater.rs
@@ -403,7 +403,8 @@ impl SegmentUpdater {
            // from the different drives.
            //
            // Segment 1 from disk 1, Segment 1 from disk 2, etc.
-            committed_segment_metas.sort_by_key(|segment_meta| -(segment_meta.max_doc() as i32));
+            committed_segment_metas
+                .sort_by_key(|segment_meta| std::cmp::Reverse(segment_meta.max_doc()));
            let index_meta = IndexMeta {
                index_settings: index.settings().clone(),
                segments: committed_segment_metas,
@@ -705,6 +706,7 @@ mod tests {
    use crate::collector::TopDocs;
    use crate::directory::RamDirectory;
    use crate::fastfield::AliveBitSet;
+    use crate::index::{SegmentId, SegmentMetaInventory};
    use crate::indexer::merge_policy::tests::MergeWheneverPossible;
    use crate::indexer::merger::IndexMerger;
    use crate::indexer::segment_updater::merge_filtered_segments;
@@ -712,6 +714,22 @@ mod tests {
    use crate::schema::*;
    use crate::{Directory, DocAddress, Index, Segment};

+    #[test]
+    fn test_segment_sort_large_max_doc() {
+        // Regression test: -(max_doc as i32) overflows for max_doc >= 2^31.
+        // Using std::cmp::Reverse avoids this.
+        let inventory = SegmentMetaInventory::default();
+        let mut metas = vec![
+            inventory.new_segment_meta(SegmentId::generate_random(), 100),
+            inventory.new_segment_meta(SegmentId::generate_random(), (1u32 << 31) - 1),
+            inventory.new_segment_meta(SegmentId::generate_random(), 50_000),
+        ];
+        metas.sort_by_key(|m| std::cmp::Reverse(m.max_doc()));
+        assert_eq!(metas[0].max_doc(), (1u32 << 31) - 1);
+        assert_eq!(metas[1].max_doc(), 50_000);
+        assert_eq!(metas[2].max_doc(), 100);
+    }
+
    #[test]
    fn test_delete_during_merge() -> crate::Result<()> {
        let mut schema_builder = Schema::builder();
--- a/src/lib.rs
+++ b/src/lib.rs
@@ -169,8 +169,10 @@ mod macros;
 mod future_result;

 // Re-exports
+pub use columnar;
 pub use common::{ByteCount, DateTime};
-pub use {columnar, query_grammar, time};
+pub use query_grammar;
+pub use time;

 pub use crate::error::TantivyError;
 pub use crate::future_result::FutureResult;
--- a/src/postings/mod.rs
+++ b/src/postings/mod.rs
@@ -14,7 +14,8 @@ mod postings;
 mod postings_writer;
 mod recorder;
 mod segment_postings;
-mod serializer;
+/// Serializer module for the inverted index
+pub mod serializer;
 mod skip;
 mod term_info;

--- a/src/postings/serializer.rs
+++ b/src/postings/serializer.rs
@@ -11,7 +11,7 @@ use crate::positions::PositionSerializer;
 use crate::postings::compression::{BlockEncoder, VIntEncoder, COMPRESSION_BLOCK_SIZE};
 use crate::postings::skip::SkipSerializer;
 use crate::query::Bm25Weight;
-use crate::schema::{Field, FieldEntry, FieldType, IndexRecordOption, Schema};
+use crate::schema::{Field, FieldEntry, IndexRecordOption, Schema};
 use crate::termdict::TermDictionaryBuilder;
 use crate::{DocId, Score};

@@ -80,9 +80,12 @@ impl InvertedIndexSerializer {
        let term_dictionary_write = self.terms_write.for_field(field);
        let postings_write = self.postings_write.for_field(field);
        let positions_write = self.positions_write.for_field(field);
-        let field_type: FieldType = (*field_entry.field_type()).clone();
+        let index_record_option = field_entry
+            .field_type()
+            .index_record_option()
+            .unwrap_or(IndexRecordOption::Basic);
        FieldSerializer::create(
-            &field_type,
+            index_record_option,
            total_num_tokens,
            term_dictionary_write,
            postings_write,
@@ -102,29 +105,27 @@ impl InvertedIndexSerializer {

 /// The field serializer is in charge of
 /// the serialization of a specific field.
-pub struct FieldSerializer<'a> {
-    term_dictionary_builder: TermDictionaryBuilder<&'a mut CountingWriter<WritePtr>>,
+pub struct FieldSerializer<'a, W: Write = WritePtr> {
+    term_dictionary_builder: TermDictionaryBuilder<&'a mut CountingWriter<W>>,
    postings_serializer: PostingsSerializer,
-    positions_serializer_opt: Option<PositionSerializer<&'a mut CountingWriter<WritePtr>>>,
+    positions_serializer_opt: Option<PositionSerializer<&'a mut CountingWriter<W>>>,
    current_term_info: TermInfo,
    term_open: bool,
-    postings_write: &'a mut CountingWriter<WritePtr>,
+    postings_write: &'a mut CountingWriter<W>,
    postings_start_offset: u64,
 }

-impl<'a> FieldSerializer<'a> {
-    fn create(
-        field_type: &FieldType,
+impl<'a, W: Write> FieldSerializer<'a, W> {
+    /// Creates a new `FieldSerializer` for the given field type.
+    pub fn create(
+        index_record_option: IndexRecordOption,
        total_num_tokens: u64,
-        term_dictionary_write: &'a mut CountingWriter<WritePtr>,
-        postings_write: &'a mut CountingWriter<WritePtr>,
-        positions_write: &'a mut CountingWriter<WritePtr>,
+        term_dictionary_write: &'a mut CountingWriter<W>,
+        postings_write: &'a mut CountingWriter<W>,
+        positions_write: &'a mut CountingWriter<W>,
        fieldnorm_reader: Option<FieldNormReader>,
-    ) -> io::Result<FieldSerializer<'a>> {
+    ) -> io::Result<FieldSerializer<'a, W>> {
        total_num_tokens.serialize(postings_write)?;
-        let index_record_option = field_type
-            .index_record_option()
-            .unwrap_or(IndexRecordOption::Basic);
        let term_dictionary_builder = TermDictionaryBuilder::create(term_dictionary_write)?;
        let average_fieldnorm = fieldnorm_reader
            .as_ref()
@@ -192,6 +193,11 @@ impl<'a> FieldSerializer<'a> {
        Ok(())
    }

+    /// Starts the postings for a new term without recording term frequencies.
+    pub fn new_term_without_freq(&mut self, term: &[u8]) -> io::Result<()> {
+        self.new_term(term, 0, false)
+    }
+
    /// Serialize the information that a document contains for the current term:
    /// its term frequency, and the position deltas.
    ///
@@ -297,6 +303,7 @@ impl Block {
    }
 }

+/// Serializer for postings lists.
 pub struct PostingsSerializer {
    last_doc_id_encoded: u32,

@@ -316,6 +323,9 @@ pub struct PostingsSerializer {
 }

 impl PostingsSerializer {
+    /// Creates a new `PostingsSerializer`.
+    /// * avg_fieldnorm - average field norm for the field being serialized.
+    /// * mode - indexing options for the field being serialized.
    pub fn new(
        avg_fieldnorm: Score,
        mode: IndexRecordOption,
@@ -338,6 +348,8 @@ impl PostingsSerializer {
        }
    }

+    /// Starts the serialization for a new term.
+    /// * term_doc_freq - the number of documents containing the term.
    pub fn new_term(&mut self, term_doc_freq: u32, record_term_freq: bool) {
        self.bm25_weight = None;

@@ -377,6 +389,7 @@ impl PostingsSerializer {
            self.postings_write.extend(block_encoded);
        }
        if self.term_has_freq {
+            // encode the term frequencies
            let (num_bits, block_encoded): (u8, &[u8]) = self
                .block_encoder
                .compress_block_unsorted(self.block.term_freqs(), true);
@@ -417,6 +430,9 @@ impl PostingsSerializer {
        self.block.clear();
    }

+    /// Register that the given document contains the current term.
+    /// * doc_id - the document id.
+    /// * term_freq - the term frequency within the document.
    pub fn write_doc(&mut self, doc_id: DocId, term_freq: u32) {
        self.block.append_doc(doc_id, term_freq);
        if self.block.is_full() {
@@ -424,6 +440,7 @@ impl PostingsSerializer {
        }
    }

+    /// Finish the serialization for this term.
    pub fn close_term(
        &mut self,
        doc_freq: u32,
--- a/src/postings/skip.rs
+++ b/src/postings/skip.rs
@@ -14,7 +14,11 @@ use crate::{DocId, Score, TERMINATED};
 //   (requiring a 6th bit), but the biggest doc_id we can want to encode is TERMINATED-1, which can
 //   be represented on 31b without delta encoding.
 fn encode_bitwidth(bitwidth: u8, delta_1: bool) -> u8 {
-    assert!(bitwidth < 32);
+    assert!(
+        bitwidth < 32,
+        "bitwidth needs to be less than 32, but got {}",
+        bitwidth
+    );
    bitwidth | ((delta_1 as u8) << 6)
 }

--- a/sstable/src/lib.rs
+++ b/sstable/src/lib.rs
@@ -51,7 +51,7 @@ mod sstable_index_v3;
 pub use sstable_index_v3::{BlockAddr, SSTableIndex, SSTableIndexBuilder, SSTableIndexV3};
 mod sstable_index_v2;
 pub(crate) mod vint;
-pub use dictionary::{Dictionary, TermOrdHit};
+pub use dictionary::Dictionary;
 pub use streamer::{Streamer, StreamerBuilder};

 mod block_reader;
@@ -302,8 +302,9 @@ where
            || self.previous_key[keep_len] < key[keep_len];
        assert!(
            increasing_keys,
-            "Keys should be increasing. ({:?} > {key:?})",
-            self.previous_key
+            "Keys should be increasing. ({:?} > {:?})",
+            String::from_utf8_lossy(&self.previous_key),
+            String::from_utf8_lossy(key),
        );
        self.previous_key.resize(key.len(), 0u8);
        self.previous_key[keep_len..].copy_from_slice(&key[keep_len..]);
Author	SHA1	Message	Date
Pascal Seitz	472286a045	update CHANGELOG for tantivy 0.26 release	2026-03-18 19:14:45 +01:00
Paul Masurel	68a9066d13	Fix format (#2852 ) Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2026-03-16 10:43:39 +01:00
Paul Masurel	d02559a4d1	Update time deps to defensively address a vulnerability. (#2850 ) Closes #2849 Co-authored-by: Paul Masurel <paul.masurel@datadoghq.com>	2026-03-12 16:47:11 +01:00
Anas Limem	1922abaf33	Fixed integer overflow in segment sorting and merge policy truncation (#2846 )	2026-03-12 16:44:38 +01:00
trinity-1686a	d0c5ffb0aa	Merge pull request #2842 from quickwit-oss/congxie/replaceHll Use sketches-ddsketch fork with Java-compatible binary encoding	2026-02-20 16:56:56 +01:00
cong.xie	18fedd9384	Fix nightly fmt: merge crate imports in percentiles tests Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-19 14:18:54 -05:00
cong.xie	2098fca47f	Restore use_serde feature and simplify PercentilesCollector Keep use_serde on sketches-ddsketch so DDSketch derives Serialize/Deserialize, removing the need for custom impls on PercentilesCollector. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-19 14:13:17 -05:00
cong.xie	1251b40c93	Drop use_serde feature; use Java binary encoding for PercentilesCollector Replace the derived Serialize/Deserialize on PercentilesCollector with custom impls that use DDSketch's Java-compatible binary encoding (encode_to_java_bytes / decode_from_java_bytes). This removes the need for the use_serde feature on sketches-ddsketch entirely. Also restore original float test values and use assert_nearly_equals! for all float comparisons in percentile tests, since DDSketch quantile estimates can have minor precision differences across platforms. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-19 13:32:28 -05:00
cong.xie	09a49b872c	Use assert_nearly_equals! for float comparisons in percentile test Address review feedback: replace assert_eq! with assert_nearly_equals! for float values that go through JSON serialization roundtrips, which can introduce minor precision differences. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-19 13:21:48 -05:00
cong.xie	b9ace002ce	Replace vendored sketches-ddsketch with git dependency Move the vendored sketches-ddsketch crate (with Java-compatible binary encoding) to its own repo at quickwit-oss/rust-sketches-ddsketch and reference it via git+rev in Cargo.toml. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-19 12:22:19 -05:00
Adrien Guillo	51f340f83d	Merge pull request #2837 from quickwit-oss/congxie/replaceHll Replace hyperloglogplus with Apache DataSketches HLL (lg_k=11)	2026-02-12 17:19:40 -05:00
PSeitz	57fe659fff	make serializer pub (#2835 ) some changes on the posting list serializer to make it usable in other contexts. Improve errors Signed-off-by: Pascal Seitz <pascal.seitz@gmail.com>	2026-02-11 14:37:42 +01:00