rust/tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-05-27 13:40:49 +00:00

Files

History

alexanderbianchi 3cd9011f87 Make BucketEntries::iter, PercentileValuesVecEntry fields, and TopNComputer::threshold public (#2890 )

These items need to be accessible from the tantivy-datafusion crate:
- BucketEntries::iter() for iterating aggregation bucket results
- PercentileValuesVecEntry.key/.value for reading percentile results
- TopNComputer.threshold for Block-WAND score pruning in the inverted index provider

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Paul Masurel <paul@quickwit.io>

2026-04-09 13:32:31 +02:00

..

Fix clippy warnings: deprecated gen_range, manual div_ceil, legacy import (#2860 )

2026-03-26 07:37:26 -04:00

Make BucketEntries::iter, PercentileValuesVecEntry fields, and TopNComputer::threshold public (#2890 )

2026-04-09 13:32:31 +02:00

accessor_helpers.rs

Optimize term aggregation with low cardinality + some refactoring (#2740 )

2025-11-21 14:46:29 +01:00

agg_data.rs

Composite agg merge (#2856 )

2026-03-18 17:28:59 +01:00

agg_limits.rs

Optimize term aggregation with low cardinality + some refactoring (#2740 )

2025-11-21 14:46:29 +01:00

agg_req.rs

Composite agg merge (#2856 )

2026-03-18 17:28:59 +01:00

agg_result.rs

Make BucketEntries::iter, PercentileValuesVecEntry fields, and TopNComputer::threshold public (#2890 )

2026-04-09 13:32:31 +02:00

agg_tests.rs

one collector per agg request instead per bucket (#2759 )

2026-01-06 11:50:55 +01:00

cached_sub_aggs.rs

one collector per agg request instead per bucket (#2759 )

2026-01-06 11:50:55 +01:00

collector.rs

one collector per agg request instead per bucket (#2759 )

2026-01-06 11:50:55 +01:00

date.rs

Inline format arguments where makes sense (#2038 )

2023-05-10 18:03:59 +09:00

error.rs

add percentiles aggregations (#1984 )

2023-04-07 07:18:28 +02:00

intermediate_agg_result.rs

Composite agg merge (#2856 )

2026-03-18 17:28:59 +01:00

mod.rs

one collector per agg request instead per bucket (#2759 )

2026-01-06 11:50:55 +01:00

README.md

Replace AggregationsWithAccessor (#2715 )

2025-10-14 09:22:11 +02:00

segment_agg_result.rs

one collector per agg request instead per bucket (#2759 )

2026-01-06 11:50:55 +01:00

README.md

Contributing

When adding new bucket aggregation make sure to extend the "test_aggregation_flushing" test for at least 2 levels.

Code Organization

Tantivy's aggregations have been designed to mimic the aggregations of elasticsearch.

The code is organized in submodules:

bucket

Contains all bucket aggregations, like range aggregation. These bucket aggregations group documents into buckets and can contain sub-aggregations.

metric

Contains all metric aggregations, like average aggregation. Metric aggregations do not have sub aggregations.

agg_req

agg_req contains the users aggregation request. Deserialization from json is compatible with elasticsearch aggregation requests.

agg_data

agg_data contains the users aggregation request enriched with fast field accessors etc, which are used during collection.

segment_agg_result

segment_agg_result contains the aggregation result tree, which is used for collection of a segment. agg_data is passed during collection.

intermediate_agg_result

intermediate_agg_result contains the aggregation tree for merging with other trees.

agg_result

agg_result contains the final aggregation tree.