mirror of https://github.com/quickwit-oss/tantivy.git
synced 2025-12-29 13:32:54 +00:00

Compare commits: warming...fix_open_b (15 commits)
| Author | SHA1 | Date |
|---|---|---|
| | 9c12860b01 | |
| | 886245ad21 | |
| | 505e6a440c | |
| | fcd651f6a9 | |
| | e6653228a9 | |
| | bdedefe07d | |
| | 13a4473faa | |
| | 2069e3e52b | |
| | 0d8263cba1 | |
| | 65b365b81c | |
| | 4c1366da87 | |
| | eca6628b3c | |
| | 9679c5f306 | |
| | 5a2497b6fd | |
| | 99d4b1a177 | |
.github/workflows/long_running.yml (2 changes, vendored)

@@ -1,4 +1,4 @@
-name: Rust
+name: Long running tests
 
 on:
   push:
.github/workflows/test.yml (20 changes, vendored)

@@ -1,4 +1,4 @@
-name: Rust
+name: Unit tests
 
 on:
   push:
@@ -21,10 +21,22 @@ jobs:
       - name: Install latest nightly to test also against unstable feature flag
         uses: actions-rs/toolchain@v1
         with:
-          toolchain: stable
+          toolchain: nightly
           override: true
           components: rustfmt
+      - name: Install latest nightly to test also against unstable feature flag
+        uses: actions-rs/toolchain@v1
+        with:
+          toolchain: stable
+          override: true
+          components: rustfmt, clippy
       - name: Run tests
-        run: cargo test --features mmap,brotli-compression,lz4-compression,snappy-compression,failpoints --verbose --workspace
+        run: cargo +stable test --features mmap,brotli-compression,lz4-compression,snappy-compression,failpoints --verbose --workspace
       - name: Check Formatting
-        run: cargo fmt --all -- --check
+        run: cargo +nightly fmt --all -- --check
+      - uses: actions-rs/clippy-check@v1
+        with:
+          toolchain: stable
+          token: ${{ secrets.GITHUB_TOKEN }}
+          args: --tests
CHANGELOG.md (12 changes)

@@ -1,12 +1,12 @@
 Tantivy 0.17
 ================================
-- LogMergePolicy now triggers merges if the ratio of deleted documents reaches a threshold (@shikhar) [#115](https://github.com/quickwit-inc/tantivy/issues/115)
-- Adds a searcher Warmer API (@shikhar)
+- LogMergePolicy now triggers merges if the ratio of deleted documents reaches a threshold (@shikhar @fulmicoton) [#115](https://github.com/quickwit-oss/tantivy/issues/115)
+- Adds a searcher Warmer API (@shikhar @fulmicoton)
 - Change to non-strict schema. Ignore fields in data which are not defined in schema. Previously this returned an error. #1211
 - Facets are necessarily indexed. Existing index with indexed facets should work out of the box. Index without facets that are marked with index: false should be broken (but they were already broken in a sense). (@fulmicoton) #1195 .
-- Bugfix that could in theory impact durability in theory on some filesystems [#1224](https://github.com/quickwit-inc/tantivy/issues/1224)
-- Schema now offers not indexing fieldnorms (@lpouget) [#922](https://github.com/quickwit-inc/tantivy/issues/922)
-- Reduce the number of fsync calls [#1225](https://github.com/quickwit-inc/tantivy/issues/1225)
+- Bugfix that could in theory impact durability in theory on some filesystems [#1224](https://github.com/quickwit-oss/tantivy/issues/1224)
+- Schema now offers not indexing fieldnorms (@lpouget) [#922](https://github.com/quickwit-oss/tantivy/issues/922)
+- Reduce the number of fsync calls [#1225](https://github.com/quickwit-oss/tantivy/issues/1225)
 
 Tantivy 0.16.2
 ================================
@@ -128,7 +128,7 @@ Tantivy 0.12.0
 ## How to update?
 
 Crates relying on custom tokenizer, or registering tokenizer in the manager will require some
-minor changes. Check https://github.com/quickwit-inc/tantivy/blob/main/examples/custom_tokenizer.rs
+minor changes. Check https://github.com/quickwit-oss/tantivy/blob/main/examples/custom_tokenizer.rs
 to check for some code sample.
 
 Tantivy 0.11.3
@@ -6,8 +6,8 @@ license = "MIT"
 categories = ["database-implementations", "data-structures"]
 description = """Search engine library"""
 documentation = "https://docs.rs/tantivy/"
-homepage = "https://github.com/quickwit-inc/tantivy"
-repository = "https://github.com/quickwit-inc/tantivy"
+homepage = "https://github.com/quickwit-oss/tantivy"
+repository = "https://github.com/quickwit-oss/tantivy"
 readme = "README.md"
 keywords = ["search", "information", "retrieval"]
 edition = "2018"
@@ -52,7 +52,7 @@ chrono = "0.4.19"
 smallvec = "1.6.1"
 rayon = "1.5"
 lru = "0.7.0"
-fastdivide = "0.3"
+fastdivide = "0.4"
 itertools = "0.10.0"
 measure_time = "0.8.0"
@@ -95,9 +95,6 @@ unstable = [] # useful for benches.
 [workspace]
 members = ["query-grammar", "bitpacker", "common", "fastfield_codecs", "ownedbytes"]
 
-[badges]
-travis-ci = { repository = "tantivy-search/tantivy" }
-
 # Following the "fail" crate best practises, we isolate
 # tests that define specific behavior in fail check points
 # in a different binary.
Makefile (3 changes)

@@ -1,3 +1,6 @@
 test:
 	echo "Run test only... No examples."
 	cargo test --tests --lib
+
+fmt:
+	cargo +nightly fmt --all
README.md (25 changes)

@@ -1,22 +1,13 @@
 [](https://docs.rs/crate/tantivy/)
-[](https://github.com/quickwit-inc/tantivy/actions/workflows/test.yml)
-[](https://codecov.io/gh/quickwit-inc/tantivy)
+[](https://github.com/quickwit-oss/tantivy/actions/workflows/test.yml)
+[](https://codecov.io/gh/quickwit-oss/tantivy)
 [](https://discord.gg/MT27AG5EVE)
 [](https://opensource.org/licenses/MIT)
 [](https://crates.io/crates/tantivy)
 
 
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/0)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/1)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/2)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/3)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/4)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/5)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/6)
-[](https://sourcerer.io/fame/fulmicoton/tantivy-search/tantivy/links/7)
 
 **Tantivy** is a **full text search engine library** written in Rust.
 
 It is closer to [Apache Lucene](https://lucene.apache.org/) than to [Elasticsearch](https://www.elastic.co/products/elasticsearch) or [Apache Solr](https://lucene.apache.org/solr/) in the sense it is not
@@ -27,7 +18,7 @@ Tantivy is, in fact, strongly inspired by Lucene's design.
 
 # Benchmark
 
-The following [benchmark](https://tantivy-search.github.io/bench/) break downs 
+The following [benchmark](https://tantivy-search.github.io/bench/) break downs
 performance for different type of queries / collection.
 
 Your mileage WILL vary depending on the nature of queries and their load.
@@ -35,7 +26,7 @@ Your mileage WILL vary depending on the nature of queries and their load.
 # Features
 
 - Full-text search
-- Configurable tokenizer (stemming available for 17 Latin languages with third party support for Chinese ([tantivy-jieba](https://crates.io/crates/tantivy-jieba) and [cang-jie](https://crates.io/crates/cang-jie)), Japanese ([lindera](https://github.com/lindera-morphology/lindera-tantivy) and [tantivy-tokenizer-tiny-segmenter](https://crates.io/crates/tantivy-tokenizer-tiny-segmenter)) and Korean ([lindera](https://github.com/lindera-morphology/lindera-tantivy) + [lindera-ko-dic-builder](https://github.com/lindera-morphology/lindera-ko-dic-builder))
+- Configurable tokenizer (stemming available for 17 Latin languages with third party support for Chinese ([tantivy-jieba](https://crates.io/crates/tantivy-jieba) and [cang-jie](https://crates.io/crates/cang-jie)), Japanese ([lindera](https://github.com/lindera-morphology/lindera-tantivy), [Vaporetto](https://crates.io/crates/vaporetto_tantivy), and [tantivy-tokenizer-tiny-segmenter](https://crates.io/crates/tantivy-tokenizer-tiny-segmenter)) and Korean ([lindera](https://github.com/lindera-morphology/lindera-tantivy) + [lindera-ko-dic-builder](https://github.com/lindera-morphology/lindera-ko-dic-builder))
 - Fast (check out the :racehorse: :sparkles: [benchmark](https://tantivy-search.github.io/bench/) :sparkles: :racehorse:)
 - Tiny startup time (<10ms), perfect for command line tools
 - BM25 scoring (the same as Lucene)
@@ -57,7 +48,7 @@ Your mileage WILL vary depending on the nature of queries and their load.
 ## Non-features
 
 - Distributed search is out of the scope of Tantivy. That being said, Tantivy is a
-library upon which one could build a distributed search. Serializable/mergeable collector state for instance, 
+library upon which one could build a distributed search. Serializable/mergeable collector state for instance,
 are within the scope of Tantivy.
 
 
@@ -66,14 +57,14 @@ are within the scope of Tantivy.
 Tantivy works on stable Rust (>= 1.27) and supports Linux, MacOS, and Windows.
 
 - [Tantivy's simple search example](https://tantivy-search.github.io/examples/basic_search.html)
-- [tantivy-cli and its tutorial](https://github.com/tantivy-search/tantivy-cli) - `tantivy-cli` is an actual command line interface that makes it easy for you to create a search engine,
+- [tantivy-cli and its tutorial](https://github.com/quickwit-oss/tantivy-cli) - `tantivy-cli` is an actual command line interface that makes it easy for you to create a search engine,
 index documents, and search via the CLI or a small server with a REST API.
 It walks you through getting a wikipedia search engine up and running in a few minutes.
 - [Reference doc for the last released version](https://docs.rs/tantivy/)
 
 # How can I support this project?
 
-There are many ways to support this project. 
+There are many ways to support this project.
 
 - Use Tantivy and tell us about your experience on [Discord](https://discord.gg/MT27AG5EVE) or by email (paul.masurel@gmail.com)
 - Report bugs
@@ -92,7 +83,7 @@ Tantivy compiles on stable Rust but requires `Rust >= 1.27`.
 To check out and run tests, you can simply run:
 
 ```bash
-git clone https://github.com/quickwit-inc/tantivy.git
+git clone https://github.com/quickwit-oss/tantivy.git
 cd tantivy
 cargo build
 ```
@@ -6,7 +6,7 @@ authors = ["Paul Masurel <paul.masurel@gmail.com>"]
 license = "MIT"
 categories = []
 description = """Tantivy-sub crate: bitpacking"""
-repository = "https://github.com/quickwit-inc/tantivy"
+repository = "https://github.com/quickwit-oss/tantivy"
 keywords = []
@@ -1,4 +1,5 @@
-use std::{convert::TryInto, io};
+use std::convert::TryInto;
+use std::io;
 
 pub struct BitPacker {
     mini_buffer: u64,
@@ -1,12 +1,11 @@
-use super::bitpacker::BitPacker;
-use super::compute_num_bits;
 use crate::{minmax, BitUnpacker};
+use super::{bitpacker::BitPacker, compute_num_bits};
 
 const BLOCK_SIZE: usize = 128;
 
 /// `BlockedBitpacker` compresses data in blocks of
 /// 128 elements, while keeping an index on it
 ///
 #[derive(Debug, Clone)]
 pub struct BlockedBitpacker {
     // bitpacked blocks
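For orientation, a small usage sketch of the type this hunk touches. The `new`/`add`/`get` method names are assumptions about this crate's API and do not appear in the diff itself:

```rust
// Hypothetical usage sketch; `new`, `add` and `get` are assumed method
// names for tantivy_bitpacker::BlockedBitpacker, not shown in this diff.
use tantivy_bitpacker::BlockedBitpacker;

fn main() {
    // Values are bitpacked in blocks of BLOCK_SIZE = 128 elements,
    // with a small per-block index kept for cheap random access.
    let mut packer = BlockedBitpacker::new();
    for val in 0u64..300 {
        packer.add(val * 3);
    }
    // Element 7 was stored as 7 * 3 = 21.
    assert_eq!(packer.get(7), 21);
}
```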
@@ -1,8 +1,7 @@
 mod bitpacker;
 mod blocked_bitpacker;
 
-pub use crate::bitpacker::BitPacker;
-pub use crate::bitpacker::BitUnpacker;
+pub use crate::bitpacker::{BitPacker, BitUnpacker};
 pub use crate::blocked_bitpacker::BlockedBitpacker;
 
 /// Computes the number of bits that will be used for bitpacking.
@@ -1,8 +1,8 @@
-use ownedbytes::OwnedBytes;
 use std::convert::TryInto;
 use std::io::Write;
-use std::u64;
-use std::{fmt, io};
+use std::{fmt, io, u64};
+
+use ownedbytes::OwnedBytes;
 
 #[derive(Clone, Copy, Eq, PartialEq)]
 pub struct TinySet(u64);
@@ -187,7 +186,6 @@ fn num_buckets(max_val: u32) -> u32 {
 
 impl BitSet {
     /// serialize a `BitSet`.
-    ///
     pub fn serialize<T: Write>(&self, writer: &mut T) -> io::Result<()> {
         writer.write_all(self.max_value.to_le_bytes().as_ref())?;
         for tinyset in self.tinysets.iter().cloned() {
@@ -353,7 +352,6 @@ impl ReadOnlyBitSet {
     }
 
     /// Iterate the tinyset on the fly from serialized data.
-    ///
     #[inline]
     fn iter_tinysets(&self) -> impl Iterator<Item = TinySet> + '_ {
         self.data.chunks_exact(8).map(move |chunk| {
@@ -363,7 +361,6 @@ impl ReadOnlyBitSet {
     }
 
     /// Iterate over the positions of the elements.
-    ///
     #[inline]
     pub fn iter(&self) -> impl Iterator<Item = u32> + '_ {
         self.iter_tinysets()
@@ -415,14 +412,14 @@ impl<'a> From<&'a BitSet> for ReadOnlyBitSet {
 #[cfg(test)]
 mod tests {
-    use super::BitSet;
-    use super::ReadOnlyBitSet;
-    use super::TinySet;
+    use std::collections::HashSet;
+
     use ownedbytes::OwnedBytes;
     use rand::distributions::Bernoulli;
     use rand::rngs::StdRng;
     use rand::{Rng, SeedableRng};
-    use std::collections::HashSet;
+
+    use super::{BitSet, ReadOnlyBitSet, TinySet};
 
     #[test]
     fn test_read_serialized_bitset_full_multi() {
@@ -443,7 +440,7 @@ mod tests {
         bitset.serialize(&mut out).unwrap();
 
         let bitset = ReadOnlyBitSet::open(OwnedBytes::new(out));
-        assert_eq!(bitset.len() as usize, 64 as usize);
+        assert_eq!(bitset.len() as usize, 64);
     }
 
     #[test]
@@ -710,10 +707,10 @@ mod tests {
 #[cfg(all(test, feature = "unstable"))]
 mod bench {
-    use super::BitSet;
-    use super::TinySet;
     use test;
 
+    use super::{BitSet, TinySet};
+
     #[bench]
     fn bench_tinyset_pop(b: &mut test::Bencher) {
         b.iter(|| {
@@ -104,11 +104,12 @@ pub fn u64_to_f64(val: u64) -> f64 {
 #[cfg(test)]
 pub mod test {
-    use super::{f64_to_u64, i64_to_u64, u64_to_f64, u64_to_i64};
-    use super::{BinarySerializable, FixedSize};
-    use proptest::prelude::*;
     use std::f64;
 
+    use proptest::prelude::*;
+
+    use super::{f64_to_u64, i64_to_u64, u64_to_f64, u64_to_i64, BinarySerializable, FixedSize};
+
     fn test_i64_converter_helper(val: i64) {
         assert_eq!(u64_to_i64(i64_to_u64(val)), val);
     }
@@ -157,10 +158,10 @@ pub mod test {
     #[test]
     fn test_f64_order() {
         assert!(!(f64_to_u64(f64::NEG_INFINITY)..f64_to_u64(f64::INFINITY))
-            .contains(&f64_to_u64(f64::NAN))); //nan is not a number
-        assert!(f64_to_u64(1.5) > f64_to_u64(1.0)); //same exponent, different mantissa
-        assert!(f64_to_u64(2.0) > f64_to_u64(1.0)); //same mantissa, different exponent
-        assert!(f64_to_u64(2.0) > f64_to_u64(1.5)); //different exponent and mantissa
+            .contains(&f64_to_u64(f64::NAN))); // nan is not a number
+        assert!(f64_to_u64(1.5) > f64_to_u64(1.0)); // same exponent, different mantissa
+        assert!(f64_to_u64(2.0) > f64_to_u64(1.0)); // same mantissa, different exponent
+        assert!(f64_to_u64(2.0) > f64_to_u64(1.5)); // different exponent and mantissa
         assert!(f64_to_u64(1.0) > f64_to_u64(-1.0)); // pos > neg
         assert!(f64_to_u64(-1.5) < f64_to_u64(-1.0));
         assert!(f64_to_u64(-2.0) < f64_to_u64(1.0));
@@ -1,10 +1,9 @@
-use crate::Endianness;
-use crate::VInt;
+use std::io::{Read, Write};
+use std::{fmt, io};
+
 use byteorder::{ReadBytesExt, WriteBytesExt};
-use std::fmt;
-use std::io;
-use std::io::Read;
-use std::io::Write;
+
+use crate::{Endianness, VInt};
 
 /// Trait for a simple binary serialization.
 pub trait BinarySerializable: fmt::Debug + Sized {
@@ -202,8 +201,7 @@ impl BinarySerializable for String {
 #[cfg(test)]
 pub mod test {
-    use super::VInt;
-    use super::*;
+    use super::{VInt, *};
     use crate::serialize::BinarySerializable;
 
     pub fn fixed_size_test<O: BinarySerializable + FixedSize + Default>() {
         let mut buffer = Vec::new();
@@ -1,8 +1,9 @@
-use super::BinarySerializable;
-use byteorder::{ByteOrder, LittleEndian};
 use std::io;
-use std::io::Read;
-use std::io::Write;
+use std::io::{Read, Write};
+
+use byteorder::{ByteOrder, LittleEndian};
+
+use super::BinarySerializable;
 
 /// Wrapper over a `u64` that serializes as a variable int.
 #[derive(Clone, Copy, Debug, Eq, PartialEq)]
@@ -174,9 +175,7 @@ impl BinarySerializable for VInt {
 #[cfg(test)]
 mod tests {
-
-    use super::serialize_vint_u32;
-    use super::BinarySerializable;
-    use super::VInt;
+    use super::{serialize_vint_u32, BinarySerializable, VInt};
 
     fn aux_test_vint(val: u64) {
         let mut v = [14u8; 10];
@@ -54,7 +54,8 @@ impl<W: TerminatingWrite> TerminatingWrite for CountingWriter<W> {
     }
 }
 
-/// Struct used to prevent from calling [`terminate_ref`](trait.TerminatingWrite.html#tymethod.terminate_ref) directly
+/// Struct used to prevent from calling
+/// [`terminate_ref`](trait.TerminatingWrite.html#tymethod.terminate_ref) directly
 ///
 /// The point is that while the type is public, it cannot be built by anyone
 /// outside of this module.
@@ -64,9 +65,7 @@ pub struct AntiCallToken(());
 pub trait TerminatingWrite: Write {
     /// Indicate that the writer will no longer be used. Internally call terminate_ref.
     fn terminate(mut self) -> io::Result<()>
-    where
-        Self: Sized,
-    {
+    where Self: Sized {
         self.terminate_ref(AntiCallToken(()))
     }
@@ -97,9 +96,10 @@ impl<'a> TerminatingWrite for &'a mut Vec<u8> {
 #[cfg(test)]
 mod test {
-    use super::CountingWriter;
     use std::io::Write;
 
+    use super::CountingWriter;
+
     #[test]
     fn test_counting_writer() {
         let buffer: Vec<u8> = vec![];
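The first hunk in this file only reflows the doc comment, but the "unforgeable token" idiom it documents is worth spelling out. A self-contained sketch using the names visible in the diff; the exact `terminate_ref` signature is an assumption, since only its doc link appears above:

```rust
use std::io::{self, Write};

/// A public type whose only constructor is private to its module, so code
/// outside the module can never conjure a token and call `terminate_ref`
/// directly; it has to go through the consuming `terminate`.
pub struct AntiCallToken(());

pub trait TerminatingWrite: Write {
    /// Consumes the writer, guaranteeing `terminate_ref` runs exactly once.
    fn terminate(mut self) -> io::Result<()>
    where Self: Sized {
        self.terminate_ref(AntiCallToken(()))
    }

    /// Implementors flush/finalize here; callable only with a token
    /// (assumed signature, mirroring the doc link in the hunk above).
    fn terminate_ref(&mut self, token: AntiCallToken) -> io::Result<()>;
}
```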
@@ -38,7 +38,7 @@ Note: Tantivy 0.16 does not do this optimization yet.
 In principle there are many algorithms possible that exploit the monotonically increasing nature. (aggregations maybe?)
 
 ## Usage
-The index sorting can be configured setting [`sort_by_field`](https://github.com/quickwit-inc/tantivy/blob/000d76b11a139a84b16b9b95060a1c93e8b9851c/src/core/index_meta.rs#L238) on `IndexSettings` and passing it to a `IndexBuilder`. As of tantvy 0.16 only fast fields are allowed to be used.
+The index sorting can be configured setting [`sort_by_field`](https://github.com/quickwit-oss/tantivy/blob/000d76b11a139a84b16b9b95060a1c93e8b9851c/src/core/index_meta.rs#L238) on `IndexSettings` and passing it to a `IndexBuilder`. As of tantvy 0.16 only fast fields are allowed to be used.
 
 ```
 let settings = IndexSettings {
@@ -55,7 +55,7 @@ let index = index_builder.create_in_ram().unwrap();
 
 ## Implementation details
 
-Sorting an index is applied in the serialization step. In general there are two serialization steps: [Finishing a single segment](https://github.com/quickwit-inc/tantivy/blob/000d76b11a139a84b16b9b95060a1c93e8b9851c/src/indexer/segment_writer.rs#L338) and [merging multiple segments](https://github.com/quickwit-inc/tantivy/blob/000d76b11a139a84b16b9b95060a1c93e8b9851c/src/indexer/merger.rs#L1073).
+Sorting an index is applied in the serialization step. In general there are two serialization steps: [Finishing a single segment](https://github.com/quickwit-oss/tantivy/blob/000d76b11a139a84b16b9b95060a1c93e8b9851c/src/indexer/segment_writer.rs#L338) and [merging multiple segments](https://github.com/quickwit-oss/tantivy/blob/000d76b11a139a84b16b9b95060a1c93e8b9851c/src/indexer/merger.rs#L1073).
 
 In both cases we generate a docid mapping reflecting the sort. This mapping is used when serializing the different components (doc store, fastfields, posting list, normfield, facets).
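The fenced example in the usage hunk is cut off by the diff's context window. A minimal self-contained sketch of the configuration it describes, assuming `IndexSettings`, `IndexSortByField` and `Order` are re-exported at the crate root as in tantivy 0.16:

```rust
use tantivy::schema::{Schema, FAST, INDEXED};
use tantivy::{Index, IndexSettings, IndexSortByField, Order};

fn main() -> tantivy::Result<()> {
    let mut schema_builder = Schema::builder();
    // As of tantivy 0.16, only fast fields may be used as the sort field.
    schema_builder.add_u64_field("intval", INDEXED | FAST);
    let schema = schema_builder.build();

    // Ask for every segment to be serialized sorted by `intval`, descending.
    let settings = IndexSettings {
        sort_by_field: Some(IndexSortByField {
            field: "intval".to_string(),
            order: Order::Desc,
        }),
        ..Default::default()
    };
    let _index = Index::builder()
        .schema(schema)
        .settings(settings)
        .create_in_ram()?;
    Ok(())
}
```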
@@ -73,7 +73,7 @@ fn main() -> tantivy::Result<()> {
     // multithreaded.
     //
     // Here we give tantivy a budget of `50MB`.
-    // Using a bigger heap for the indexer may increase
+    // Using a bigger memory_arena for the indexer may increase
     // throughput, but 50 MB is already plenty.
     let mut index_writer = index.writer(50_000_000)?;
@@ -91,8 +91,8 @@ fn main() -> tantivy::Result<()> {
     old_man_doc.add_text(title, "The Old Man and the Sea");
     old_man_doc.add_text(
         body,
-        "He was an old man who fished alone in a skiff in the Gulf Stream and \
-         he had gone eighty-four days now without taking a fish.",
+        "He was an old man who fished alone in a skiff in the Gulf Stream and he had gone \
+         eighty-four days now without taking a fish.",
     );
 
     // ... and add it to the `IndexWriter`.
@@ -12,8 +12,7 @@
 use tantivy::collector::{Collector, SegmentCollector};
 use tantivy::fastfield::{DynamicFastFieldReader, FastFieldReader};
 use tantivy::query::QueryParser;
-use tantivy::schema::Field;
-use tantivy::schema::{Schema, FAST, INDEXED, TEXT};
+use tantivy::schema::{Field, Schema, FAST, INDEXED, TEXT};
 use tantivy::{doc, Index, Score, SegmentReader};
 
 #[derive(Default)]
@@ -62,7 +62,7 @@ fn main() -> tantivy::Result<()> {
     // multithreaded.
     //
     // Here we use a buffer of 50MB per thread. Using a bigger
-    // heap for the indexer can increase its throughput.
+    // memory arena for the indexer can increase its throughput.
     let mut index_writer = index.writer(50_000_000)?;
     index_writer.add_document(doc!(
         title => "The Old Man and the Sea",
@@ -56,8 +56,9 @@ fn main() -> tantivy::Result<()> {
     // If it is `text`, let's make sure to keep it `raw` and let's avoid
     // running any text processing on it.
     // This is done by associating this field to the tokenizer named `raw`.
-    // Rather than building our [`TextOptions`](//docs.rs/tantivy/~0/tantivy/schema/struct.TextOptions.html) manually,
-    // We use the `STRING` shortcut. `STRING` stands for indexed (without term frequency or positions)
+    // Rather than building our
+    // [`TextOptions`](//docs.rs/tantivy/~0/tantivy/schema/struct.TextOptions.html) manually, We
+    // use the `STRING` shortcut. `STRING` stands for indexed (without term frequency or positions)
     // and untokenized.
     //
     // Because we also want to be able to see this `id` in our returned documents,
@@ -1,9 +1,9 @@
 use std::collections::HashSet;
 
 use tantivy::collector::TopDocs;
-use tantivy::doc;
 use tantivy::query::BooleanQuery;
 use tantivy::schema::*;
-use tantivy::{DocId, Index, Score, SegmentReader};
+use tantivy::{doc, DocId, Index, Score, SegmentReader};
 
 fn main() -> tantivy::Result<()> {
     let mut schema_builder = Schema::builder();
@@ -87,7 +87,7 @@ fn main() -> tantivy::Result<()> {
             .unwrap()
             .get_first(title)
             .unwrap()
-            .text()
+            .as_text()
             .unwrap()
             .to_owned()
     })
@@ -52,11 +52,11 @@ fn main() -> tantivy::Result<()> {
     let term_the = Term::from_field_text(title, "the");
 
     // This segment posting object is like a cursor over the documents matching the term.
-    // The `IndexRecordOption` arguments tells tantivy we will be interested in both term frequencies
-    // and positions.
+    // The `IndexRecordOption` arguments tells tantivy we will be interested in both term
+    // frequencies and positions.
     //
-    // If you don't need all this information, you may get better performance by decompressing less
-    // information.
+    // If you don't need all this information, you may get better performance by decompressing
+    // less information.
     if let Some(mut segment_postings) =
         inverted_index.read_postings(&term_the, IndexRecordOption::WithFreqsAndPositions)?
     {
@@ -109,11 +109,11 @@ fn main() -> tantivy::Result<()> {
     let inverted_index = segment_reader.inverted_index(title)?;
 
     // This segment posting object is like a cursor over the documents matching the term.
-    // The `IndexRecordOption` arguments tells tantivy we will be interested in both term frequencies
-    // and positions.
+    // The `IndexRecordOption` arguments tells tantivy we will be interested in both term
+    // frequencies and positions.
     //
-    // If you don't need all this information, you may get better performance by decompressing less
-    // information.
+    // If you don't need all this information, you may get better performance by decompressing
+    // less information.
     if let Some(mut block_segment_postings) =
         inverted_index.read_block_postings(&term_the, IndexRecordOption::Basic)?
     {
@@ -28,6 +28,7 @@
 use std::sync::{Arc, RwLock};
 use std::thread;
 use std::time::Duration;
+
 use tantivy::schema::{Schema, STORED, TEXT};
 use tantivy::{doc, Index, IndexWriter, Opstamp, TantivyError};
 
@@ -90,7 +91,8 @@ fn main() -> tantivy::Result<()> {
     // # In the main thread, we commit 10 times, once every 500ms.
     for _ in 0..10 {
         let opstamp: Opstamp = {
-            // Committing or rollbacking on the other hand requires write lock. This will block other threads.
+            // Committing or rollbacking on the other hand requires write lock. This will block
+            // other threads.
             let mut index_writer_wlock = index_writer.write().unwrap();
             index_writer_wlock.commit()?
         };
@@ -57,7 +57,10 @@ fn main() -> tantivy::Result<()> {
         let doc = searcher.doc(doc_address)?;
         let snippet = snippet_generator.snippet_from_doc(&doc);
         println!("Document score {}:", score);
-        println!("title: {}", doc.get_first(title).unwrap().text().unwrap());
+        println!(
+            "title: {}",
+            doc.get_first(title).unwrap().as_text().unwrap()
+        );
         println!("snippet: {}", snippet.to_html());
         println!("custom highlighting: {}", highlight(snippet));
     }
@@ -6,8 +6,10 @@ use tantivy::collector::TopDocs;
 use tantivy::fastfield::FastFieldReader;
 use tantivy::query::QueryParser;
 use tantivy::schema::{Field, Schema, FAST, TEXT};
-use tantivy::{doc, DocAddress, DocId, Index, IndexReader, SegmentReader, TrackedObject};
-use tantivy::{Opstamp, Searcher, SearcherGeneration, SegmentId, Warmer};
+use tantivy::{
+    doc, DocAddress, DocId, Index, IndexReader, Opstamp, Searcher, SearcherGeneration, SegmentId,
+    SegmentReader, Warmer,
+};
 
 // This example shows how warmers can be used to
 // load a values from an external sources using the Warmer API.
@@ -69,7 +71,7 @@ impl Warmer for DynamicPriceColumn {
         Ok(())
     }
 
-    fn garbage_collect(&self, live_generations: &[TrackedObject<SearcherGeneration>]) {
+    fn garbage_collect(&self, live_generations: &[&SearcherGeneration]) {
         let live_segment_id_and_delete_ops: HashSet<(SegmentId, Option<Opstamp>)> =
             live_generations
                 .iter()
@@ -90,7 +92,6 @@ impl Warmer for DynamicPriceColumn {
 /// This map represents a map (ProductId -> Price)
 ///
 /// In practise, it could be fetching things from an external service, like a SQL table.
-///
 #[derive(Default, Clone)]
 pub struct ExternalPriceTable {
     prices: Arc<RwLock<HashMap<ProductId, Price>>>,
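The hunks above only show fragments of the warmer. For orientation, a no-op sketch against the trait shape visible in this diff; the `warm` signature is an assumption, since only `garbage_collect` appears verbatim above:

```rust
use tantivy::{Searcher, SearcherGeneration, Warmer};

/// Does nothing; only illustrates the shape of the `Warmer` trait.
struct NoopWarmer;

impl Warmer for NoopWarmer {
    // Assumed signature: called before a searcher generation is published,
    // giving the warmer a chance to pre-load per-segment data.
    fn warm(&self, _searcher: &Searcher) -> tantivy::Result<()> {
        Ok(())
    }

    // Signature as changed by this diff: live generations are now passed as
    // plain references rather than `TrackedObject<SearcherGeneration>`.
    fn garbage_collect(&self, _live_generations: &[&SearcherGeneration]) {}
}
```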
@@ -4,14 +4,14 @@ extern crate test;
 
 #[cfg(test)]
 mod tests {
-    use fastfield_codecs::{
-        bitpacked::{BitpackedFastFieldReader, BitpackedFastFieldSerializer},
-        linearinterpol::{LinearInterpolFastFieldReader, LinearInterpolFastFieldSerializer},
-        multilinearinterpol::{
-            MultiLinearInterpolFastFieldReader, MultiLinearInterpolFastFieldSerializer,
-        },
-        *,
-    };
+    use fastfield_codecs::bitpacked::{BitpackedFastFieldReader, BitpackedFastFieldSerializer};
+    use fastfield_codecs::linearinterpol::{
+        LinearInterpolFastFieldReader, LinearInterpolFastFieldSerializer,
+    };
+    use fastfield_codecs::multilinearinterpol::{
+        MultiLinearInterpolFastFieldReader, MultiLinearInterpolFastFieldSerializer,
+    };
+    use fastfield_codecs::*;
 
     fn get_data() -> Vec<u64> {
         let mut data: Vec<_> = (100..55000_u64)
@@ -1,13 +1,9 @@
-use crate::FastFieldCodecReader;
-use crate::FastFieldCodecSerializer;
-use crate::FastFieldDataAccess;
-use crate::FastFieldStats;
-use common::BinarySerializable;
 use std::io::{self, Write};
-use tantivy_bitpacker::compute_num_bits;
-use tantivy_bitpacker::BitPacker;
-use tantivy_bitpacker::BitUnpacker;
+
+use common::BinarySerializable;
+use tantivy_bitpacker::{compute_num_bits, BitPacker, BitUnpacker};
+
+use crate::{FastFieldCodecReader, FastFieldCodecSerializer, FastFieldDataAccess, FastFieldStats};
 
 /// Depending on the field type, a different
 /// fast field is required.
@@ -53,7 +53,8 @@ pub trait FastFieldCodecSerializer {
 pub trait FastFieldDataAccess {
     /// Return the value associated to the given position.
     ///
-    /// Whenever possible use the Iterator passed to the fastfield creation instead, for performance reasons.
+    /// Whenever possible use the Iterator passed to the fastfield creation instead, for performance
+    /// reasons.
     ///
     /// # Panics
     ///
@@ -82,12 +83,10 @@ impl FastFieldDataAccess for Vec<u64> {
 #[cfg(test)]
 mod tests {
-    use crate::{
-        bitpacked::{BitpackedFastFieldReader, BitpackedFastFieldSerializer},
-        linearinterpol::{LinearInterpolFastFieldReader, LinearInterpolFastFieldSerializer},
-        multilinearinterpol::{
-            MultiLinearInterpolFastFieldReader, MultiLinearInterpolFastFieldSerializer,
-        },
-    };
+    use crate::bitpacked::{BitpackedFastFieldReader, BitpackedFastFieldSerializer};
+    use crate::linearinterpol::{LinearInterpolFastFieldReader, LinearInterpolFastFieldSerializer};
+    use crate::multilinearinterpol::{
+        MultiLinearInterpolFastFieldReader, MultiLinearInterpolFastFieldSerializer,
+    };
 
     pub fn create_and_validate<S: FastFieldCodecSerializer, R: FastFieldCodecReader>(
@@ -1,15 +1,10 @@
-use crate::FastFieldCodecReader;
-use crate::FastFieldCodecSerializer;
-use crate::FastFieldDataAccess;
-use crate::FastFieldStats;
 use std::io::{self, Read, Write};
 use std::ops::Sub;
-use tantivy_bitpacker::compute_num_bits;
-use tantivy_bitpacker::BitPacker;
-
-use common::BinarySerializable;
-use common::FixedSize;
-use tantivy_bitpacker::BitUnpacker;
+
+use common::{BinarySerializable, FixedSize};
+use tantivy_bitpacker::{compute_num_bits, BitPacker, BitUnpacker};
+
+use crate::{FastFieldCodecReader, FastFieldCodecSerializer, FastFieldDataAccess, FastFieldStats};
 
 /// Depending on the field type, a different
 /// fast field is required.
@@ -137,7 +132,7 @@ impl FastFieldCodecSerializer for LinearInterpolFastFieldSerializer {
                 // will be offset to 0
                 offset = offset.max(calculated_value - actual_value);
             } else {
-                //positive value no offset reuqired
+                // positive value no offset reuqired
                 rel_positive_max = rel_positive_max.max(actual_value - calculated_value);
             }
         }
@@ -171,7 +166,7 @@ impl FastFieldCodecSerializer for LinearInterpolFastFieldSerializer {
         stats: FastFieldStats,
     ) -> bool {
         if stats.num_vals < 3 {
-            return false; //disable compressor for this case
+            return false; // disable compressor for this case
         }
         // On serialisation the offset is added to the actual value.
         // We need to make sure this won't run into overflow calculation issues.
@@ -211,8 +206,8 @@ impl FastFieldCodecSerializer for LinearInterpolFastFieldSerializer {
             .max()
             .unwrap_or(0);
 
-        // the theory would be that we don't have the actual max_distance, but we are close within 50%
-        // threshold.
+        // the theory would be that we don't have the actual max_distance, but we are close within
+        // 50% threshold.
         // It is multiplied by 2 because in a log case scenario the line would be as much above as
         // below. So the offset would = max_distance
         //
@@ -1,10 +1,8 @@
 #[macro_use]
 extern crate prettytable;
-use fastfield_codecs::{
-    linearinterpol::LinearInterpolFastFieldSerializer,
-    multilinearinterpol::MultiLinearInterpolFastFieldSerializer, FastFieldCodecSerializer,
-    FastFieldStats,
-};
+use fastfield_codecs::linearinterpol::LinearInterpolFastFieldSerializer;
+use fastfield_codecs::multilinearinterpol::MultiLinearInterpolFastFieldSerializer;
+use fastfield_codecs::{FastFieldCodecSerializer, FastFieldStats};
 use prettytable::{Cell, Row, Table};
 
 fn main() {
@@ -24,7 +22,7 @@ fn main() {
         );
         results.push(res);
 
-        //let best_estimation_codec = results
+        // let best_estimation_codec = results
         //.iter()
         //.min_by(|res1, res2| res1.partial_cmp(&res2).unwrap())
         //.unwrap();
@@ -41,7 +39,6 @@ fn main() {
         } else {
             (est.to_string(), comp.to_string())
         };
-        #[allow(clippy::all)]
         let style = if comp == best_compression_ratio_codec.1 {
             "Fb"
         } else {
@@ -49,7 +46,7 @@ fn main() {
         };
 
         table.add_row(Row::new(vec![
-            Cell::new(&name.to_string()).style_spec("bFg"),
+            Cell::new(name).style_spec("bFg"),
             Cell::new(&ratio_cell).style_spec(style),
             Cell::new(&est_cell).style_spec(""),
         ]));
@@ -73,7 +70,7 @@ pub fn get_codec_test_data_sets() -> Vec<(Vec<u64>, &'static str)> {
         current_cumulative
     })
     .collect::<Vec<_>>();
-    //let data = (1..=200000_u64).map(|num| num + num).collect::<Vec<_>>();
+    // let data = (1..=200000_u64).map(|num| num + num).collect::<Vec<_>>();
     data_and_names.push((data, "Monotonically increasing concave"));
 
     let mut current_cumulative = 0;
@@ -1,30 +1,22 @@
-/*!
+//! MultiLinearInterpol compressor uses linear interpolation to guess a values and stores the
+//! offset, but in blocks of 512.
+//!
+//! With a CHUNK_SIZE of 512 and 29 byte metadata per block, we get a overhead for metadata of 232 /
+//! 512 = 0,45 bits per element. The additional space required per element in a block is the the
+//! maximum deviation of the linear interpolation estimation function.
+//!
+//! E.g. if the maximum deviation of an element is 12, all elements cost 4bits.
+//!
+//! Size per block:
+//! Num Elements * Maximum Deviation from Interpolation + 29 Byte Metadata
-
-MultiLinearInterpol compressor uses linear interpolation to guess a values and stores the offset, but in blocks of 512.
-
-With a CHUNK_SIZE of 512 and 29 byte metadata per block, we get a overhead for metadata of 232 / 512 = 0,45 bits per element.
-The additional space required per element in a block is the the maximum deviation of the linear interpolation estimation function.
-
-E.g. if the maximum deviation of an element is 12, all elements cost 4bits.
-
-Size per block:
-Num Elements * Maximum Deviation from Interpolation + 29 Byte Metadata
-
-*/
-
-use crate::FastFieldCodecReader;
-use crate::FastFieldCodecSerializer;
-use crate::FastFieldDataAccess;
-use crate::FastFieldStats;
-use common::CountingWriter;
 use std::io::{self, Read, Write};
 use std::ops::Sub;
-use tantivy_bitpacker::compute_num_bits;
-use tantivy_bitpacker::BitPacker;
-
-use common::BinarySerializable;
-use common::DeserializeFrom;
-use tantivy_bitpacker::BitUnpacker;
+
+use common::{BinarySerializable, CountingWriter, DeserializeFrom};
+use tantivy_bitpacker::{compute_num_bits, BitPacker, BitUnpacker};
+
+use crate::{FastFieldCodecReader, FastFieldCodecSerializer, FastFieldDataAccess, FastFieldStats};
 
 const CHUNK_SIZE: u64 = 512;
@@ -252,11 +244,11 @@ impl FastFieldCodecSerializer for MultiLinearInterpolFastFieldSerializer {
             );
             if calculated_value > actual_value {
                 // negative value we need to apply an offset
-                // we ignore negative values in the max value calculation, because negative values
-                // will be offset to 0
+                // we ignore negative values in the max value calculation, because negative
+                // values will be offset to 0
                 offset = offset.max(calculated_value - actual_value);
             } else {
-                //positive value no offset reuqired
+                // positive value no offset reuqired
                 rel_positive_max = rel_positive_max.max(actual_value - calculated_value);
             }
         }
@@ -350,8 +342,8 @@ impl FastFieldCodecSerializer for MultiLinearInterpolFastFieldSerializer {
             .unwrap();
 
         // Estimate one block and extrapolate the cost to all blocks.
-        // the theory would be that we don't have the actual max_distance, but we are close within 50%
-        // threshold.
+        // the theory would be that we don't have the actual max_distance, but we are close within
+        // 50% threshold.
         // It is multiplied by 2 because in a log case scenario the line would be as much above as
         // below. So the offset would = max_distance
         //
@@ -1,11 +1,9 @@
 #![allow(clippy::return_self_not_must_use)]
 
-use stable_deref_trait::StableDeref;
 use std::convert::TryInto;
-use std::mem;
 use std::ops::{Deref, Range};
 use std::sync::Arc;
-use std::{fmt, io};
+use std::{fmt, io, mem};
+
+use stable_deref_trait::StableDeref;
 
 /// An OwnedBytes simply wraps an object that owns a slice of data and exposes
 /// this data as a static slice.
@@ -102,7 +100,6 @@ impl OwnedBytes {
     }
 
     /// Drops the left most `advance_len` bytes.
-    ///
     #[inline]
     pub fn advance(&mut self, advance_len: usize) {
         self.data = &self.data[advance_len..]
@@ -163,8 +160,7 @@ impl PartialEq<str> for OwnedBytes {
 }
 
 impl<'a, T: ?Sized> PartialEq<&'a T> for OwnedBytes
-where
-    OwnedBytes: PartialEq<T>,
+where OwnedBytes: PartialEq<T>
 {
     fn eq(&self, other: &&'a T) -> bool {
         *self == **other
@@ -5,9 +5,8 @@ authors = ["Paul Masurel <paul.masurel@gmail.com>"]
 license = "MIT"
 categories = ["database-implementations", "data-structures"]
 description = """Search engine library"""
-documentation = "https://quickwit-inc.github.io/tantivy/tantivy/index.html"
-homepage = "https://github.com/quickwit-inc/tantivy"
-repository = "https://github.com/quickwit-inc/tantivy"
+homepage = "https://github.com/quickwit-oss/tantivy"
+repository = "https://github.com/quickwit-oss/tantivy"
 readme = "README.md"
 keywords = ["search", "information", "retrieval"]
 edition = "2018"
@@ -1,17 +1,20 @@
-use super::user_input_ast::{UserInputAst, UserInputBound, UserInputLeaf, UserInputLiteral};
-use crate::Occur;
+use combine::error::StringStreamError;
 use combine::parser::char::{char, digit, space, spaces, string};
+use combine::parser::combinator::recognize;
 use combine::parser::range::{take_while, take_while1};
 use combine::parser::repeat::escaped;
 use combine::parser::Parser;
 use combine::{
     attempt, choice, eof, many, many1, one_of, optional, parser, satisfy, skip_many1, value,
 };
-use combine::{error::StringStreamError, parser::combinator::recognize};
 use once_cell::sync::Lazy;
 use regex::Regex;
 
-// Note: '-' char is only forbidden at the beginning of a field name, would be clearer to add it to special characters.
+use super::user_input_ast::{UserInputAst, UserInputBound, UserInputLeaf, UserInputLiteral};
+use crate::Occur;
+
+// Note: '-' char is only forbidden at the beginning of a field name, would be clearer to add it to
+// special characters.
 const SPECIAL_CHARS: &[char] = &[
     '+', '^', '`', ':', '{', '}', '"', '[', ']', '(', ')', '~', '!', '\\', '*', ' ',
 ];
@@ -363,9 +366,10 @@ mod test {
     type TestParseResult = Result<(), StringStreamError>;
 
-    use super::*;
     use combine::parser::Parser;
 
+    use super::*;
+
     pub fn nearly_equals(a: f64, b: f64) -> bool {
         (a - b).abs() < 0.0005 * (a + b).abs()
     }
@@ -1 +1,7 @@
-use_try_shorthand = true
+comment_width = 120
+format_strings = true
+group_imports = "StdExternalCrate"
+imports_granularity = "Module"
+normalize_comments = true
+where_single_line = true
+wrap_comments = true
@@ -1,9 +1,6 @@
 use super::Collector;
 use crate::collector::SegmentCollector;
-use crate::DocId;
-use crate::Score;
-use crate::SegmentOrdinal;
-use crate::SegmentReader;
+use crate::{DocId, Score, SegmentOrdinal, SegmentReader};
 
 /// `CountCollector` collector only counts how many
 /// documents match the query.
@@ -80,8 +77,7 @@ impl SegmentCollector for SegmentCountCollector {
 #[cfg(test)]
 mod tests {
     use super::{Count, SegmentCountCollector};
-    use crate::collector::Collector;
-    use crate::collector::SegmentCollector;
+    use crate::collector::{Collector, SegmentCollector};
 
     #[test]
     fn test_count_collect_does_not_requires_scoring() {
@@ -8,8 +8,7 @@ pub(crate) struct CustomScoreTopCollector<TCustomScorer, TScore = Score> {
 }
 
 impl<TCustomScorer, TScore> CustomScoreTopCollector<TCustomScorer, TScore>
-where
-    TScore: Clone + PartialOrd,
+where TScore: Clone + PartialOrd
 {
     pub(crate) fn new(
         custom_scorer: TCustomScorer,
@@ -114,8 +113,7 @@ where
 }
 
 impl<F, TScore> CustomSegmentScorer<TScore> for F
-where
-    F: 'static + FnMut(DocId) -> TScore,
+where F: 'static + FnMut(DocId) -> TScore
 {
     fn score(&mut self, doc: DocId) -> TScore {
         (self)(doc)
@@ -1,8 +1,7 @@
 use std::collections::HashSet;
 
-use crate::{DocAddress, DocId, Score};
-
 use super::{Collector, SegmentCollector};
+use crate::{DocAddress, DocId, Score};
 
 /// Collectors that returns the set of DocAddress that matches the query.
 ///
@@ -1,21 +1,14 @@
-use crate::collector::Collector;
-use crate::collector::SegmentCollector;
-use crate::fastfield::FacetReader;
-use crate::schema::Facet;
-use crate::schema::Field;
-use crate::DocId;
-use crate::Score;
-use crate::SegmentOrdinal;
-use crate::SegmentReader;
 use std::cmp::Ordering;
-use std::collections::btree_map;
-use std::collections::BTreeMap;
-use std::collections::BTreeSet;
-use std::collections::BinaryHeap;
+use std::collections::{btree_map, BTreeMap, BTreeSet, BinaryHeap};
 use std::iter::Peekable;
 use std::ops::Bound;
 use std::{u64, usize};
 
+use crate::collector::{Collector, SegmentCollector};
+use crate::fastfield::FacetReader;
+use crate::schema::{Facet, Field};
+use crate::{DocId, Score, SegmentOrdinal, SegmentReader};
+
 struct Hit<'a> {
     count: u64,
     facet: &'a Facet,
@@ -240,9 +233,7 @@ impl FacetCollector {
     /// If you need the correct number of unique documents for two such facets,
     /// just add them in separate `FacetCollector`.
     pub fn add_facet<T>(&mut self, facet_from: T)
-    where
-        Facet: From<T>,
-    {
+    where Facet: From<T> {
         let facet = Facet::from(facet_from);
         for old_facet in &self.facets {
             assert!(
@@ -402,9 +393,7 @@ impl FacetCounts {
     /// Returns an iterator over all of the facet count pairs inside this result.
     /// See the documentation for [FacetCollector] for a usage example.
     pub fn get<T>(&self, facet_from: T) -> FacetChildIterator<'_>
-    where
-        Facet: From<T>,
-    {
+    where Facet: From<T> {
         let facet = Facet::from(facet_from);
         let left_bound = Bound::Excluded(facet.clone());
         let right_bound = if facet.is_root() {
@@ -423,9 +412,7 @@ impl FacetCounts {
     /// Returns a vector of top `k` facets with their counts, sorted highest-to-lowest by counts.
     /// See the documentation for [FacetCollector] for a usage example.
     pub fn top_k<T>(&self, facet: T, k: usize) -> Vec<(&Facet, u64)>
-    where
-        Facet: From<T>,
-    {
+    where Facet: From<T> {
         let mut heap = BinaryHeap::with_capacity(k);
         let mut it = self.get(facet);
@@ -458,16 +445,18 @@ impl FacetCounts {
 #[cfg(test)]
 mod tests {
+    use std::iter;
+
+    use rand::distributions::Uniform;
+    use rand::prelude::SliceRandom;
+    use rand::{thread_rng, Rng};
+
     use super::{FacetCollector, FacetCounts};
     use crate::collector::Count;
     use crate::core::Index;
     use crate::query::{AllQuery, QueryParser, TermQuery};
     use crate::schema::{Document, Facet, FacetOptions, Field, IndexRecordOption, Schema};
     use crate::Term;
-    use rand::distributions::Uniform;
-    use rand::prelude::SliceRandom;
-    use rand::{thread_rng, Rng};
-    use std::iter;
 
     #[test]
     fn test_facet_collector_drilldown() -> crate::Result<()> {
@@ -522,8 +511,9 @@ mod tests {
     }
 
     #[test]
-    #[should_panic(expected = "Tried to add a facet which is a descendant of \
-                               an already added facet.")]
+    #[should_panic(
+        expected = "Tried to add a facet which is a descendant of an already added facet."
+    )]
     fn test_misused_facet_collector() {
         let mut facet_collector = FacetCollector::for_field(Field::from_field_id(0));
         facet_collector.add_facet(Facet::from("/country"));
@@ -700,13 +690,14 @@ mod tests {
 #[cfg(all(test, feature = "unstable"))]
 mod bench {
+    use rand::seq::SliceRandom;
+    use rand::thread_rng;
+    use test::Bencher;
+
     use crate::collector::FacetCollector;
     use crate::query::AllQuery;
     use crate::schema::{Facet, Schema, INDEXED};
     use crate::Index;
-    use rand::seq::SliceRandom;
-    use rand::thread_rng;
-    use test::Bencher;
 
     #[bench]
     fn bench_facet_collector(b: &mut Bencher) {
@@ -17,7 +17,8 @@ use crate::schema::Field;
 use crate::{Score, SegmentReader, TantivyError};
 
 /// The `FilterCollector` filters docs using a fast field value and a predicate.
-/// Only the documents for which the predicate returned "true" will be passed on to the next collector.
+/// Only the documents for which the predicate returned "true" will be passed on to the next
+/// collector.
 ///
 /// ```rust
 /// use tantivy::collector::{TopDocs, FilterCollector};
@@ -58,8 +59,7 @@ use crate::{Score, SegmentReader, TantivyError};
 /// # }
 /// ```
 pub struct FilterCollector<TCollector, TPredicate, TPredicateValue: FastValue>
-where
-    TPredicate: 'static + Clone,
+where TPredicate: 'static + Clone
 {
     field: Field,
     collector: TCollector,
@@ -1,8 +1,9 @@
+use fastdivide::DividerU64;
+
 use crate::collector::{Collector, SegmentCollector};
 use crate::fastfield::{DynamicFastFieldReader, FastFieldReader, FastValue};
 use crate::schema::{Field, Type};
 use crate::{DocId, Score};
-use fastdivide::DividerU64;
 
 /// Histogram builds an histogram of the values of a fastfield for the
 /// collected DocSet.
@@ -36,8 +37,8 @@ impl HistogramCollector {
     /// - `bucket_width`: the length of the interval that is associated to each buckets.
     /// - `num_buckets`: The overall number of buckets.
     ///
-    /// Together, this parameters define a partition of `[min_value, min_value + num_buckets * bucket_width)`
-    /// into `num_buckets` intervals of width bucket that we call `bucket`.
+    /// Together, this parameters define a partition of `[min_value, min_value + num_buckets *
+    /// bucket_width)` into `num_buckets` intervals of width bucket that we call `bucket`.
     ///
     /// # Disclaimer
     /// This function panics if the field given is of type f64.
@@ -147,12 +148,13 @@ fn add_vecs(mut vals_list: Vec<Vec<u64>>, len: usize) -> Vec<u64> {
 #[cfg(test)]
 mod tests {
+    use fastdivide::DividerU64;
+    use query::AllQuery;
+
     use super::{add_vecs, HistogramCollector, HistogramComputer};
     use crate::chrono::{TimeZone, Utc};
     use crate::schema::{Schema, FAST};
     use crate::{doc, query, Index};
-    use fastdivide::DividerU64;
-    use query::AllQuery;
 
     #[test]
     fn test_add_histograms_simple() {
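As a side note on the doc comment rewrapped above, here is a small sketch of the bucketing rule it describes, using the same `fastdivide` crate the hunk imports; the collector's internals beyond this arithmetic are not shown in the diff:

```rust
use fastdivide::DividerU64;

fn main() {
    let (min_value, bucket_width, num_buckets) = (100u64, 10u64, 5usize);
    // Precompute the division by `bucket_width`, as the collector does.
    let divider = DividerU64::divide_by(bucket_width);
    // A value lands in bucket (val - min_value) / bucket_width, provided it
    // falls inside [min_value, min_value + num_buckets * bucket_width).
    for val in [95u64, 100, 137, 149, 150] {
        if val >= min_value {
            let bucket = divider.divide(val - min_value) as usize;
            if bucket < num_buckets {
                println!("value {val} -> bucket {bucket}");
            }
        }
    }
}
```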
@@ -1,95 +1,90 @@
|
||||
/*!
|
||||
//! # Collectors
|
||||
//!
|
||||
//! Collectors define the information you want to extract from the documents matching the queries.
|
||||
//! In tantivy jargon, we call this information your search "fruit".
|
||||
//!
|
||||
//! Your fruit could for instance be :
|
||||
//! - [the count of matching documents](./struct.Count.html)
|
||||
//! - [the top 10 documents, by relevancy or by a fast field](./struct.TopDocs.html)
|
||||
//! - [facet counts](./struct.FacetCollector.html)
|
||||
//!
|
||||
//! At one point in your code, you will trigger the actual search operation by calling
|
||||
//! [the `search(...)` method of your `Searcher` object](../struct.Searcher.html#method.search).
|
||||
//! This call will look like this.
|
||||
//!
|
||||
//! ```verbatim
|
||||
//! let fruit = searcher.search(&query, &collector)?;
|
||||
//! ```
|
||||
//!
|
||||
//! Here the type of fruit is actually determined as an associated type of the collector
|
||||
//! (`Collector::Fruit`).
|
||||
//!
|
||||
//!
|
||||
//! # Combining several collectors
|
||||
//!
|
||||
//! A rich search experience often requires to run several collectors on your search query.
|
||||
//! For instance,
|
||||
//! - selecting the top-K products matching your query
|
||||
//! - counting the matching documents
|
||||
//! - computing several facets
|
||||
//! - computing statistics about the matching product prices
|
||||
//!
|
||||
//! A simple and efficient way to do that is to pass your collectors as one tuple.
|
||||
//! The resulting `Fruit` will then be a typed tuple with each collector's original fruits
|
||||
//! in their respective position.
|
||||
//!
|
||||
//! ```rust
|
||||
//! # use tantivy::schema::*;
|
||||
//! # use tantivy::*;
|
||||
//! # use tantivy::query::*;
|
||||
//! use tantivy::collector::{Count, TopDocs};
|
||||
//! #
|
||||
//! # fn main() -> tantivy::Result<()> {
|
||||
//! # let mut schema_builder = Schema::builder();
|
||||
//! # let title = schema_builder.add_text_field("title", TEXT);
|
||||
//! # let schema = schema_builder.build();
|
||||
//! # let index = Index::create_in_ram(schema);
|
||||
//! # let mut index_writer = index.writer(3_000_000)?;
|
||||
//! # index_writer.add_document(doc!(
|
||||
//! # title => "The Name of the Wind",
|
||||
//! # ))?;
|
||||
//! # index_writer.add_document(doc!(
|
||||
//! # title => "The Diary of Muadib",
|
||||
//! # ))?;
|
||||
//! # index_writer.commit()?;
|
||||
//! # let reader = index.reader()?;
|
||||
//! # let searcher = reader.searcher();
|
||||
//! # let query_parser = QueryParser::for_index(&index, vec![title]);
|
||||
//! # let query = query_parser.parse_query("diary")?;
|
||||
//! let (doc_count, top_docs): (usize, Vec<(Score, DocAddress)>) =
|
||||
//! searcher.search(&query, &(Count, TopDocs::with_limit(2)))?;
|
||||
//! # Ok(())
|
||||
//! # }
|
||||
//! ```
|
||||
//!
|
||||
//! The `Collector` trait is implemented for up to 4 collectors.
|
||||
//! If you have more than 4 collectors, you can either group them into
|
||||
//! tuples of tuples `(a,(b,(c,d)))`, or rely on [`MultiCollector`](./struct.MultiCollector.html).
|
||||
//!
|
||||
//! # Combining several collectors dynamically
|
||||
//!
|
||||
//! Combining collectors into a tuple is a zero-cost abstraction: everything
|
||||
//! happens as if you had manually implemented a single collector
|
||||
//! combining all of our features.
|
||||
//!
|
||||
//! Unfortunately it requires you to know at compile time your collector types.
|
||||
//! If on the other hand, the collectors depend on some query parameter,
|
||||
//! you can rely on `MultiCollector`'s.
|
||||
//!
|
||||
//!
|
||||
//! # Implementing your own collectors.
|
||||
//!
|
||||
//! See the `custom_collector` example.
|
||||
|
||||
# Collectors

Collectors define the information you want to extract from the documents matching the queries.
In tantivy jargon, we call this information your search "fruit".

Your fruit could, for instance, be:
- [the count of matching documents](./struct.Count.html)
- [the top 10 documents, by relevancy or by a fast field](./struct.TopDocs.html)
- [facet counts](./struct.FacetCollector.html)

At some point in your code, you will trigger the actual search operation by calling
[the `search(...)` method of your `Searcher` object](../struct.Searcher.html#method.search).
This call will look like this:

```verbatim
let fruit = searcher.search(&query, &collector)?;
```

Here, the type of the fruit is determined by an associated type of the collector (`Collector::Fruit`).


# Combining several collectors

A rich search experience often requires running several collectors on your search query.
For instance,
- selecting the top-K products matching your query
- counting the matching documents
- computing several facets
- computing statistics about the matching product prices

A simple and efficient way to do that is to pass your collectors as one tuple.
The resulting `Fruit` will then be a typed tuple with each collector's fruit
in its respective position.

```rust
# use tantivy::schema::*;
# use tantivy::*;
# use tantivy::query::*;
use tantivy::collector::{Count, TopDocs};
#
# fn main() -> tantivy::Result<()> {
# let mut schema_builder = Schema::builder();
# let title = schema_builder.add_text_field("title", TEXT);
# let schema = schema_builder.build();
# let index = Index::create_in_ram(schema);
# let mut index_writer = index.writer(3_000_000)?;
# index_writer.add_document(doc!(
#     title => "The Name of the Wind",
# ))?;
# index_writer.add_document(doc!(
#     title => "The Diary of Muadib",
# ))?;
# index_writer.commit()?;
# let reader = index.reader()?;
# let searcher = reader.searcher();
# let query_parser = QueryParser::for_index(&index, vec![title]);
# let query = query_parser.parse_query("diary")?;
let (doc_count, top_docs): (usize, Vec<(Score, DocAddress)>) =
    searcher.search(&query, &(Count, TopDocs::with_limit(2)))?;
# Ok(())
# }
```

The `Collector` trait is implemented for up to 4 collectors.
If you have more than 4 collectors, you can either group them into
tuples of tuples `(a,(b,(c,d)))`, or rely on [`MultiCollector`](./struct.MultiCollector.html).

# Combining several collectors dynamically

Combining collectors into a tuple is a zero-cost abstraction: everything
happens as if you had manually implemented a single collector
combining all of their features.

Unfortunately, it requires you to know your collector types at compile time.
If, on the other hand, the collectors depend on some query parameter,
you can rely on `MultiCollector`.
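
To make that concrete, here is a minimal sketch of the dynamic flow, reusing the
`searcher` and `query` values from the example above; treat it as an illustration and
defer to `MultiCollector`'s own documentation for the authoritative API:

```rust
use tantivy::collector::{Count, MultiCollector, TopDocs};

let mut collectors = MultiCollector::new();
// Each `add_collector` call hands back a typed handle into the shared fruit.
let top_docs_handle = collectors.add_collector(TopDocs::with_limit(2));
let count_handle = collectors.add_collector(Count);

let mut multi_fruit = searcher.search(&query, &collectors)?;
// Each handle extracts its own typed fruit back out of the `MultiFruit`.
let top_docs = top_docs_handle.extract(&mut multi_fruit);
let count = count_handle.extract(&mut multi_fruit);
```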

# Implementing your own collectors

See the `custom_collector` example.
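
For orientation, the sketch below shows the rough shape of a hand-rolled collector: a
simplified document counter written against the `Collector`/`SegmentCollector` trait pair
as they appear in this module. It is an illustration under those assumed signatures, not a
drop-in replacement for the built-in `Count`:

```rust
use tantivy::collector::{Collector, SegmentCollector};
use tantivy::{DocId, Score, SegmentOrdinal, SegmentReader};

/// Counts matching documents, with one `SegmentCollector` per segment.
pub struct SimpleCount;

impl Collector for SimpleCount {
    // The whole-index result...
    type Fruit = usize;
    // ...and the per-segment worker producing partial results.
    type Child = SimpleSegmentCount;

    fn for_segment(
        &self,
        _segment_local_id: SegmentOrdinal,
        _segment: &SegmentReader,
    ) -> tantivy::Result<SimpleSegmentCount> {
        Ok(SimpleSegmentCount(0))
    }

    fn requires_scoring(&self) -> bool {
        false
    }

    fn merge_fruits(&self, segment_counts: Vec<usize>) -> tantivy::Result<usize> {
        Ok(segment_counts.into_iter().sum())
    }
}

pub struct SimpleSegmentCount(usize);

impl SegmentCollector for SimpleSegmentCount {
    type Fruit = usize;

    fn collect(&mut self, _doc: DocId, _score: Score) {
        self.0 += 1;
    }

    fn harvest(self) -> usize {
        self.0
    }
}
```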

*/

use crate::DocId;
use crate::Score;
use crate::SegmentOrdinal;
use crate::SegmentReader;
use downcast_rs::impl_downcast;

use crate::{DocId, Score, SegmentOrdinal, SegmentReader};

mod count_collector;
pub use self::count_collector::Count;

@@ -111,8 +106,7 @@ mod tweak_score_top_collector;
pub use self::tweak_score_top_collector::{ScoreSegmentTweaker, ScoreTweaker};

mod facet_collector;
pub use self::facet_collector::FacetCollector;
pub use self::facet_collector::FacetCounts;
pub use self::facet_collector::{FacetCollector, FacetCounts};
use crate::query::Weight;

mod docset_collector;

@@ -1,14 +1,10 @@
use super::Collector;
use super::SegmentCollector;
use crate::collector::Fruit;
use crate::DocId;
use crate::Score;
use crate::SegmentOrdinal;
use crate::SegmentReader;
use crate::TantivyError;
use std::marker::PhantomData;
use std::ops::Deref;

use super::{Collector, SegmentCollector};
use crate::collector::Fruit;
use crate::{DocId, Score, SegmentOrdinal, SegmentReader, TantivyError};

pub struct MultiFruit {
    sub_fruits: Vec<Option<Box<dyn Fruit>>>,
}
@@ -104,7 +100,8 @@ impl<TFruit: Fruit> FruitHandle<TFruit> {
///
/// If the type of the collectors is known, you can just group your collectors
/// in a tuple. See the
/// [Combining several collectors section of the collector documentation](./index.html#combining-several-collectors).
/// [Combining several collectors section of the collector
/// documentation](./index.html#combining-several-collectors).
///
/// ```rust
/// use tantivy::collector::{Count, TopDocs, MultiCollector};
@@ -248,10 +245,8 @@ mod tests {
    use super::*;
    use crate::collector::{Count, TopDocs};
    use crate::query::TermQuery;
    use crate::schema::IndexRecordOption;
    use crate::schema::{Schema, TEXT};
    use crate::Index;
    use crate::Term;
    use crate::schema::{IndexRecordOption, Schema, TEXT};
    use crate::{Index, Term};

    #[test]
    fn test_multi_collector() -> crate::Result<()> {

@@ -1,21 +1,13 @@
use super::*;
use crate::core::SegmentReader;
use crate::fastfield::BytesFastFieldReader;
use crate::fastfield::DynamicFastFieldReader;
use crate::fastfield::FastFieldReader;
use crate::schema::Field;
use crate::DocId;
use crate::Score;
use crate::SegmentOrdinal;
use crate::{DocAddress, Document, Searcher};

use crate::collector::{Count, FilterCollector, TopDocs};
use crate::query::{AllQuery, QueryParser};
use crate::schema::{Schema, FAST, TEXT};
use crate::DateTime;
use crate::{doc, Index};
use std::str::FromStr;

use super::*;
use crate::collector::{Count, FilterCollector, TopDocs};
use crate::core::SegmentReader;
use crate::fastfield::{BytesFastFieldReader, DynamicFastFieldReader, FastFieldReader};
use crate::query::{AllQuery, QueryParser};
use crate::schema::{Field, Schema, FAST, TEXT};
use crate::{doc, DateTime, DocAddress, DocId, Document, Index, Score, Searcher, SegmentOrdinal};

pub const TEST_COLLECTOR_WITH_SCORE: TestCollector = TestCollector {
    compute_score: true,
};

@@ -1,11 +1,9 @@
use crate::DocAddress;
use crate::DocId;
use crate::SegmentOrdinal;
use crate::SegmentReader;
use std::cmp::Ordering;
use std::collections::BinaryHeap;
use std::marker::PhantomData;

use crate::{DocAddress, DocId, SegmentOrdinal, SegmentReader};

/// Contains a feature (field, score, etc.) of a document along with the document address.
///
/// It has a custom implementation of `PartialOrd` that reverses the order. This is because the
@@ -62,8 +60,7 @@ pub(crate) struct TopCollector<T> {
}

impl<T> TopCollector<T>
where
    T: PartialOrd + Clone,
where T: PartialOrd + Clone
{
    /// Creates a top collector, with a number of documents equal to "limit".
    ///
@@ -253,7 +250,7 @@ mod tests {
        // when harvesting we have to guarantee stable sorting in case of a tie
        // on the score
        let doc_ids_collection = [4, 5, 6];
        let score = 3.14;
        let score = 3.3f32;

        let mut top_collector_limit_2 = TopSegmentCollector::new(0, 2);
        for id in &doc_ids_collection {

@@ -322,9 +319,10 @@ mod tests {

#[cfg(all(test, feature = "unstable"))]
mod bench {
    use super::TopSegmentCollector;
    use test::Bencher;

    use super::TopSegmentCollector;

    #[bench]
    fn bench_top_segment_collector_collect_not_at_capacity(b: &mut Bencher) {
        let mut top_collector = TopSegmentCollector::new(0, 400);

@@ -1,21 +1,18 @@
use std::collections::BinaryHeap;
use std::fmt;
use std::marker::PhantomData;

use super::Collector;
use crate::collector::top_collector::{ComparableDoc, TopCollector};
use crate::collector::custom_score_top_collector::CustomScoreTopCollector;
use crate::collector::top_collector::{ComparableDoc, TopCollector, TopSegmentCollector};
use crate::collector::tweak_score_top_collector::TweakedScoreTopCollector;
use crate::collector::{
    CustomScorer, CustomSegmentScorer, ScoreSegmentTweaker, ScoreTweaker, SegmentCollector,
};
use crate::fastfield::{DynamicFastFieldReader, FastFieldReader};
use crate::fastfield::{DynamicFastFieldReader, FastFieldReader, FastValue};
use crate::query::Weight;
use crate::schema::Field;
use crate::DocAddress;
use crate::DocId;
use crate::Score;
use crate::SegmentOrdinal;
use crate::SegmentReader;
use crate::{collector::custom_score_top_collector::CustomScoreTopCollector, fastfield::FastValue};
use crate::{collector::top_collector::TopSegmentCollector, TantivyError};
use std::fmt;
use std::{collections::BinaryHeap, marker::PhantomData};
use crate::{DocAddress, DocId, Score, SegmentOrdinal, SegmentReader, TantivyError};

struct FastFieldConvertCollector<
    TCollector: Collector<Fruit = Vec<(u64, DocAddress)>>,
@@ -217,11 +214,12 @@ impl TopDocs {

    /// Set top-K to rank documents by a given fast field.
    ///
    /// If the field is not a fast field or does not exist, this method returns successfully (it is not aware of any schema).
    /// An error will be returned at the moment of search.
    /// If the field is not a fast field or does not exist, this method returns successfully (it
    /// is not aware of any schema). An error will be returned at the moment of search.
    ///
    /// If the field is a FAST field but not a u64 field, search will return successfully but it will return
    /// a monotonic u64 representation (i.e. the order is still correct) of the requested field type.
    /// If the field is a FAST field but not a u64 field, search will return successfully but it
    /// will return a monotonic u64 representation (i.e. the order is still correct) of
    /// the requested field type.
    ///
    /// # Example
    ///
@@ -296,14 +294,15 @@ impl TopDocs {

    /// Set top-K to rank documents by a given fast field.
    ///
    /// If the field is not a fast field, or its field type does not match the generic type, this method does not panic,
    /// but an explicit error will be returned at the moment of collection.
    /// If the field is not a fast field, or its field type does not match the generic type, this
    /// method does not panic, but an explicit error will be returned at the moment of
    /// collection.
    ///
    /// Note that this method is generic. The requested fast field type will often be
    /// inferred in your code by the rust compiler.
    ///
    /// Implementation-wise, for performance reasons, tantivy will manipulate the u64 representation of your fast
    /// field until the last moment.
    /// Implementation-wise, for performance reasons, tantivy will manipulate the u64
    /// representation of your fast field until the last moment.
    ///
    /// # Example
    ///
@@ -715,10 +714,7 @@ mod tests {
    use crate::collector::Collector;
    use crate::query::{AllQuery, Query, QueryParser};
    use crate::schema::{Field, Schema, FAST, STORED, TEXT};
    use crate::Index;
    use crate::IndexWriter;
    use crate::Score;
    use crate::{DocAddress, DocId, SegmentReader};
    use crate::{DocAddress, DocId, Index, IndexWriter, Score, SegmentReader};

    fn make_index() -> crate::Result<Index> {
        let mut schema_builder = Schema::builder();

@@ -1,7 +1,6 @@
use crate::collector::top_collector::{TopCollector, TopSegmentCollector};
use crate::collector::{Collector, SegmentCollector};
use crate::DocAddress;
use crate::{DocId, Result, Score, SegmentReader};
use crate::{DocAddress, DocId, Result, Score, SegmentReader};

pub(crate) struct TweakedScoreTopCollector<TScoreTweaker, TScore = Score> {
    score_tweaker: TScoreTweaker,
@@ -9,8 +8,7 @@ pub(crate) struct TweakedScoreTopCollector<TScoreTweaker, TScore = Score> {
}

impl<TScoreTweaker, TScore> TweakedScoreTopCollector<TScoreTweaker, TScore>
where
    TScore: Clone + PartialOrd,
where TScore: Clone + PartialOrd
{
    pub fn new(
        score_tweaker: TScoreTweaker,
@@ -118,8 +116,7 @@ where
}

impl<F, TScore> ScoreSegmentTweaker<TScore> for F
where
    F: 'static + FnMut(DocId, Score) -> TScore,
where F: 'static + FnMut(DocId, Score) -> TScore
{
    fn score(&mut self, doc: DocId, score: Score) -> TScore {
        (self)(doc, score)

@@ -57,7 +57,11 @@ impl Executor {
            let (idx, arg) = arg_with_idx;
            let fruit = f(arg);
            if let Err(err) = fruit_sender.send((idx, fruit)) {
                error!("Failed to send search task. It probably means all search threads have panicked. {:?}", err);
                error!(
                    "Failed to send search task. It probably means all search \
                     threads have panicked. {:?}",
                    err
                );
            }
        });
    }

@@ -1,35 +1,27 @@
use super::{segment::Segment, IndexSettings};
use crate::core::Executor;
use crate::core::IndexMeta;
use crate::core::SegmentId;
use crate::core::SegmentMeta;
use crate::core::SegmentMetaInventory;
use crate::core::META_FILEPATH;
use crate::directory::error::OpenReadError;
use crate::directory::ManagedDirectory;
#[cfg(feature = "mmap")]
use crate::directory::MmapDirectory;
use crate::directory::INDEX_WRITER_LOCK;
use crate::directory::{Directory, RamDirectory};
use crate::error::DataCorruption;
use crate::error::TantivyError;
use crate::indexer::index_writer::{HEAP_SIZE_MIN, MAX_NUM_THREAD};
use crate::indexer::segment_updater::save_new_metas;
use crate::reader::IndexReader;
use crate::reader::IndexReaderBuilder;
use crate::schema::Field;
use crate::schema::FieldType;
use crate::schema::Schema;
use crate::tokenizer::{TextAnalyzer, TokenizerManager};
use crate::IndexWriter;
use std::collections::HashSet;
use std::fmt;

#[cfg(feature = "mmap")]
use std::path::Path;
use std::path::PathBuf;
use std::sync::Arc;

use super::segment::Segment;
use super::IndexSettings;
use crate::core::{
    Executor, IndexMeta, SegmentId, SegmentMeta, SegmentMetaInventory, META_FILEPATH,
};
use crate::directory::error::OpenReadError;
#[cfg(feature = "mmap")]
use crate::directory::MmapDirectory;
use crate::directory::{Directory, ManagedDirectory, RamDirectory, INDEX_WRITER_LOCK};
use crate::error::{DataCorruption, TantivyError};
use crate::indexer::index_writer::{MAX_NUM_THREAD, MEMORY_ARENA_NUM_BYTES_MIN};
use crate::indexer::segment_updater::save_new_metas;
use crate::reader::{IndexReader, IndexReaderBuilder};
use crate::schema::{Field, FieldType, Schema};
use crate::tokenizer::{TextAnalyzer, TokenizerManager};
use crate::IndexWriter;

fn load_metas(
    directory: &dyn Directory,
    inventory: &SegmentMetaInventory,
@@ -78,7 +70,6 @@ fn load_metas(
/// let schema = schema_builder.build();
/// let settings = IndexSettings{sort_by_field: Some(IndexSortByField{field:"number".to_string(), order:Order::Asc}), ..Default::default()};
/// let index = Index::builder().schema(schema).settings(settings).create_in_ram();
///
/// ```
pub struct IndexBuilder {
    schema: Option<Schema>,
@@ -97,16 +88,21 @@ impl IndexBuilder {
            index_settings: IndexSettings::default(),
        }
    }

    /// Set the settings
    #[must_use]
    pub fn settings(mut self, settings: IndexSettings) -> Self {
        self.index_settings = settings;
        self
    }

    /// Set the schema
    #[must_use]
    pub fn schema(mut self, schema: Schema) -> Self {
        self.schema = Some(schema);
        self
    }

    /// Creates a new index using the `RAMDirectory`.
    ///
    /// The index will be allocated in anonymous memory.
@@ -117,6 +113,7 @@ impl IndexBuilder {
            .create(ram_directory)
            .expect("Creating a RAMDirectory should never fail"))
    }

    /// Creates a new index in a given filepath.
    /// The index will use the `MMapDirectory`.
    ///
@@ -129,6 +126,7 @@ impl IndexBuilder {
        }
        self.create(mmap_directory)
    }

    /// Creates a new index in a temp directory.
    ///
    /// The index will use the `MMapDirectory` in a newly created directory.
@@ -142,12 +140,14 @@ impl IndexBuilder {
        let mmap_directory: Box<dyn Directory> = Box::new(MmapDirectory::create_from_tempdir()?);
        self.create(mmap_directory)
    }

    fn get_expect_schema(&self) -> crate::Result<Schema> {
        self.schema
            .as_ref()
            .cloned()
            .ok_or(TantivyError::IndexBuilderMissingArgument("schema"))
    }

    /// Opens or creates a new index in the provided directory
    pub fn open_or_create<T: Into<Box<dyn Directory>>>(self, dir: T) -> crate::Result<Index> {
        let dir = dir.into();
@@ -397,17 +397,18 @@ impl Index {
    /// - `num_threads` defines the number of indexing workers that
    ///   should work at the same time.
    ///
    /// - `overall_heap_size_in_bytes` sets the amount of memory
    /// - `overall_memory_arena_in_bytes` sets the amount of memory
    ///   allocated for all indexing threads.
    ///   Each thread will receive a budget of `overall_heap_size_in_bytes / num_threads`.
    ///   Each thread will receive a budget of `overall_memory_arena_in_bytes / num_threads`.
    ///
    /// # Errors
    /// If the lockfile already exists, returns `Error::DirectoryLockBusy` or an `Error::IoError`.
    /// If the heap size per thread is too small or too big, returns `TantivyError::InvalidArgument`
    /// If the memory arena per thread is too small or too big, returns
    /// `TantivyError::InvalidArgument`
    pub fn writer_with_num_threads(
        &self,
        num_threads: usize,
        overall_heap_size_in_bytes: usize,
        overall_memory_arena_in_bytes: usize,
    ) -> crate::Result<IndexWriter> {
        let directory_lock = self
            .directory
@@ -416,26 +417,25 @@ impl Index {
            TantivyError::LockFailure(
                err,
                Some(
                    "Failed to acquire index lock. If you are using \
                     a regular directory, this means there is already an \
                     `IndexWriter` working on this `Directory`, in this process \
                     or in a different process."
                    "Failed to acquire index lock. If you are using a regular directory, this \
                     means there is already an `IndexWriter` working on this `Directory`, in \
                     this process or in a different process."
                        .to_string(),
                ),
            )
        })?;
        let heap_size_in_bytes_per_thread = overall_heap_size_in_bytes / num_threads;
        let memory_arena_in_bytes_per_thread = overall_memory_arena_in_bytes / num_threads;
        IndexWriter::new(
            self,
            num_threads,
            heap_size_in_bytes_per_thread,
            memory_arena_in_bytes_per_thread,
            directory_lock,
        )
    }
    /// Helper to create an index writer for tests.
    ///
    /// That index writer simply has a single thread and a heap of 10 MB.
    /// That index writer simply has a single thread and a memory arena of 10 MB.
    /// Using a single thread gives us a deterministic allocation of DocId.
    #[cfg(test)]
    pub fn writer_for_tests(&self) -> crate::Result<IndexWriter> {
@@ -446,29 +446,28 @@ impl Index {
    ///
    /// Tantivy will automatically define the number of threads to use, but
    /// no more than 8 threads.
    /// `overall_heap_size_in_bytes` is the total target memory usage that will be split
    /// `overall_memory_arena_in_bytes` is the total target memory usage that will be split
    /// between a given number of threads.
    ///
    /// # Errors
    /// If the lockfile already exists, returns `Error::FileAlreadyExists`.
    /// If the heap size per thread is too small or too big, returns `TantivyError::InvalidArgument`
    pub fn writer(&self, overall_heap_size_in_bytes: usize) -> crate::Result<IndexWriter> {
    /// If the memory arena per thread is too small or too big, returns
    /// `TantivyError::InvalidArgument`
    pub fn writer(&self, memory_arena_num_bytes: usize) -> crate::Result<IndexWriter> {
        let mut num_threads = std::cmp::min(num_cpus::get(), MAX_NUM_THREAD);
        let heap_size_in_bytes_per_thread = overall_heap_size_in_bytes / num_threads;
        if heap_size_in_bytes_per_thread < HEAP_SIZE_MIN {
            num_threads = (overall_heap_size_in_bytes / HEAP_SIZE_MIN).max(1);
        let memory_arena_num_bytes_per_thread = memory_arena_num_bytes / num_threads;
        if memory_arena_num_bytes_per_thread < MEMORY_ARENA_NUM_BYTES_MIN {
            num_threads = (memory_arena_num_bytes / MEMORY_ARENA_NUM_BYTES_MIN).max(1);
        }
        self.writer_with_num_threads(num_threads, overall_heap_size_in_bytes)
        self.writer_with_num_threads(num_threads, memory_arena_num_bytes)
    }
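
As a usage note for the hunk above: the argument to `writer` is the *total* memory arena,
and tantivy splits it evenly across its worker threads, shrinking the thread count when the
per-thread share would fall below the minimum. A minimal sketch (the 50 MB figure is an
arbitrary choice for illustration):

```rust
use tantivy::schema::{Schema, TEXT};
use tantivy::Index;

fn main() -> tantivy::Result<()> {
    let mut schema_builder = Schema::builder();
    schema_builder.add_text_field("title", TEXT);
    let index = Index::create_in_ram(schema_builder.build());

    // 50 MB arena in total; with e.g. 4 indexing threads, each thread
    // receives a budget of 12.5 MB.
    let _index_writer = index.writer(50_000_000)?;
    Ok(())
}
```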

    /// Accessor to the index settings
    ///
    pub fn settings(&self) -> &IndexSettings {
        &self.settings
    }

    /// Mutable accessor to the index settings
    ///
    pub fn settings_mut(&mut self) -> &mut IndexSettings {
        &mut self.settings
    }
@@ -556,15 +555,9 @@ impl fmt::Debug for Index {

#[cfg(test)]
mod tests {
    use crate::schema::Field;
    use crate::schema::{Schema, INDEXED, TEXT};
    use crate::IndexReader;
    use crate::ReloadPolicy;
    use crate::{
        directory::{RamDirectory, WatchCallback},
        IndexSettings,
    };
    use crate::{Directory, Index};
    use crate::directory::{RamDirectory, WatchCallback};
    use crate::schema::{Field, Schema, INDEXED, TEXT};
    use crate::{Directory, Index, IndexReader, IndexSettings, ReloadPolicy};

    #[test]
    fn test_indexer_for_field() {
@@ -673,10 +666,12 @@ mod tests {
    #[cfg(feature = "mmap")]
    mod mmap_specific {

        use std::path::PathBuf;

        use tempfile::TempDir;

        use super::*;
        use crate::Directory;
        use std::path::PathBuf;
        use tempfile::TempDir;

        #[test]
        fn test_index_on_commit_reload_policy_mmap() -> crate::Result<()> {
@@ -1,12 +1,16 @@
use super::SegmentComponent;
use crate::schema::Schema;
use crate::Opstamp;
use crate::{core::SegmentId, store::Compressor};
use crate::{Inventory, TrackedObject};
use serde::{Deserialize, Serialize};
use std::collections::HashSet;
use std::fmt;
use std::path::PathBuf;
use std::{collections::HashSet, sync::atomic::AtomicBool};
use std::{fmt, sync::Arc};
use std::sync::atomic::AtomicBool;
use std::sync::Arc;

use serde::{Deserialize, Serialize};

use super::SegmentComponent;
use crate::core::SegmentId;
use crate::schema::Schema;
use crate::store::Compressor;
use crate::{Inventory, Opstamp, TrackedObject};

#[derive(Clone, Debug, Serialize, Deserialize)]
struct DeleteMeta {
@@ -188,6 +192,7 @@ impl SegmentMeta {
    }

    #[doc(hidden)]
    #[must_use]
    pub fn with_delete_meta(self, num_deleted_docs: u32, opstamp: Opstamp) -> SegmentMeta {
        assert!(
            num_deleted_docs <= self.max_doc(),
@@ -282,7 +287,6 @@ impl Order {
/// * the searchable segments,
/// * the index `docstamp`
/// * the schema
///
#[derive(Clone, Serialize)]
pub struct IndexMeta {
    /// `IndexSettings` to configure index options.
@@ -370,10 +374,8 @@ impl fmt::Debug for IndexMeta {
mod tests {

    use super::IndexMeta;
    use crate::{
        schema::{Schema, TEXT},
        IndexSettings, IndexSortByField, Order,
    };
    use crate::schema::{Schema, TEXT};
    use crate::{IndexSettings, IndexSortByField, Order};

    #[test]
    fn test_serialize_metas() {

@@ -1,13 +1,12 @@
use std::io;

use common::BinarySerializable;

use crate::directory::FileSlice;
use crate::positions::PositionReader;
use crate::postings::TermInfo;
use crate::postings::{BlockSegmentPostings, SegmentPostings};
use crate::schema::IndexRecordOption;
use crate::schema::Term;
use crate::postings::{BlockSegmentPostings, SegmentPostings, TermInfo};
use crate::schema::{IndexRecordOption, Term};
use crate::termdict::TermDictionary;
use common::BinarySerializable;

/// The inverted index reader is in charge of accessing
/// the inverted index associated with a specific field.

@@ -8,6 +8,10 @@ mod segment_component;
mod segment_id;
mod segment_reader;

use std::path::Path;

use once_cell::sync::Lazy;

pub use self::executor::Executor;
pub use self::index::{Index, IndexBuilder};
pub use self::index_meta::{
@@ -20,9 +24,6 @@ pub use self::segment_component::SegmentComponent;
pub use self::segment_id::SegmentId;
pub use self::segment_reader::SegmentReader;

use once_cell::sync::Lazy;
use std::path::Path;

/// The meta file contains all the information about the list of segments and the schema
/// of the index.
pub static META_FILEPATH: Lazy<&'static Path> = Lazy::new(|| Path::new("meta.json"));

@@ -1,21 +1,14 @@
use crate::collector::Collector;
use crate::core::Executor;
use crate::core::SegmentReader;
use crate::query::Query;
use crate::schema::Document;
use crate::schema::Schema;
use crate::schema::Term;
use crate::space_usage::SearcherSpaceUsage;
use crate::store::StoreReader;
use crate::DocAddress;
use crate::Index;
use crate::Opstamp;
use crate::SegmentId;
use crate::TrackedObject;

use std::collections::BTreeMap;
use std::{fmt, io};

use crate::collector::Collector;
use crate::core::{Executor, SegmentReader};
use crate::query::Query;
use crate::schema::{Document, Schema, Term};
use crate::space_usage::SearcherSpaceUsage;
use crate::store::StoreReader;
use crate::{DocAddress, Index, Opstamp, SegmentId, TrackedObject};

/// Identifies the searcher generation accessed by a [Searcher].
///
/// While this might seem redundant, a [SearcherGeneration] contains
@@ -69,7 +62,6 @@ impl SearcherGeneration {
///
/// It guarantees that the `Segment` will not be removed before
/// the destruction of the `Searcher`.
///
pub struct Searcher {
    schema: Schema,
    index: Index,

@@ -1,15 +1,13 @@
use super::SegmentComponent;
use crate::core::Index;
use crate::core::SegmentId;
use crate::core::SegmentMeta;
use crate::directory::error::{OpenReadError, OpenWriteError};
use crate::directory::Directory;
use crate::directory::{FileSlice, WritePtr};
use crate::schema::Schema;
use crate::Opstamp;
use std::fmt;
use std::path::PathBuf;

use super::SegmentComponent;
use crate::core::{Index, SegmentId, SegmentMeta};
use crate::directory::error::{OpenReadError, OpenWriteError};
use crate::directory::{Directory, FileSlice, WritePtr};
use crate::schema::Schema;
use crate::Opstamp;

/// A segment is a piece of the index.
#[derive(Clone)]
pub struct Segment {
@@ -56,6 +54,7 @@ impl Segment {
    }

    #[doc(hidden)]
    #[must_use]
    pub fn with_delete_meta(self, num_deleted_docs: u32, opstamp: Opstamp) -> Segment {
        Segment {
            index: self.index,

@@ -1,14 +1,14 @@
use std::cmp::{Ord, Ordering};
use std::error::Error;
use std::fmt;
use uuid::Uuid;
use std::str::FromStr;
#[cfg(test)]
use std::sync::atomic;

#[cfg(test)]
use once_cell::sync::Lazy;
use serde::{Deserialize, Serialize};
use std::error::Error;
use std::str::FromStr;
#[cfg(test)]
use std::sync::atomic;
use uuid::Uuid;

/// Uuid identifying a segment.
///

@@ -1,28 +1,19 @@
use crate::core::InvertedIndexReader;
use crate::core::Segment;
use crate::core::SegmentComponent;
use crate::core::SegmentId;
use crate::directory::CompositeFile;
use crate::directory::FileSlice;
use std::collections::HashMap;
use std::sync::{Arc, RwLock};
use std::{fmt, io};

use fail::fail_point;

use crate::core::{InvertedIndexReader, Segment, SegmentComponent, SegmentId};
use crate::directory::{CompositeFile, FileSlice};
use crate::error::DataCorruption;
use crate::fastfield::intersect_alive_bitsets;
use crate::fastfield::AliveBitSet;
use crate::fastfield::FacetReader;
use crate::fastfield::FastFieldReaders;
use crate::fastfield::{intersect_alive_bitsets, AliveBitSet, FacetReader, FastFieldReaders};
use crate::fieldnorm::{FieldNormReader, FieldNormReaders};
use crate::schema::FieldType;
use crate::schema::Schema;
use crate::schema::{Field, IndexRecordOption};
use crate::schema::{Field, FieldType, IndexRecordOption, Schema};
use crate::space_usage::SegmentSpaceUsage;
use crate::store::StoreReader;
use crate::termdict::TermDictionary;
use crate::DocId;
use crate::Opstamp;
use fail::fail_point;
use std::fmt;
use std::sync::Arc;
use std::sync::RwLock;
use std::{collections::HashMap, io};
use crate::{DocId, Opstamp};

/// Entry point to access all of the data structures of the `Segment`
///
@@ -130,7 +121,8 @@ impl SegmentReader {
        self.fieldnorm_readers.get_field(field)?.ok_or_else(|| {
            let field_name = self.schema.get_field_name(field);
            let err_msg = format!(
                "Field norm not found for field {:?}. Was the field set to record norm during indexing?",
                "Field norm not found for field {:?}. Was the field set to record norm during \
                 indexing?",
                field_name
            );
            crate::TantivyError::SchemaError(err_msg)
@@ -259,19 +251,24 @@ impl SegmentReader {
        let record_option = record_option_opt.unwrap();
        let postings_file = postings_file_opt.unwrap();

        let termdict_file: FileSlice = self.termdict_composite.open_read(field)
            .ok_or_else(||
                DataCorruption::comment_only(format!("Failed to open field {:?}'s term dictionary in the composite file. Has the schema been modified?", field_entry.name()))
            )?;

        let positions_file = self
            .positions_composite
            .open_read(field)
            .ok_or_else(|| {
                let error_msg = format!("Failed to open field {:?}'s positions in the composite file. Has the schema been modified?", field_entry.name());
                DataCorruption::comment_only(error_msg)
        let termdict_file: FileSlice =
            self.termdict_composite.open_read(field).ok_or_else(|| {
                DataCorruption::comment_only(format!(
                    "Failed to open field {:?}'s term dictionary in the composite file. Has the \
                     schema been modified?",
                    field_entry.name()
                ))
            })?;

        let positions_file = self.positions_composite.open_read(field).ok_or_else(|| {
            let error_msg = format!(
                "Failed to open field {:?}'s positions in the composite file. Has the schema been \
                 modified?",
                field_entry.name()
            );
            DataCorruption::comment_only(error_msg)
        })?;

        let inv_idx_reader = Arc::new(InvertedIndexReader::new(
            TermDictionary::open(termdict_file)?,
            postings_file,

@@ -1,17 +1,14 @@
use crate::directory::FileSlice;
use crate::directory::{TerminatingWrite, WritePtr};
use crate::schema::Field;
use crate::space_usage::FieldUsage;
use crate::space_usage::PerFieldSpaceUsage;
use common::BinarySerializable;
use common::CountingWriter;
use common::HasLen;
use common::VInt;
use std::collections::HashMap;
use std::io::{self, Read, Write};
use std::iter::ExactSizeIterator;
use std::ops::Range;

use common::{BinarySerializable, CountingWriter, HasLen, VInt};

use crate::directory::{FileSlice, TerminatingWrite, WritePtr};
use crate::schema::Field;
use crate::space_usage::{FieldUsage, PerFieldSpaceUsage};

#[derive(Eq, PartialEq, Hash, Copy, Ord, PartialOrd, Clone, Debug)]
pub struct FileAddr {
    field: Field,
@@ -186,13 +183,14 @@ impl CompositeFile {
#[cfg(test)]
mod test {

    use std::io::Write;
    use std::path::Path;

    use common::{BinarySerializable, VInt};

    use super::{CompositeFile, CompositeWrite};
    use crate::directory::{Directory, RamDirectory};
    use crate::schema::Field;
    use common::BinarySerializable;
    use common::VInt;
    use std::io::Write;
    use std::path::Path;

    #[test]
    fn test_composite_file() -> crate::Result<()> {
@@ -1,18 +1,12 @@
use crate::directory::directory_lock::Lock;
use crate::directory::error::LockError;
use crate::directory::error::{DeleteError, OpenReadError, OpenWriteError};
use crate::directory::WatchHandle;
use crate::directory::{FileHandle, WatchCallback};
use crate::directory::{FileSlice, WritePtr};
use std::fmt;
use std::io;
use std::io::Write;
use std::marker::Send;
use std::marker::Sync;
use std::path::Path;
use std::path::PathBuf;
use std::thread;
use std::marker::{Send, Sync};
use std::path::{Path, PathBuf};
use std::time::Duration;
use std::{fmt, io, thread};

use crate::directory::directory_lock::Lock;
use crate::directory::error::{DeleteError, LockError, OpenReadError, OpenWriteError};
use crate::directory::{FileHandle, FileSlice, WatchCallback, WatchHandle, WritePtr};

/// The retry logic for acquiring locks is pretty simple.
/// We just retry `n` times after a given `duration`, both
@@ -233,8 +227,7 @@ pub trait DirectoryClone {
}

impl<T> DirectoryClone for T
where
    T: 'static + Directory + Clone,
where T: 'static + Directory + Clone
{
    fn box_clone(&self) -> Box<dyn Directory> {
        Box::new(self.clone())
@@ -1,6 +1,7 @@
use once_cell::sync::Lazy;
use std::path::PathBuf;

use once_cell::sync::Lazy;

/// A directory lock.
///
/// A lock is associated to a specific path and some
@@ -11,7 +12,6 @@ use std::path::PathBuf;
/// - [META_LOCK]
///
/// Check out these locks' documentation for more information.
///
#[derive(Debug)]
pub struct Lock {
    /// The lock needs to be associated with its own file `path`.
@@ -1,15 +1,17 @@
use crate::Version;
use std::fmt;
use std::io;
use std::path::PathBuf;
use std::{fmt, io};

use crate::Version;

/// Error while trying to acquire a directory lock.
#[derive(Debug, Error)]
pub enum LockError {
    /// Failed to acquire a lock as it is already held by another
    /// client.
    /// - In the context of a blocking lock, this means the lock was not released within some `timeout` period.
    /// - In the context of a non-blocking lock, this means the lock was busy at the moment of the call.
    /// - In the context of a blocking lock, this means the lock was not released within some
    ///   `timeout` period.
    /// - In the context of a non-blocking lock, this means the lock was busy at the moment of the
    ///   call.
    #[error("Could not acquire lock as it is already held, possibly by a different process.")]
    LockBusy,
    /// Trying to acquire a lock failed with an `IoError`
@@ -1,11 +1,11 @@
use std::ops::{Deref, Range};
use std::sync::{Arc, Weak};
use std::{fmt, io};

use common::HasLen;
use stable_deref_trait::StableDeref;

use crate::directory::OwnedBytes;
use common::HasLen;
use std::fmt;
use std::ops::Range;
use std::sync::{Arc, Weak};
use std::{io, ops::Deref};

pub type ArcBytes = Arc<dyn Deref<Target = [u8]> + Send + Sync + 'static>;
pub type WeakArcBytes = Weak<dyn Deref<Target = [u8]> + Send + Sync + 'static>;
@@ -33,8 +33,7 @@ impl FileHandle for &'static [u8] {
}

impl<B> From<B> for FileSlice
where
    B: StableDeref + Deref<Target = [u8]> + 'static + Send + Sync,
where B: StableDeref + Deref<Target = [u8]> + 'static + Send + Sync
{
    fn from(bytes: B) -> FileSlice {
        FileSlice::new(Box::new(OwnedBytes::new(bytes)))
@@ -44,7 +43,6 @@ where
/// Logical slice of a read-only file in tantivy.
///
/// It can be cloned and sliced cheaply.
///
#[derive(Clone)]
pub struct FileSlice {
    data: Arc<dyn FileHandle>,
@@ -79,6 +77,7 @@ impl FileSlice {
    /// # Panics
    ///
    /// Panics if `byte_range.end` exceeds the filesize.
    #[must_use]
    pub fn slice(&self, byte_range: Range<usize>) -> FileSlice {
        assert!(byte_range.end <= self.len());
        FileSlice {
@@ -138,6 +137,7 @@ impl FileSlice {
    /// boundary.
    ///
    /// Equivalent to `.slice(from_offset, self.len())`
    #[must_use]
    pub fn slice_from(&self, from_offset: usize) -> FileSlice {
        self.slice(from_offset..self.len())
    }
@@ -145,6 +145,7 @@ impl FileSlice {
    /// Returns a slice from the end.
    ///
    /// Equivalent to `.slice(self.len() - from_offset, self.len())`
    #[must_use]
    pub fn slice_from_end(&self, from_offset: usize) -> FileSlice {
        self.slice(self.len() - from_offset..self.len())
    }
@@ -153,6 +154,7 @@ impl FileSlice {
    /// boundary.
    ///
    /// Equivalent to `.slice(0, to_offset)`
    #[must_use]
    pub fn slice_to(&self, to_offset: usize) -> FileSlice {
        self.slice(0..to_offset)
    }
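
The three helpers above are thin wrappers around `slice`. A small sketch of how they
compose, reusing the `b"abcdef"` fixture from the tests below (and assuming
`FileSlice::new` and `read_bytes` keep the signatures shown in this diff):

```rust
use tantivy::directory::FileSlice;

fn slices() -> std::io::Result<()> {
    let file_slice = FileSlice::new(Box::new(b"abcdef".as_ref()));
    // slice_from(2)     == slice(2..len)       -> "cdef"
    // slice_to(3)       == slice(0..3)         -> "abc"
    // slice_from_end(2) == slice(len-2..len)   -> "ef"
    assert_eq!(file_slice.slice_from(2).read_bytes()?.as_slice(), b"cdef");
    assert_eq!(file_slice.slice_to(3).read_bytes()?.as_slice(), b"abc");
    assert_eq!(file_slice.slice_from_end(2).read_bytes()?.as_slice(), b"ef");
    Ok(())
}
```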
@@ -172,10 +174,12 @@ impl HasLen for FileSlice {

#[cfg(test)]
mod tests {
    use super::{FileHandle, FileSlice};
    use common::HasLen;
    use std::io;

    use common::HasLen;

    use super::{FileHandle, FileSlice};

    #[test]
    fn test_file_slice() -> io::Result<()> {
        let file_slice = FileSlice::new(Box::new(b"abcdef".as_ref()));

@@ -1,13 +1,13 @@
use crate::directory::{WatchCallback, WatchCallbackList, WatchHandle};
use crc32fast::Hasher;
use std::fs;
use std::io;
use std::io::BufRead;
use std::path::Path;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::thread;
use std::time::Duration;
use std::{fs, io, thread};

use crc32fast::Hasher;

use crate::directory::{WatchCallback, WatchCallbackList, WatchHandle};

pub const POLLING_INTERVAL: Duration = Duration::from_millis(if cfg!(test) { 1 } else { 500 });

@@ -99,9 +99,8 @@ mod tests {

    use std::mem;

    use crate::directory::mmap_directory::atomic_write;

    use super::*;
    use crate::directory::mmap_directory::atomic_write;

    #[test]
    fn test_file_watcher_drop_watcher() -> crate::Result<()> {
@@ -1,14 +1,13 @@
use crate::directory::error::Incompatibility;
use crate::directory::FileSlice;
use crate::{
    directory::{AntiCallToken, TerminatingWrite},
    Version, INDEX_FORMAT_VERSION,
};
use std::io;
use std::io::Write;

use common::{BinarySerializable, CountingWriter, DeserializeFrom, FixedSize, HasLen};
use crc32fast::Hasher;
use serde::{Deserialize, Serialize};
use std::io;
use std::io::Write;

use crate::directory::error::Incompatibility;
use crate::directory::{AntiCallToken, FileSlice, TerminatingWrite};
use crate::{Version, INDEX_FORMAT_VERSION};

const FOOTER_MAX_LEN: u32 = 50_000;

@@ -64,7 +63,9 @@ impl Footer {
        if footer_magic_byte != FOOTER_MAGIC_NUMBER {
            return Err(io::Error::new(
                io::ErrorKind::InvalidData,
                "Footer magic byte mismatch. File corrupted or index was created using an old tantivy version which is not supported anymore. Please use tantivy 0.15 or above to recreate the index.",
                "Footer magic byte mismatch. File corrupted or index was created using an old \
                 tantivy version which is not supported anymore. Please use tantivy 0.15 or above \
                 to recreate the index.",
            ));
        }

@@ -73,7 +74,7 @@ impl Footer {
                io::ErrorKind::InvalidData,
                format!(
                    "Footer seems invalid as it suggests a footer len of {}. File is corrupted, \
                     or the index was created with a different & old version of tantivy.",
                     or the index was created with a different & old version of tantivy.",
                    footer_len
                ),
            ));
@@ -154,12 +155,13 @@ impl<W: TerminatingWrite> TerminatingWrite for FooterProxy<W> {
#[cfg(test)]
mod tests {

    use crate::directory::footer::Footer;
    use crate::directory::OwnedBytes;
    use crate::directory::{footer::FOOTER_MAGIC_NUMBER, FileSlice};
    use common::BinarySerializable;
    use std::io;

    use common::BinarySerializable;

    use crate::directory::footer::{Footer, FOOTER_MAGIC_NUMBER};
    use crate::directory::{FileSlice, OwnedBytes};

    #[test]
    fn test_deserialize_footer() {
        let mut buf: Vec<u8> = vec![];
@@ -183,8 +185,9 @@ mod tests {
        let err = Footer::extract_footer(fileslice).unwrap_err();
        assert_eq!(
            err.to_string(),
            "Footer magic byte mismatch. File corrupted or index was created using an old tantivy version which \
             is not supported anymore. Please use tantivy 0.15 or above to recreate the index."
            "Footer magic byte mismatch. File corrupted or index was created using an old tantivy \
             version which is not supported anymore. Please use tantivy 0.15 or above to recreate \
             the index."
        );
    }
    #[test]
@@ -219,8 +222,8 @@ mod tests {
        assert_eq!(err.kind(), io::ErrorKind::InvalidData);
        assert_eq!(
            err.to_string(),
            "Footer seems invalid as it suggests a footer len of 50001. File is corrupted, \
             or the index was created with a different & old version of tantivy."
            "Footer seems invalid as it suggests a footer len of 50001. File is corrupted, or the \
             index was created with a different & old version of tantivy."
        );
    }
}
@@ -1,24 +1,21 @@
use std::collections::HashSet;
use std::io::Write;
use std::path::{Path, PathBuf};
use std::sync::{Arc, RwLock, RwLockWriteGuard};
use std::{io, result};

use crc32fast::Hasher;

use crate::core::MANAGED_FILEPATH;
use crate::directory::error::{DeleteError, LockError, OpenReadError, OpenWriteError};
use crate::directory::footer::{Footer, FooterProxy};
use crate::directory::GarbageCollectionResult;
use crate::directory::Lock;
use crate::directory::META_LOCK;
use crate::directory::{DirectoryLock, FileHandle};
use crate::directory::{FileSlice, WritePtr};
use crate::directory::{WatchCallback, WatchHandle};
use crate::directory::{
    DirectoryLock, FileHandle, FileSlice, GarbageCollectionResult, Lock, WatchCallback,
    WatchHandle, WritePtr, META_LOCK,
};
use crate::error::DataCorruption;
use crate::Directory;

use crc32fast::Hasher;
use std::collections::HashSet;
use std::io;
use std::io::Write;
use std::path::{Path, PathBuf};
use std::result;
use std::sync::RwLockWriteGuard;
use std::sync::{Arc, RwLock};

/// Returns true iff the file is "managed".
/// Non-managed files are not subject to garbage collection.
///
@@ -344,12 +341,14 @@ impl Clone for ManagedDirectory {
#[cfg(test)]
mod tests_mmap_specific {

    use crate::directory::{Directory, ManagedDirectory, MmapDirectory, TerminatingWrite};
    use std::collections::HashSet;
    use std::io::Write;
    use std::path::{Path, PathBuf};

    use tempfile::TempDir;

    use crate::directory::{Directory, ManagedDirectory, MmapDirectory, TerminatingWrite};

    #[test]
    fn test_managed_directory() {
        let tempdir = TempDir::new().unwrap();
@@ -1,32 +1,28 @@
use crate::core::META_FILEPATH;
use crate::directory::error::LockError;
use crate::directory::error::{DeleteError, OpenDirectoryError, OpenReadError, OpenWriteError};
use crate::directory::file_watcher::FileWatcher;
use crate::directory::Directory;
use crate::directory::DirectoryLock;
use crate::directory::Lock;
use crate::directory::WatchCallback;
use crate::directory::WatchHandle;
use crate::directory::{AntiCallToken, FileHandle, OwnedBytes};
use crate::directory::{ArcBytes, WeakArcBytes};
use crate::directory::{TerminatingWrite, WritePtr};
use std::collections::HashMap;
use std::convert::From;
use std::fs::{self, File, OpenOptions};
use std::io::{self, BufWriter, Read, Seek, SeekFrom, Write};
use std::ops::Deref;
use std::path::{Path, PathBuf};
use std::sync::{Arc, RwLock};
use std::{fmt, result};

use fs2::FileExt;
use memmap2::Mmap;
use serde::{Deserialize, Serialize};
use stable_deref_trait::StableDeref;
use std::convert::From;
use std::fmt;
use std::fs::OpenOptions;
use std::fs::{self, File};
use std::io::{self, Seek, SeekFrom};
use std::io::{BufWriter, Read, Write};
use std::path::{Path, PathBuf};
use std::result;
use std::sync::Arc;
use std::sync::RwLock;
use std::{collections::HashMap, ops::Deref};
use tempfile::TempDir;

use crate::core::META_FILEPATH;
use crate::directory::error::{
    DeleteError, LockError, OpenDirectoryError, OpenReadError, OpenWriteError,
};
use crate::directory::file_watcher::FileWatcher;
use crate::directory::{
    AntiCallToken, ArcBytes, Directory, DirectoryLock, FileHandle, Lock, OwnedBytes,
    TerminatingWrite, WatchCallback, WatchHandle, WeakArcBytes, WritePtr,
};

/// Create a default io error given a string.
pub(crate) fn make_io_err(msg: String) -> io::Error {
    io::Error::new(io::ErrorKind::Other, msg)
@@ -320,8 +316,7 @@ impl Directory for MmapDirectory {

        let mut mmap_cache = self.inner.mmap_cache.write().map_err(|_| {
            let msg = format!(
                "Failed to acquire write lock \
                 on mmap cache while reading {:?}",
                "Failed to acquire write lock on mmap cache while reading {:?}",
                path
            );
            let io_err = make_io_err(msg);
@@ -457,6 +452,7 @@ impl Directory for MmapDirectory {
        #[cfg(windows)]
        {
            use std::os::windows::fs::OpenOptionsExt;

            use winapi::um::winbase;

            open_opts
@@ -476,15 +472,12 @@ mod tests {
    // There are more tests in directory/mod.rs
    // The following tests are specific to the MmapDirectory

    use common::HasLen;

    use super::*;
    use crate::indexer::LogMergePolicy;
    use crate::Index;
    use crate::ReloadPolicy;
    use crate::{
        schema::{Schema, SchemaBuilder, TEXT},
        IndexSettings,
    };
    use common::HasLen;
    use crate::schema::{Schema, SchemaBuilder, TEXT};
    use crate::{Index, IndexSettings, ReloadPolicy};

    #[test]
    fn test_open_non_existent_path() {
@@ -521,7 +514,7 @@ mod tests {
        {
            for path in &paths {
                let mut w = mmap_directory.open_write(path).unwrap();
                w.write(content).unwrap();
                w.write_all(content).unwrap();
                w.flush().unwrap();
            }
        }
@@ -1,8 +1,4 @@
/*!

WORM (Write Once Read Many) directory abstraction.

*/
//! WORM (Write Once Read Many) directory abstraction.

#[cfg(feature = "mmap")]
mod mmap_directory;
@@ -22,19 +18,19 @@ pub mod error;

mod composite_file;

use std::io::BufWriter;
use std::path::PathBuf;

pub use common::{AntiCallToken, TerminatingWrite};

pub(crate) use self::composite_file::{CompositeFile, CompositeWrite};
pub use self::directory::DirectoryLock;
pub use self::directory::{Directory, DirectoryClone};
pub use self::directory::{Directory, DirectoryClone, DirectoryLock};
pub use self::directory_lock::{Lock, INDEX_WRITER_LOCK, META_LOCK};
pub(crate) use self::file_slice::{ArcBytes, WeakArcBytes};
pub use self::file_slice::{FileHandle, FileSlice};
pub use self::owned_bytes::OwnedBytes;
pub use self::ram_directory::RamDirectory;
pub use self::watch_event_router::{WatchCallback, WatchCallbackList, WatchHandle};
pub use common::AntiCallToken;
pub use common::TerminatingWrite;
use std::io::BufWriter;
use std::path::PathBuf;

/// Outcome of the Garbage collection
pub struct GarbageCollectionResult {
@@ -50,11 +46,10 @@ pub struct GarbageCollectionResult {
    pub failed_to_delete_files: Vec<PathBuf>,
}

pub use self::managed_directory::ManagedDirectory;
#[cfg(feature = "mmap")]
pub use self::mmap_directory::MmapDirectory;

pub use self::managed_directory::ManagedDirectory;

/// Write object for Directory.
///
/// `WritePtr` are required to implement both Write
@@ -1,9 +1,10 @@
use crate::directory::FileHandle;
use std::io;
use std::ops::Range;

pub use ownedbytes::OwnedBytes;

use crate::directory::FileHandle;

impl FileHandle for OwnedBytes {
    fn read_bytes(&self, range: Range<usize>) -> io::Result<OwnedBytes> {
        Ok(self.slice(range))

@@ -1,19 +1,19 @@
use crate::core::META_FILEPATH;
use crate::directory::error::{DeleteError, OpenReadError, OpenWriteError};
use crate::directory::AntiCallToken;
use crate::directory::WatchCallbackList;
use crate::directory::{Directory, FileSlice, WatchCallback, WatchHandle};
use crate::directory::{TerminatingWrite, WritePtr};
use common::HasLen;
use fail::fail_point;
use std::collections::HashMap;
use std::fmt;
use std::io::{self, BufWriter, Cursor, Seek, SeekFrom, Write};
use std::path::{Path, PathBuf};
use std::result;
use std::sync::{Arc, RwLock};
use std::{fmt, result};

use common::HasLen;
use fail::fail_point;

use super::FileHandle;
use crate::core::META_FILEPATH;
use crate::directory::error::{DeleteError, OpenReadError, OpenWriteError};
use crate::directory::{
    AntiCallToken, Directory, FileSlice, TerminatingWrite, WatchCallback, WatchCallbackList,
    WatchHandle, WritePtr,
};

/// Writer associated with the `RamDirectory`
///
@@ -40,7 +40,9 @@ impl Drop for VecWriter {
    fn drop(&mut self) {
        if !self.is_flushed {
            warn!(
                "You forgot to flush {:?} before its writer was dropped. Do not rely on drop. This also occurs when the indexer crashed, so you may want to check the logs for the root cause.",
                "You forgot to flush {:?} before its writer was dropped. Do not rely on drop. \
                 This also occurs when the indexer crashed, so you may want to check the logs \
                 for the root cause.",
                self.path
            )
        }
@@ -123,7 +125,6 @@ impl fmt::Debug for RamDirectory {
///
/// It is mainly meant for unit testing.
/// Writes are only made visible upon flushing.
///
#[derive(Clone, Default)]
pub struct RamDirectory {
    fs: Arc<RwLock<InnerDirectory>>,
@@ -233,11 +234,12 @@ impl Directory for RamDirectory {

#[cfg(test)]
mod tests {
    use super::RamDirectory;
    use crate::Directory;
    use std::io::Write;
    use std::path::Path;

    use super::RamDirectory;
    use crate::Directory;

    #[test]
    fn test_persist() {
        let msg_atomic: &'static [u8] = b"atomic is the way";
@@ -1,6 +1,3 @@
use super::*;
use futures::channel::oneshot;
use futures::executor::block_on;
use std::io::Write;
use std::mem;
use std::path::{Path, PathBuf};
@@ -9,6 +6,11 @@ use std::sync::atomic::{AtomicBool, AtomicUsize};
use std::sync::Arc;
use std::time::Duration;

use futures::channel::oneshot;
use futures::executor::block_on;

use super::*;

#[cfg(feature = "mmap")]
mod mmap_directory_tests {
    use crate::directory::MmapDirectory;

@@ -1,8 +1,7 @@
use std::sync::{Arc, RwLock, Weak};

use futures::channel::oneshot;
use futures::{Future, TryFutureExt};
use std::sync::Arc;
use std::sync::RwLock;
use std::sync::Weak;

/// Cloneable wrapper for callbacks registered when watching files of a `Directory`.
#[derive(Clone)]
@@ -103,12 +102,14 @@ impl WatchCallbackList {

#[cfg(test)]
mod tests {
    use crate::directory::{WatchCallback, WatchCallbackList};
    use futures::executor::block_on;
    use std::mem;
    use std::sync::atomic::{AtomicUsize, Ordering};
    use std::sync::Arc;

    use futures::executor::block_on;

    use crate::directory::{WatchCallback, WatchCallbackList};

    #[test]
    fn test_watch_event_router_simple() {
        let watch_event_router = WatchCallbackList::default();

@@ -1,7 +1,7 @@
use std::borrow::{Borrow, BorrowMut};

use crate::fastfield::AliveBitSet;
use crate::DocId;
use std::borrow::Borrow;
use std::borrow::BorrowMut;

/// Sentinel value returned when a DocSet has been entirely consumed.
///
17
src/error.rs
@@ -1,17 +1,14 @@
//! Definition of Tantivy's error and result.

use std::io;

use crate::directory::error::{Incompatibility, LockError};
use crate::fastfield::FastFieldNotAvailableError;
use crate::query;
use crate::{
    directory::error::{OpenDirectoryError, OpenReadError, OpenWriteError},
    schema,
};
use std::fmt;
use std::path::PathBuf;
use std::sync::PoisonError;
use std::{fmt, io};

use crate::directory::error::{
    Incompatibility, LockError, OpenDirectoryError, OpenReadError, OpenWriteError,
};
use crate::fastfield::FastFieldNotAvailableError;
use crate::{query, schema};

/// Represents a `DataCorruption` error.
///
@@ -1,12 +1,12 @@
use crate::space_usage::ByteCount;
use crate::DocId;
use common::intersect_bitsets;
use common::BitSet;
use common::ReadOnlyBitSet;
use ownedbytes::OwnedBytes;
use std::io;
use std::io::Write;

use common::{intersect_bitsets, BitSet, ReadOnlyBitSet};
use ownedbytes::OwnedBytes;

use crate::space_usage::ByteCount;
use crate::DocId;

/// Write an alive `BitSet`
///
/// where `alive_bitset` is the set of alive `DocId`.
@@ -168,11 +168,12 @@ mod tests {
#[cfg(all(test, feature = "unstable"))]
mod bench {

    use super::AliveBitSet;
    use rand::prelude::IteratorRandom;
    use rand::thread_rng;
    use test::Bencher;

    use super::AliveBitSet;

    fn get_alive() -> Vec<u32> {
        let mut data = (0..1_000_000_u32).collect::<Vec<u32>>();
        for _ in 0..(1_000_000) * 1 / 8 {
@@ -6,11 +6,12 @@ pub use self::writer::BytesFastFieldWriter;
|
||||
|
||||
#[cfg(test)]
|
||||
mod tests {
|
||||
use crate::schema::{BytesOptions, IndexRecordOption, Schema, Value};
|
||||
use crate::{query::TermQuery, schema::FAST, schema::INDEXED, schema::STORED};
|
||||
use crate::{DocAddress, DocSet, Index, Searcher, Term};
|
||||
use std::ops::Deref;
|
||||
|
||||
use crate::query::TermQuery;
|
||||
use crate::schema::{BytesOptions, IndexRecordOption, Schema, Value, FAST, INDEXED, STORED};
|
||||
use crate::{DocAddress, DocSet, Index, Searcher, Term};
|
||||
|
||||
#[test]
|
||||
fn test_bytes() -> crate::Result<()> {
|
||||
let mut schema_builder = Schema::builder();
|
||||
@@ -62,7 +63,7 @@ mod tests {
|
||||
assert_eq!(values.len(), 2);
|
||||
let values_bytes: Vec<&[u8]> = values
|
||||
.into_iter()
|
||||
.flat_map(|value| value.bytes_value())
|
||||
.flat_map(|value| value.as_bytes())
|
||||
.collect();
|
||||
assert_eq!(values_bytes, &[&b"tantivy"[..], &b"lucene"[..]]);
|
||||
Ok(())
|
||||
|
||||
@@ -1,6 +1,5 @@
|
||||
use crate::directory::FileSlice;
|
||||
use crate::directory::OwnedBytes;
|
||||
use crate::fastfield::{BitpackedFastFieldReader, FastFieldReader, MultiValueLength};
|
||||
use crate::directory::{FileSlice, OwnedBytes};
|
||||
use crate::fastfield::{DynamicFastFieldReader, FastFieldReader, MultiValueLength};
|
||||
use crate::DocId;
|
||||
|
||||
/// Reader for byte array fast fields
|
||||
@@ -15,13 +14,13 @@ use crate::DocId;
|
||||
/// and the start index for the next document, and keeping the bytes in between.
|
||||
#[derive(Clone)]
|
||||
pub struct BytesFastFieldReader {
|
||||
idx_reader: BitpackedFastFieldReader<u64>,
|
||||
idx_reader: DynamicFastFieldReader<u64>,
|
||||
values: OwnedBytes,
|
||||
}
|
||||
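The index/values layout described in the doc comment is easy to model outside tantivy. A minimal standalone sketch (a toy, not tantivy's actual types): entry `doc` of the index column is the start offset of that document's bytes, and entry `doc + 1` is the start of the next one.

// Sketch of the bytes fast field layout: one start offset per document,
// plus one trailing offset, so doc i owns values[idx[i]..idx[i + 1]].
struct BytesColumn {
    idx: Vec<usize>, // len = num_docs + 1
    values: Vec<u8>, // all documents' bytes, concatenated
}

impl BytesColumn {
    fn get_bytes(&self, doc: usize) -> &[u8] {
        &self.values[self.idx[doc]..self.idx[doc + 1]]
    }
}

fn main() {
    let column = BytesColumn {
        idx: vec![0, 7, 13],
        values: b"tantivylucene".to_vec(),
    };
    assert_eq!(column.get_bytes(0), &b"tantivy"[..]);
    assert_eq!(column.get_bytes(1), &b"lucene"[..]);
}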

impl BytesFastFieldReader {
    pub(crate) fn open(
        idx_reader: BitpackedFastFieldReader<u64>,
        idx_reader: DynamicFastFieldReader<u64>,
        values_file: FileSlice,
    ) -> crate::Result<BytesFastFieldReader> {
        let values = values_file.read_bytes()?;

@@ -1,10 +1,9 @@
use std::io;

use crate::fastfield::serializer::CompositeFastFieldSerializer;
use crate::indexer::doc_id_mapping::DocIdMapping;
use crate::schema::{Document, Field, Value};
use crate::DocId;
use crate::{
    fastfield::serializer::CompositeFastFieldSerializer, indexer::doc_id_mapping::DocIdMapping,
};

/// Writer for byte array (as in, any number of bytes per document) fast fields
///

@@ -1,6 +1,7 @@
use crate::schema::FieldEntry;
use std::result;

use crate::schema::FieldEntry;

/// `FastFieldNotAvailableError` is returned when the
/// user requested for a fast field reader, and the field was not
/// defined in the schema as a fast field.

@@ -1,10 +1,10 @@
use std::str;

use super::MultiValuedFastFieldReader;
use crate::error::DataCorruption;
use crate::schema::Facet;
use crate::termdict::TermDictionary;
use crate::termdict::TermOrdinal;
use crate::termdict::{TermDictionary, TermOrdinal};
use crate::DocId;
use std::str;

/// The facet reader makes it possible to access the list of
/// facets associated to a given document in a specific
@@ -82,11 +82,8 @@ impl FacetReader {

#[cfg(test)]
mod tests {
    use crate::Index;
    use crate::{
        schema::{Facet, FacetOptions, SchemaBuilder, Value, STORED},
        DocAddress, Document,
    };
    use crate::schema::{Facet, FacetOptions, SchemaBuilder, Value, STORED};
    use crate::{DocAddress, Document, Index};

    #[test]
    fn test_facet_only_indexed() -> crate::Result<()> {
@@ -106,7 +103,7 @@ mod tests {
        facet_reader.facet_ords(0u32, &mut facet_ords);
        assert_eq!(&facet_ords, &[2u64]);
        let doc = searcher.doc(DocAddress::new(0u32, 0u32))?;
        let value = doc.get_first(facet_field).and_then(Value::facet);
        let value = doc.get_first(facet_field).and_then(Value::as_facet);
        assert_eq!(value, None);
        Ok(())
    }
@@ -129,7 +126,7 @@ mod tests {
        facet_reader.facet_ords(0u32, &mut facet_ords);
        assert_eq!(&facet_ords, &[2u64]);
        let doc = searcher.doc(DocAddress::new(0u32, 0u32))?;
        let value: Option<&Facet> = doc.get_first(facet_field).and_then(Value::facet);
        let value: Option<&Facet> = doc.get_first(facet_field).and_then(Value::as_facet);
        assert_eq!(value, Facet::from_text("/a/b").ok().as_ref());
        Ok(())
    }

@@ -1,51 +1,38 @@
/*!
Column oriented field storage for tantivy.
//! Column oriented field storage for tantivy.
//!
//! It is the equivalent of `Lucene`'s `DocValues`.
//!
//! Fast fields is a column-oriented fashion storage of `tantivy`.
//!
//! It is designed for the fast random access of some document
//! fields given a document id.
//!
//! `FastField` are useful when a field is required for all or most of
//! the `DocSet` : for instance for scoring, grouping, filtering, or faceting.
//!
//!
//! Fields have to be declared as `FAST` in the schema.
//! Currently only 64-bits integers (signed or unsigned) are
//! supported.
//!
//! They are stored in a bit-packed fashion so that their
//! memory usage is directly linear with the amplitude of the
//! values stored.
//!
//! Read access performance is comparable to that of an array lookup.

It is the equivalent of `Lucene`'s `DocValues`.

Fast fields is a column-oriented fashion storage of `tantivy`.

It is designed for the fast random access of some document
fields given a document id.

`FastField` are useful when a field is required for all or most of
the `DocSet` : for instance for scoring, grouping, filtering, or faceting.


Fields have to be declared as `FAST` in the schema.
Currently only 64-bits integers (signed or unsigned) are
supported.

They are stored in a bit-packed fashion so that their
memory usage is directly linear with the amplitude of the
values stored.

Read access performance is comparable to that of an array lookup.
*/

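To make the module documentation concrete, here is a hedged sketch of declaring a `FAST` field and reading it back. It assumes the `Index`, `doc!` and `FastFieldReader` APIs as they appear elsewhere in this diff, so treat it as illustrative for this revision rather than a stable recipe.

use tantivy::fastfield::FastFieldReader;
use tantivy::schema::{Schema, FAST};
use tantivy::{doc, Index};

fn main() -> tantivy::Result<()> {
    // Declare the field as FAST so it is stored column-oriented.
    let mut schema_builder = Schema::builder();
    let num = schema_builder.add_u64_field("num", FAST);
    let index = Index::create_in_ram(schema_builder.build());

    let mut writer = index.writer(3_000_000)?;
    writer.add_document(doc!(num => 7u64))?;
    writer.commit()?;

    // Random access by DocId, comparable to an array lookup.
    let searcher = index.reader()?.searcher();
    let ff = searcher.segment_reader(0).fast_fields().u64(num)?;
    assert_eq!(ff.get(0), 7u64);
    Ok(())
}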
pub use self::alive_bitset::intersect_alive_bitsets;
pub use self::alive_bitset::write_alive_bitset;
pub use self::alive_bitset::AliveBitSet;
pub use self::alive_bitset::{intersect_alive_bitsets, write_alive_bitset, AliveBitSet};
pub use self::bytes::{BytesFastFieldReader, BytesFastFieldWriter};
pub use self::error::{FastFieldNotAvailableError, Result};
pub use self::facet_reader::FacetReader;
pub use self::multivalued::{MultiValuedFastFieldReader, MultiValuedFastFieldWriter};
pub(crate) use self::reader::BitpackedFastFieldReader;
pub use self::reader::DynamicFastFieldReader;
pub use self::reader::FastFieldReader;
pub use self::reader::{DynamicFastFieldReader, FastFieldReader};
pub use self::readers::FastFieldReaders;
pub use self::serializer::CompositeFastFieldSerializer;
pub use self::serializer::FastFieldDataAccess;
pub use self::serializer::FastFieldStats;
pub use self::serializer::{CompositeFastFieldSerializer, FastFieldDataAccess, FastFieldStats};
pub use self::writer::{FastFieldsWriter, IntFastFieldWriter};
use crate::schema::Cardinality;
use crate::schema::FieldType;
use crate::schema::Value;
use crate::chrono::{NaiveDateTime, Utc};
use crate::schema::{Cardinality, FieldType, Type, Value};
use crate::DocId;
use crate::{
    chrono::{NaiveDateTime, Utc},
    schema::Type,
};

mod alive_bitset;
mod bytes;
@@ -213,22 +200,20 @@ fn value_to_u64(value: &Value) -> u64 {
#[cfg(test)]
mod tests {

    use super::*;
    use crate::directory::CompositeFile;
    use crate::directory::{Directory, RamDirectory, WritePtr};
    use crate::merge_policy::NoMergePolicy;
    use crate::schema::Field;
    use crate::schema::Schema;
    use crate::schema::FAST;
    use crate::schema::{Document, IntOptions};
    use crate::{Index, SegmentId, SegmentReader};
    use std::collections::HashMap;
    use std::path::Path;

    use common::HasLen;
    use once_cell::sync::Lazy;
    use rand::prelude::SliceRandom;
    use rand::rngs::StdRng;
    use rand::SeedableRng;
    use std::collections::HashMap;
    use std::path::Path;

    use super::*;
    use crate::directory::{CompositeFile, Directory, RamDirectory, WritePtr};
    use crate::merge_policy::NoMergePolicy;
    use crate::schema::{Document, Field, IntOptions, Schema, FAST};
    use crate::{Index, SegmentId, SegmentReader};

    pub static SCHEMA: Lazy<Schema> = Lazy::new(|| {
        let mut schema_builder = Schema::builder();
@@ -407,7 +392,7 @@ mod tests {
            serializer.close().unwrap();
        }
        let file = directory.open_read(path).unwrap();
        //assert_eq!(file.len(), 17710 as usize); //bitpacked size
        // assert_eq!(file.len(), 17710 as usize); //bitpacked size
        assert_eq!(file.len(), 10175_usize); // linear interpol size
        {
            let fast_fields_composite = CompositeFile::open(&file)?;
@@ -587,16 +572,16 @@ mod tests {

#[cfg(all(test, feature = "unstable"))]
mod bench {
    use super::tests::FIELD;
    use super::tests::{generate_permutation, SCHEMA};
    use super::*;
    use crate::directory::CompositeFile;
    use crate::directory::{Directory, RamDirectory, WritePtr};
    use crate::fastfield::FastFieldReader;
    use std::collections::HashMap;
    use std::path::Path;

    use test::{self, Bencher};

    use super::tests::{generate_permutation, FIELD, SCHEMA};
    use super::*;
    use crate::directory::{CompositeFile, Directory, RamDirectory, WritePtr};
    use crate::fastfield::FastFieldReader;

    #[bench]
    fn bench_intfastfield_linear_veclookup(b: &mut Bencher) {
        let permutation = generate_permutation();

@@ -7,23 +7,17 @@ pub use self::writer::MultiValuedFastFieldWriter;
#[cfg(test)]
mod tests {

    use chrono::Duration;
    use futures::executor::block_on;
    use proptest::strategy::Strategy;
    use proptest::{prop_oneof, proptest};
    use test_log::test;

    use crate::collector::TopDocs;
    use crate::indexer::NoMergePolicy;
    use crate::query::QueryParser;
    use crate::schema::Cardinality;
    use crate::schema::Facet;
    use crate::schema::FacetOptions;
    use crate::schema::IntOptions;
    use crate::schema::Schema;
    use crate::Document;
    use crate::Index;
    use crate::Term;
    use chrono::Duration;
    use futures::executor::block_on;
    use proptest::prop_oneof;
    use proptest::proptest;
    use proptest::strategy::Strategy;
    use test_log::test;
    use crate::schema::{Cardinality, Facet, FacetOptions, IntOptions, Schema};
    use crate::{Document, Index, Term};

    #[test]
    fn test_multivalued_u64() -> crate::Result<()> {
@@ -110,7 +104,7 @@ mod tests {
            retrieved_doc
                .get_first(date_field)
                .expect("cannot find value")
                .date_value()
                .as_date()
                .unwrap()
                .timestamp(),
            first_time_stamp.timestamp()
@@ -119,7 +113,7 @@ mod tests {
            retrieved_doc
                .get_first(time_i)
                .expect("cannot find value")
                .i64_value(),
                .as_i64(),
            Some(1i64)
        );
    }
@@ -138,7 +132,7 @@ mod tests {
            retrieved_doc
                .get_first(date_field)
                .expect("cannot find value")
                .date_value()
                .as_date()
                .unwrap()
                .timestamp(),
            two_secs_ahead.timestamp()
@@ -147,7 +141,7 @@ mod tests {
            retrieved_doc
                .get_first(time_i)
                .expect("cannot find value")
                .i64_value(),
                .as_i64(),
            Some(3i64)
        );
    }
@@ -180,7 +174,7 @@ mod tests {
            retrieved_doc
                .get_first(date_field)
                .expect("cannot find value")
                .date_value()
                .as_date()
                .expect("value not of Date type")
                .timestamp(),
            (first_time_stamp + Duration::seconds(offset_sec)).timestamp()
@@ -189,7 +183,7 @@ mod tests {
            retrieved_doc
                .get_first(time_i)
                .expect("cannot find value")
                .i64_value(),
                .as_i64(),
            Some(time_i_val)
        );
    }

@@ -10,7 +10,6 @@ use crate::DocId;
/// The `vals_reader` will access the concatenated list of all
/// values for all reader.
/// The `idx_reader` associated, for each document, the index of its first value.
///
#[derive(Clone)]
pub struct MultiValuedFastFieldReader<Item: FastValue> {
    idx_reader: DynamicFastFieldReader<u64>,

@@ -1,13 +1,15 @@
use std::io;

use fnv::FnvHashMap;
use tantivy_bitpacker::minmax;

use crate::fastfield::serializer::BitpackedFastFieldSerializerLegacy;
use crate::fastfield::CompositeFastFieldSerializer;
use crate::fastfield::{value_to_u64, CompositeFastFieldSerializer};
use crate::indexer::doc_id_mapping::DocIdMapping;
use crate::postings::UnorderedTermId;
use crate::schema::{Document, Field};
use crate::termdict::TermOrdinal;
use crate::DocId;
use crate::{fastfield::value_to_u64, indexer::doc_id_mapping::DocIdMapping};
use fnv::FnvHashMap;
use std::io;
use tantivy_bitpacker::minmax;

/// Writer for multi-valued (as in, more than one value per document)
/// int fast field.
@@ -20,7 +22,8 @@ use tantivy_bitpacker::minmax;
/// - add your document simply by calling `.add_document(...)`.
///
/// The `MultiValuedFastFieldWriter` can be acquired from the
/// fastfield writer, by calling [`.get_multivalue_writer(...)`](./struct.FastFieldsWriter.html#method.get_multivalue_writer).
/// fastfield writer, by calling
/// [`.get_multivalue_writer(...)`](./struct.FastFieldsWriter.html#method.get_multivalue_writer).
///
/// Once acquired, writing is done by calling calls to
/// `.add_document_vals(&[u64])` once per document.
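From the outside this writer is driven indirectly: repeating a multi-valued field inside one document, as the multivalued tests earlier in this diff do, ends up as one `add_document_vals` group per document. A hedged usage sketch against the public API of this revision:

use tantivy::schema::{Cardinality, IntOptions, Schema};
use tantivy::{doc, Index};

fn main() -> tantivy::Result<()> {
    let mut schema_builder = Schema::builder();
    let vals_field = schema_builder.add_u64_field(
        "vals",
        IntOptions::default().set_fast(Cardinality::MultiValues),
    );
    let index = Index::create_in_ram(schema_builder.build());

    let mut writer = index.writer(3_000_000)?;
    // Repeating the field in one document becomes one group of values
    // handed to the multi-valued fast field writer for that document.
    writer.add_document(doc!(vals_field => 1u64, vals_field => 2u64))?;
    writer.commit()?;

    let searcher = index.reader()?.searcher();
    let reader = searcher.segment_reader(0).fast_fields().u64s(vals_field)?;
    let mut vals: Vec<u64> = Vec::new();
    reader.get_vals(0, &mut vals);
    assert_eq!(vals, vec![1u64, 2u64]);
    Ok(())
}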
@@ -76,7 +79,7 @@ impl MultiValuedFastFieldWriter {
        // facets are indexed in the `SegmentWriter` as we encode their unordered id.
        if !self.is_facet {
            for field_value in doc.field_values() {
                if field_value.field() == self.field {
                if field_value.field == self.field {
                    self.add_val(value_to_u64(field_value.value()));
                }
            }
@@ -131,7 +134,6 @@ impl MultiValuedFastFieldWriter {
    /// During the serialization of the segment, terms gets sorted and
    /// `tantivy` builds a mapping to convert this `UnorderedTermId` into
    /// term ordinals.
    ///
    pub fn serialize(
        &self,
        serializer: &mut CompositeFastFieldSerializer,

@@ -1,25 +1,25 @@
use super::FastValue;
use crate::directory::CompositeFile;
use crate::directory::FileSlice;
use crate::directory::OwnedBytes;
use crate::directory::{Directory, RamDirectory, WritePtr};
use crate::fastfield::{CompositeFastFieldSerializer, FastFieldsWriter};
use crate::schema::Schema;
use crate::schema::FAST;
use crate::DocId;
use common::BinarySerializable;
use fastfield_codecs::bitpacked::BitpackedFastFieldReader as BitpackedReader;
use fastfield_codecs::bitpacked::BitpackedFastFieldSerializer;
use fastfield_codecs::linearinterpol::LinearInterpolFastFieldReader;
use fastfield_codecs::linearinterpol::LinearInterpolFastFieldSerializer;
use fastfield_codecs::multilinearinterpol::MultiLinearInterpolFastFieldReader;
use fastfield_codecs::multilinearinterpol::MultiLinearInterpolFastFieldSerializer;
use fastfield_codecs::FastFieldCodecReader;
use fastfield_codecs::FastFieldCodecSerializer;
use std::collections::HashMap;
use std::marker::PhantomData;
use std::path::Path;

use common::BinarySerializable;
use fastfield_codecs::bitpacked::{
    BitpackedFastFieldReader as BitpackedReader, BitpackedFastFieldSerializer,
};
use fastfield_codecs::linearinterpol::{
    LinearInterpolFastFieldReader, LinearInterpolFastFieldSerializer,
};
use fastfield_codecs::multilinearinterpol::{
    MultiLinearInterpolFastFieldReader, MultiLinearInterpolFastFieldSerializer,
};
use fastfield_codecs::{FastFieldCodecReader, FastFieldCodecSerializer};

use super::FastValue;
use crate::directory::{CompositeFile, Directory, FileSlice, OwnedBytes, RamDirectory, WritePtr};
use crate::fastfield::{CompositeFastFieldSerializer, FastFieldsWriter};
use crate::schema::{Schema, FAST};
use crate::DocId;

/// FastFieldReader is the trait to access fast field data.
pub trait FastFieldReader<Item: FastValue>: Clone {
    /// Return the value associated to the given document.
@@ -64,7 +64,6 @@ pub trait FastFieldReader<Item: FastValue>: Clone {
#[derive(Clone)]
/// DynamicFastFieldReader wraps different readers to access
/// the various encoded fastfield data
///
pub enum DynamicFastFieldReader<Item: FastValue> {
    /// Bitpacked compressed fastfield data.
    Bitpacked(FastFieldReaderCodecWrapper<Item, BitpackedReader>),
@@ -146,7 +145,6 @@ impl<Item: FastValue> FastFieldReader<Item> for DynamicFastFieldReader<Item> {
/// Wrapper for accessing a fastfield.
///
/// Holds the data and the codec to the read the data.
///
#[derive(Clone)]
pub struct FastFieldReaderCodecWrapper<Item: FastValue, CodecReader> {
    reader: CodecReader,
@@ -162,7 +160,8 @@ impl<Item: FastValue, C: FastFieldCodecReader> FastFieldReaderCodecWrapper<Item,
        assert_eq!(
            BitpackedFastFieldSerializer::ID,
            id,
            "Tried to open fast field as bitpacked encoded (id=1), but got serializer with different id"
            "Tried to open fast field as bitpacked encoded (id=1), but got serializer with \
             different id"
        );
        Self::open_from_bytes(bytes)
    }
@@ -249,8 +248,6 @@ impl<Item: FastValue, C: FastFieldCodecReader + Clone> FastFieldReader<Item>
    }
}

pub(crate) type BitpackedFastFieldReader<Item> = FastFieldReaderCodecWrapper<Item, BitpackedReader>;

impl<Item: FastValue> From<Vec<Item>> for DynamicFastFieldReader<Item> {
    fn from(vals: Vec<Item>) -> DynamicFastFieldReader<Item> {
        let mut schema_builder = Schema::builder();

@@ -1,14 +1,12 @@
use crate::directory::CompositeFile;
use crate::directory::FileSlice;
use crate::fastfield::MultiValuedFastFieldReader;
use crate::fastfield::{BitpackedFastFieldReader, FastFieldNotAvailableError};
use crate::fastfield::{BytesFastFieldReader, FastValue};
use super::reader::DynamicFastFieldReader;
use crate::directory::{CompositeFile, FileSlice};
use crate::fastfield::{
    BytesFastFieldReader, FastFieldNotAvailableError, FastValue, MultiValuedFastFieldReader,
};
use crate::schema::{Cardinality, Field, FieldType, Schema};
use crate::space_usage::PerFieldSpaceUsage;
use crate::TantivyError;

use super::reader::DynamicFastFieldReader;

/// Provides access to all of the BitpackedFastFieldReader.
///
/// Internally, `FastFieldReaders` have preloaded fast field readers,
@@ -131,10 +129,11 @@ impl FastFieldReaders {
        self.typed_fast_field_reader(field)
    }

    /// Returns the `u64` fast field reader reader associated to `field`, regardless of whether the given
    /// field is effectively of type `u64` or not.
    /// Returns the `u64` fast field reader reader associated to `field`, regardless of whether the
    /// given field is effectively of type `u64` or not.
    ///
    /// If not, the fastfield reader will returns the u64-value associated to the original FastValue.
    /// If not, the fastfield reader will returns the u64-value associated to the original
    /// FastValue.
    pub fn u64_lenient(&self, field: Field) -> crate::Result<DynamicFastFieldReader<u64>> {
        self.typed_fast_field_reader(field)
    }
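A short sketch of what the lenient accessor buys you, under the same API assumptions as the other sketches in this diff: for an `i64` fast field, `u64_lenient` still succeeds and hands back the underlying `u64` representation of each fast value.

use tantivy::fastfield::FastFieldReader;
use tantivy::schema::{Schema, FAST};
use tantivy::{doc, Index};

fn main() -> tantivy::Result<()> {
    let mut schema_builder = Schema::builder();
    let score = schema_builder.add_i64_field("score", FAST);
    let index = Index::create_in_ram(schema_builder.build());

    let mut writer = index.writer(3_000_000)?;
    writer.add_document(doc!(score => -1i64))?;
    writer.commit()?;

    let searcher = index.reader()?.searcher();
    let fast_fields = searcher.segment_reader(0).fast_fields();
    let typed = fast_fields.i64(score)?;
    assert_eq!(typed.get(0), -1i64);
    // Works even though the field is declared i64; yields the raw u64 form.
    let lenient = fast_fields.u64_lenient(score)?;
    let _raw: u64 = lenient.get(0);
    Ok(())
}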
@@ -171,8 +170,8 @@ impl FastFieldReaders {
        self.typed_fast_field_multi_reader(field)
    }

    /// Returns a `u64s` multi-valued fast field reader reader associated to `field`, regardless of whether the given
    /// field is effectively of type `u64` or not.
    /// Returns a `u64s` multi-valued fast field reader reader associated to `field`, regardless of
    /// whether the given field is effectively of type `u64` or not.
    ///
    /// If `field` is not a u64 multi-valued fast field, this method returns an Error.
    pub fn u64s_lenient(&self, field: Field) -> crate::Result<MultiValuedFastFieldReader<u64>> {
@@ -219,7 +218,7 @@ impl FastFieldReaders {
            )));
        }
        let fast_field_idx_file = self.fast_field_data(field, 0)?;
        let idx_reader = BitpackedFastFieldReader::open(fast_field_idx_file)?;
        let idx_reader = DynamicFastFieldReader::open(fast_field_idx_file)?;
        let data = self.fast_field_data(field, 1)?;
        BytesFastFieldReader::open(idx_reader, data)
    } else {

@@ -1,16 +1,15 @@
use crate::directory::CompositeWrite;
use crate::directory::WritePtr;
use crate::schema::Field;
use common::BinarySerializable;
use common::CountingWriter;
pub use fastfield_codecs::bitpacked::BitpackedFastFieldSerializer;
pub use fastfield_codecs::bitpacked::BitpackedFastFieldSerializerLegacy;
use std::io::{self, Write};

use common::{BinarySerializable, CountingWriter};
pub use fastfield_codecs::bitpacked::{
    BitpackedFastFieldSerializer, BitpackedFastFieldSerializerLegacy,
};
use fastfield_codecs::linearinterpol::LinearInterpolFastFieldSerializer;
use fastfield_codecs::multilinearinterpol::MultiLinearInterpolFastFieldSerializer;
pub use fastfield_codecs::FastFieldCodecSerializer;
pub use fastfield_codecs::FastFieldDataAccess;
pub use fastfield_codecs::FastFieldStats;
use std::io::{self, Write};
pub use fastfield_codecs::{FastFieldCodecSerializer, FastFieldDataAccess, FastFieldStats};

use crate::directory::{CompositeWrite, WritePtr};
use crate::schema::Field;

/// `CompositeFastFieldSerializer` is in charge of serializing
/// fastfields on disk.
@@ -58,7 +57,8 @@ impl CompositeFastFieldSerializer {
        Ok(CompositeFastFieldSerializer { composite_write })
    }

    /// Serialize data into a new u64 fast field. The best compression codec will be chosen automatically.
    /// Serialize data into a new u64 fast field. The best compression codec will be chosen
    /// automatically.
    pub fn create_auto_detect_u64_fast_field(
        &mut self,
        field: Field,
@@ -76,7 +76,8 @@ impl CompositeFastFieldSerializer {
            0,
        )
    }
    /// Serialize data into a new u64 fast field. The best compression codec will be chosen automatically.
    /// Serialize data into a new u64 fast field. The best compression codec will be chosen
    /// automatically.
    pub fn create_auto_detect_u64_fast_field_with_idx(
        &mut self,
        field: Field,
@@ -112,7 +113,8 @@ impl CompositeFastFieldSerializer {
                broken_estimation.1
            );
        }
        // removing nan values for codecs with broken calculations, and max values which disables codecs
        // removing nan values for codecs with broken calculations, and max values which disables
        // codecs
        estimations.retain(|estimation| !estimation.0.is_nan() && estimation.0 != f32::MAX);
        estimations.sort_by(|a, b| a.0.partial_cmp(&b.0).unwrap());
        let (_ratio, name, id) = estimations[0];
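The retain/sort lines above are the whole codec-selection policy: ask each codec for an estimated compression ratio, drop broken estimates (NaN, or `f32::MAX` meaning the codec is disabled for this data), and take the smallest. A standalone sketch of that decision, with hypothetical codec names:

// Pick the codec with the best (smallest) estimated compression ratio,
// skipping estimates that are NaN or f32::MAX (codec not applicable).
fn pick_codec(mut estimations: Vec<(f32, &'static str)>) -> Option<&'static str> {
    estimations.retain(|(ratio, _)| !ratio.is_nan() && *ratio != f32::MAX);
    estimations.sort_by(|a, b| a.0.partial_cmp(&b.0).unwrap());
    estimations.first().map(|(_, name)| *name)
}

fn main() {
    let estimations = vec![
        (0.33, "bitpacked"),
        (f32::NAN, "linear-interpol"), // broken estimate: dropped
        (0.18, "multi-linear-interpol"),
        (f32::MAX, "unsupported"), // codec disabled for this data
    ];
    assert_eq!(pick_codec(estimations), Some("multi-linear-interpol"));
}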
@@ -1,3 +1,10 @@
use std::collections::HashMap;
use std::io;

use common;
use fnv::FnvHashMap;
use tantivy_bitpacker::BlockedBitpacker;

use super::multivalued::MultiValuedFastFieldWriter;
use super::serializer::FastFieldStats;
use super::FastFieldDataAccess;
@@ -6,11 +13,6 @@ use crate::indexer::doc_id_mapping::DocIdMapping;
use crate::postings::UnorderedTermId;
use crate::schema::{Cardinality, Document, Field, FieldEntry, FieldType, Schema};
use crate::termdict::TermOrdinal;
use common;
use fnv::FnvHashMap;
use std::collections::HashMap;
use std::io;
use tantivy_bitpacker::BlockedBitpacker;

/// The fastfieldswriter regroup all of the fast field writers.
pub struct FastFieldsWriter {
@@ -324,7 +326,8 @@ struct WriterFastFieldAccessProvider<'map, 'bitp> {
impl<'map, 'bitp> FastFieldDataAccess for WriterFastFieldAccessProvider<'map, 'bitp> {
    /// Return the value associated to the given doc.
    ///
    /// Whenever possible use the Iterator passed to the fastfield creation instead, for performance reasons.
    /// Whenever possible use the Iterator passed to the fastfield creation instead, for performance
    /// reasons.
    ///
    /// # Panics
    ///
@@ -332,7 +335,9 @@ impl<'map, 'bitp> FastFieldDataAccess for WriterFastFieldAccessProvider<'map, 'b
    fn get_val(&self, doc: u64) -> u64 {
        if let Some(doc_id_map) = self.doc_id_map {
            self.vals
                .get(doc_id_map.get_old_doc_id(doc as u32) as usize) // consider extra FastFieldReader wrapper for non doc_id_map
                .get(doc_id_map.get_old_doc_id(doc as u32) as usize) // consider extra
                                                                     // FastFieldReader wrapper for
                                                                     // non doc_id_map
        } else {
            self.vals.get(doc as usize)
        }
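The branch above is just an indirection through an old-to-new doc id mapping. A minimal standalone sketch (the mapping type here is hypothetical, not tantivy's `DocIdMapping`):

// When the segment is sorted, doc ids are remapped: the accessor receives a
// *new* doc id and must look the value up under the *old* one.
struct DocIdMap {
    new_to_old: Vec<u32>,
}

impl DocIdMap {
    fn get_old_doc_id(&self, new_doc_id: u32) -> u32 {
        self.new_to_old[new_doc_id as usize]
    }
}

fn get_val(vals: &[u64], doc_id_map: Option<&DocIdMap>, doc: u64) -> u64 {
    if let Some(map) = doc_id_map {
        vals[map.get_old_doc_id(doc as u32) as usize]
    } else {
        vals[doc as usize]
    }
}

fn main() {
    let vals = [10, 20, 30];
    let map = DocIdMap { new_to_old: vec![2, 0, 1] };
    assert_eq!(get_val(&vals, Some(&map), 0), 30); // new doc 0 was old doc 2
    assert_eq!(get_val(&vals, None, 0), 10);
}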
@@ -21,32 +21,24 @@ mod reader;
mod serializer;
mod writer;

use self::code::{fieldnorm_to_id, id_to_fieldnorm};
pub use self::reader::{FieldNormReader, FieldNormReaders};
pub use self::serializer::FieldNormsSerializer;
pub use self::writer::FieldNormsWriter;

use self::code::{fieldnorm_to_id, id_to_fieldnorm};

#[cfg(test)]
mod tests {
    use crate::directory::CompositeFile;
    use crate::directory::{Directory, RamDirectory, WritePtr};
    use crate::fieldnorm::FieldNormReader;
    use crate::fieldnorm::FieldNormsSerializer;
    use crate::fieldnorm::FieldNormsWriter;
    use crate::query::Query;
    use crate::query::TermQuery;
    use crate::schema::IndexRecordOption;
    use crate::schema::TextFieldIndexing;
    use crate::schema::TextOptions;
    use crate::schema::TEXT;
    use crate::Index;
    use crate::Term;
    use crate::TERMINATED;
    use once_cell::sync::Lazy;
    use std::path::Path;

    use crate::schema::{Field, Schema, STORED};
    use once_cell::sync::Lazy;

    use crate::directory::{CompositeFile, Directory, RamDirectory, WritePtr};
    use crate::fieldnorm::{FieldNormReader, FieldNormsSerializer, FieldNormsWriter};
    use crate::query::{Query, TermQuery};
    use crate::schema::{
        Field, IndexRecordOption, Schema, TextFieldIndexing, TextOptions, STORED, TEXT,
    };
    use crate::{Index, Term, TERMINATED};

    pub static SCHEMA: Lazy<Schema> = Lazy::new(|| {
        let mut schema_builder = Schema::builder();
@@ -87,7 +79,7 @@ mod tests {
        fieldnorm_writers.record(3u32, *TXT_FIELD, 3);
        fieldnorm_writers.serialize(serializer, None)?;
    }
    let file = directory.open_read(&path)?;
    let file = directory.open_read(path)?;
    {
        let fields_composite = CompositeFile::open(&file)?;
        assert!(fields_composite.open_read(*FIELD).is_none());

@@ -1,11 +1,10 @@
use std::sync::Arc;

use super::{fieldnorm_to_id, id_to_fieldnorm};
use crate::directory::CompositeFile;
use crate::directory::FileSlice;
use crate::directory::OwnedBytes;
use crate::directory::{CompositeFile, FileSlice, OwnedBytes};
use crate::schema::Field;
use crate::space_usage::PerFieldSpaceUsage;
use crate::DocId;
use std::sync::Arc;

/// Reader for the fieldnorm (for each document, the number of tokens indexed in the
/// field) of all indexed fields in the index.

@@ -1,9 +1,9 @@
use crate::directory::CompositeWrite;
use crate::directory::WritePtr;
use crate::schema::Field;
use std::io;
use std::io::Write;

use crate::directory::{CompositeWrite, WritePtr};
use crate::schema::Field;

/// The fieldnorms serializer is in charge of
/// the serialization of field norms for all fields.
pub struct FieldNormsSerializer {

@@ -1,12 +1,11 @@
use crate::{indexer::doc_id_mapping::DocIdMapping, DocId};

use super::fieldnorm_to_id;
use super::FieldNormsSerializer;
use crate::schema::Field;
use crate::schema::Schema;
use std::cmp::Ordering;
use std::{io, iter};

use super::{fieldnorm_to_id, FieldNormsSerializer};
use crate::indexer::doc_id_mapping::DocIdMapping;
use crate::schema::{Field, Schema};
use crate::DocId;

/// The `FieldNormsWriter` is in charge of tracking the fieldnorm byte
/// of each document for each field with field norms.
///

@@ -1,14 +1,10 @@
use crate::schema;
use crate::Index;
use crate::IndexSettings;
use crate::IndexSortByField;
use crate::Order;
use crate::Searcher;
use crate::{doc, schema::*};
use rand::thread_rng;
use rand::Rng;
use std::collections::HashSet;

use rand::{thread_rng, Rng};

use crate::schema::*;
use crate::{doc, schema, Index, IndexSettings, IndexSortByField, Order, Searcher};

fn check_index_content(searcher: &Searcher, vals: &[u64]) -> crate::Result<()> {
    assert!(searcher.segment_readers().len() < 20);
    assert_eq!(searcher.num_docs() as usize, vals.len());
@@ -130,14 +126,12 @@ fn test_functional_indexing_sorted() -> crate::Result<()> {
    Ok(())
}

const LOREM: &str = "Doc Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed \
                     do eiusmod tempor incididunt ut labore et dolore magna aliqua. \
                     Ut enim ad minim veniam, quis nostrud exercitation ullamco \
                     laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure \
                     dolor in reprehenderit in voluptate velit esse cillum dolore eu \
                     fugiat nulla pariatur. Excepteur sint occaecat cupidatat non \
                     proident, sunt in culpa qui officia deserunt mollit anim id est \
                     laborum.";
const LOREM: &str = "Doc Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod \
                     tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, \
                     quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo \
                     consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse \
                     cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat \
                     non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.";
fn get_text() -> String {
    use rand::seq::SliceRandom;
    let mut rng = thread_rng();

@@ -1,9 +1,9 @@
use super::operation::DeleteOperation;
use crate::Opstamp;

use std::ops::DerefMut;
use std::sync::{Arc, RwLock, Weak};

use super::operation::DeleteOperation;
use crate::Opstamp;

// The DeleteQueue is similar in conceptually to a multiple
// consumer single producer broadcast channel.
//
@@ -13,12 +13,10 @@ use std::sync::{Arc, RwLock, Weak};
// which points to a specific place of the `DeleteQueue`.
//
// New consumer can be created in two ways
// - calling `delete_queue.cursor()` returns a cursor, that
//   will include all future delete operation (and some or none
//   of the past operations... The client is in charge of checking the opstamps.).
// - cloning an existing cursor returns a new cursor, that
//   is at the exact same position, and can now advance independently
//   from the original cursor.
// - calling `delete_queue.cursor()` returns a cursor, that will include all future delete operation
//   (and some or none of the past operations... The client is in charge of checking the opstamps.).
// - cloning an existing cursor returns a new cursor, that is at the exact same position, and can
//   now advance independently from the original cursor.
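The cursor semantics described in this comment are easiest to see on a toy model: the queue is an append-only log, and a cursor owns nothing but a read position, so cloning one forks an independent reader at the same spot. A sketch, not the actual implementation:

use std::sync::{Arc, RwLock};

// Toy model of the DeleteQueue: a shared, append-only log of operations,
// plus cursors that each track their own read position.
#[derive(Default)]
struct Queue {
    ops: RwLock<Vec<u64>>, // stand-in for DeleteOperation, keyed by opstamp
}

#[derive(Clone)]
struct Cursor {
    queue: Arc<Queue>,
    pos: usize,
}

impl Cursor {
    fn get(&mut self) -> Option<u64> {
        let ops = self.queue.ops.read().unwrap();
        let op = ops.get(self.pos).copied();
        if op.is_some() {
            self.pos += 1;
        }
        op
    }
}

fn main() {
    let queue = Arc::new(Queue::default());
    queue.ops.write().unwrap().extend([1, 2, 3]);

    let mut a = Cursor { queue: Arc::clone(&queue), pos: 0 };
    assert_eq!(a.get(), Some(1));
    let mut b = a.clone(); // forked at the same position...
    assert_eq!(a.get(), Some(2));
    assert_eq!(b.get(), Some(2)); // ...and advancing independently
}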
#[derive(Default)]
struct InnerDeleteQueue {
    writer: Vec<DeleteOperation>,
@@ -179,8 +177,8 @@ pub struct DeleteCursor {

impl DeleteCursor {
    /// Skips operations and position it so that
    /// - either all of the delete operation currently in the
    ///   queue are consume and the next get will return None.
    /// - either all of the delete operation currently in the queue are consume and the next get
    ///   will return None.
    /// - the next get will return the first operation with an
    ///   `opstamp >= target_opstamp`.
    pub fn skip_to(&mut self, target_opstamp: Opstamp) {

@@ -5,8 +5,8 @@ use crate::fastfield::AliveBitSet;
use crate::{merge_filtered_segments, Directory, Index, IndexSettings, Segment, SegmentOrdinal};
/// DemuxMapping can be used to reorganize data from multiple segments.
///
/// DemuxMapping is useful in a multitenant settings, in which each document might actually belong to a different tenant.
/// It allows to reorganize documents as follows:
/// DemuxMapping is useful in a multitenant settings, in which each document might actually belong
/// to a different tenant. It allows to reorganize documents as follows:
///
/// e.g. if you have two tenant ids TENANT_A and TENANT_B and two segments with
/// the documents (simplified)
@@ -18,7 +18,8 @@ use crate::{merge_filtered_segments, Directory, Index, IndexSettings, Segment, S
/// Seg 2 [TENANT_B, TENANT_B]
///
/// Demuxing is the tool for that.
/// Semantically you can define a mapping from [old segment ordinal, old doc_id] -> [new segment ordinal].
/// Semantically you can define a mapping from [old segment ordinal, old doc_id] -> [new segment
/// ordinal].
#[derive(Debug, Default)]
pub struct DemuxMapping {
    /// [index old segment ordinal] -> [index doc_id] = new segment ordinal
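As a standalone picture of the data structure (a toy re-implementation, not tantivy's): the mapping is one vector per old segment, indexed by old doc id and holding the new segment ordinal, mirroring the `set`/`add` calls in the tests below.

// Toy model: mapping[old_segment][old_doc_id] = new segment ordinal.
struct DemuxMappingModel {
    mappings: Vec<Vec<u32>>,
}

impl DemuxMappingModel {
    fn new() -> Self {
        DemuxMappingModel { mappings: Vec::new() }
    }
    fn add(&mut self, doc_id_to_segment: Vec<u32>) {
        self.mappings.push(doc_id_to_segment);
    }
    fn target_segment(&self, old_segment: usize, old_doc_id: usize) -> u32 {
        self.mappings[old_segment][old_doc_id]
    }
}

fn main() {
    // Two old segments of two docs each, demuxed into two tenant segments.
    let mut mapping = DemuxMappingModel::new();
    mapping.add(vec![1, 0]); // old segment 0: doc 0 -> new seg 1, doc 1 -> new seg 0
    mapping.add(vec![1, 1]); // old segment 1: both docs -> new seg 1
    assert_eq!(mapping.target_segment(0, 1), 0);
    assert_eq!(mapping.target_segment(1, 0), 1);
}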
@@ -132,27 +133,24 @@ pub fn demux(

#[cfg(test)]
mod tests {
    use crate::{
        collector::TopDocs,
        directory::RamDirectory,
        query::QueryParser,
        schema::{Schema, TEXT},
        DocAddress, Term,
    };

    use super::*;
    use crate::collector::TopDocs;
    use crate::directory::RamDirectory;
    use crate::query::QueryParser;
    use crate::schema::{Schema, TEXT};
    use crate::{DocAddress, Term};

    #[test]
    fn test_demux_map_to_deletebitset() {
        let max_value = 2;
        let mut demux_mapping = DemuxMapping::default();
        //segment ordinal 0 mapping
        // segment ordinal 0 mapping
        let mut doc_id_to_segment = DocIdToSegmentOrdinal::with_max_doc(max_value);
        doc_id_to_segment.set(0, 1);
        doc_id_to_segment.set(1, 0);
        demux_mapping.add(doc_id_to_segment);

        //segment ordinal 1 mapping
        // segment ordinal 1 mapping
        let mut doc_id_to_segment = DocIdToSegmentOrdinal::with_max_doc(max_value);
        doc_id_to_segment.set(0, 1);
        doc_id_to_segment.set(1, 1);
@@ -235,13 +233,13 @@ mod tests {
        let mut demux_mapping = DemuxMapping::default();
        {
            let max_value = 2;
            //segment ordinal 0 mapping
            // segment ordinal 0 mapping
            let mut doc_id_to_segment = DocIdToSegmentOrdinal::with_max_doc(max_value);
            doc_id_to_segment.set(0, 1);
            doc_id_to_segment.set(1, 0);
            demux_mapping.add(doc_id_to_segment);

            //segment ordinal 1 mapping
            // segment ordinal 1 mapping
            let mut doc_id_to_segment = DocIdToSegmentOrdinal::with_max_doc(max_value);
            doc_id_to_segment.set(0, 1);
            doc_id_to_segment.set(1, 1);
@@ -274,7 +272,7 @@ mod tests {
        let text_field = index.schema().get_field("text").unwrap();

        let do_search = |term: &str| {
            let query = QueryParser::for_index(&index, vec![text_field])
            let query = QueryParser::for_index(index, vec![text_field])
                .parse_query(term)
                .unwrap();
            let top_docs: Vec<(f32, DocAddress)> =
@@ -303,7 +301,7 @@ mod tests {
        let text_field = index.schema().get_field("text").unwrap();

        let do_search = |term: &str| {
            let query = QueryParser::for_index(&index, vec![text_field])
            let query = QueryParser::for_index(index, vec![text_field])
                .parse_query(term)
                .unwrap();
            let top_docs: Vec<(f32, DocAddress)> =

@@ -1,13 +1,12 @@
//! This module is used when sorting the index by a property, e.g.
//! to get mappings from old doc_id to new doc_id and vice versa, after sorting
//!
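The two directions mentioned in the module doc are inverses of each other: sorting produces a new-to-old vector, and the old-to-new mapping falls out by inverting it. A minimal sketch:

// Sorting yields new_to_old: position = new doc id, value = old doc id.
// Inverting it gives old_to_new for remapping postings and fast fields.
fn invert(new_to_old: &[u32]) -> Vec<u32> {
    let mut old_to_new = vec![0u32; new_to_old.len()];
    for (new_doc, &old_doc) in new_to_old.iter().enumerate() {
        old_to_new[old_doc as usize] = new_doc as u32;
    }
    old_to_new
}

fn main() {
    let new_to_old = [2u32, 0, 1];
    assert_eq!(invert(&new_to_old), vec![1, 2, 0]);
}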
use std::cmp::Reverse;
use std::ops::Index;

use super::SegmentWriter;
use crate::{
    schema::{Field, Schema},
    DocId, IndexSortByField, Order, SegmentOrdinal, TantivyError,
};
use std::{cmp::Reverse, ops::Index};
use crate::schema::{Field, Schema};
use crate::{DocId, IndexSortByField, Order, SegmentOrdinal, TantivyError};

/// Struct to provide mapping from new doc_id to old doc_id and segment.
#[derive(Clone)]
@@ -152,11 +151,12 @@ pub(crate) fn get_doc_id_mapping_from_field(

#[cfg(test)]
mod tests_indexsorting {
    use crate::collector::TopDocs;
    use crate::fastfield::FastFieldReader;
    use crate::indexer::doc_id_mapping::DocIdMapping;
    use crate::{collector::TopDocs, query::QueryParser, schema::*};
    use crate::{schema::Schema, DocAddress};
    use crate::{Index, IndexSettings, IndexSortByField, Order};
    use crate::query::QueryParser;
    use crate::schema::{Schema, *};
    use crate::{DocAddress, Index, IndexSettings, IndexSortByField, Order};

    fn create_test_index(
        index_settings: Option<IndexSettings>,
@@ -217,7 +217,7 @@ mod tests_indexsorting {
        ];

        for option in options {
            //let options = get_text_options();
            // let options = get_text_options();
            // no index_sort
            let index = create_test_index(None, option.clone())?;
            let my_text_field = index.schema().get_field("text_field").unwrap();
@@ -318,7 +318,7 @@ mod tests_indexsorting {
                .doc(DocAddress::new(0, 3))?
                .get_first(my_string_field)
                .unwrap()
                .text(),
                .as_text(),
            Some("blublub")
        );
    }
@@ -341,7 +341,7 @@ mod tests_indexsorting {
                .doc(DocAddress::new(0, 0))?
                .get_first(my_string_field)
                .unwrap()
                .text(),
                .as_text(),
            Some("blublub")
        );
        let doc = searcher.doc(DocAddress::new(0, 4))?;
@@ -363,7 +363,7 @@ mod tests_indexsorting {
        {
            let doc = searcher.doc(DocAddress::new(0, 4))?;
            assert_eq!(
                doc.get_first(my_string_field).unwrap().text(),
                doc.get_first(my_string_field).unwrap().as_text(),
                Some("blublub")
            );
        }

@@ -1,5 +1,4 @@
use crate::DocId;
use crate::Opstamp;
use crate::{DocId, Opstamp};

// Doc to opstamp is used to identify which
// document should be deleted.

@@ -1,14 +1,19 @@
use std::ops::Range;
use std::sync::Arc;
use std::thread;
use std::thread::JoinHandle;

use common::BitSet;
use crossbeam::channel;
use futures::executor::block_on;
use futures::future::Future;
use smallvec::smallvec;

use super::operation::{AddOperation, UserOperation};
use super::segment_updater::SegmentUpdater;
use super::PreparedCommit;
use crate::core::Index;
use crate::core::Segment;
use crate::core::SegmentComponent;
use crate::core::SegmentId;
use crate::core::SegmentMeta;
use crate::core::SegmentReader;
use crate::directory::TerminatingWrite;
use crate::directory::{DirectoryLock, GarbageCollectionResult};
use super::{AddBatch, AddBatchReceiver, AddBatchSender, PreparedCommit};
use crate::core::{Index, Segment, SegmentComponent, SegmentId, SegmentMeta, SegmentReader};
use crate::directory::{DirectoryLock, GarbageCollectionResult, TerminatingWrite};
use crate::docset::{DocSet, TERMINATED};
use crate::error::TantivyError;
use crate::fastfield::write_alive_bitset;
@@ -17,32 +22,17 @@ use crate::indexer::doc_opstamp_mapping::DocToOpstampMapping;
use crate::indexer::index_writer_status::IndexWriterStatus;
use crate::indexer::operation::DeleteOperation;
use crate::indexer::stamper::Stamper;
use crate::indexer::MergePolicy;
use crate::indexer::SegmentEntry;
use crate::indexer::SegmentWriter;
use crate::schema::Document;
use crate::schema::IndexRecordOption;
use crate::schema::Term;
use crate::indexer::{MergePolicy, SegmentEntry, SegmentWriter};
use crate::schema::{Document, IndexRecordOption, Term};
use crate::Opstamp;
use common::BitSet;
use crossbeam::channel;
use futures::executor::block_on;
use futures::future::Future;
use smallvec::smallvec;
use std::ops::Range;
use std::sync::Arc;
use std::thread;
use std::thread::JoinHandle;

use super::{AddBatch, AddBatchReceiver, AddBatchSender};

// Size of the margin for the heap. A segment is closed when the remaining memory
// in the heap goes below MARGIN_IN_BYTES.
// Size of the margin for the `memory_arena`. A segment is closed when the remaining memory
// in the `memory_arena` goes below MARGIN_IN_BYTES.
pub const MARGIN_IN_BYTES: usize = 1_000_000;

// We impose the memory per thread to be at least 3 MB.
pub const HEAP_SIZE_MIN: usize = ((MARGIN_IN_BYTES as u32) * 3u32) as usize;
pub const HEAP_SIZE_MAX: usize = u32::max_value() as usize - MARGIN_IN_BYTES;
pub const MEMORY_ARENA_NUM_BYTES_MIN: usize = ((MARGIN_IN_BYTES as u32) * 3u32) as usize;
pub const MEMORY_ARENA_NUM_BYTES_MAX: usize = u32::max_value() as usize - MARGIN_IN_BYTES;

// We impose the number of index writter thread to be at most this.
pub const MAX_NUM_THREAD: usize = 8;
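The renamed constants keep the same arithmetic: the per-thread arena must be at least three margins (3 MB) and strictly below `u32::MAX` minus one margin, presumably because the arena is addressed with 32-bit offsets. A sketch of the bound check using the values above:

const MARGIN_IN_BYTES: usize = 1_000_000;
const MEMORY_ARENA_NUM_BYTES_MIN: usize = ((MARGIN_IN_BYTES as u32) * 3u32) as usize;
const MEMORY_ARENA_NUM_BYTES_MAX: usize = u32::MAX as usize - MARGIN_IN_BYTES;

fn validate_arena_budget(num_bytes: usize) -> Result<(), String> {
    if num_bytes < MEMORY_ARENA_NUM_BYTES_MIN {
        return Err(format!(
            "The memory arena in bytes per thread needs to be at least {}.",
            MEMORY_ARENA_NUM_BYTES_MIN
        ));
    }
    if num_bytes >= MEMORY_ARENA_NUM_BYTES_MAX {
        return Err(format!(
            "The memory arena in bytes per thread cannot exceed {}",
            MEMORY_ARENA_NUM_BYTES_MAX
        ));
    }
    Ok(())
}

fn main() {
    assert!(validate_arena_budget(2_000_000).is_err()); // below the 3 MB floor
    assert!(validate_arena_budget(15_000_000).is_ok());
}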
@@ -71,7 +61,7 @@ pub struct IndexWriter {

    index: Index,

    heap_size_in_bytes_per_thread: usize,
    memory_arena_in_bytes_per_thread: usize,

    workers_join_handle: Vec<JoinHandle<crate::Result<()>>>,

@@ -189,10 +179,10 @@ fn index_documents(
) -> crate::Result<()> {
    let schema = segment.schema();

    let mut segment_writer = SegmentWriter::for_segment(memory_budget, segment.clone(), &schema)?;
    let mut segment_writer = SegmentWriter::for_segment(memory_budget, segment.clone(), schema)?;
    for document_group in grouped_document_iterator {
        for doc in document_group {
            segment_writer.add_document(doc, &schema)?;
            segment_writer.add_document(doc)?;
        }
        let mem_usage = segment_writer.mem_usage();
        if mem_usage >= memory_budget - MARGIN_IN_BYTES {
@@ -278,22 +268,26 @@ impl IndexWriter {
    /// should work at the same time.
    /// # Errors
    /// If the lockfile already exists, returns `Error::FileAlreadyExists`.
    /// If the heap size per thread is too small or too big, returns `TantivyError::InvalidArgument`
    /// If the memory arena per thread is too small or too big, returns
    /// `TantivyError::InvalidArgument`
    pub(crate) fn new(
        index: &Index,
        num_threads: usize,
        heap_size_in_bytes_per_thread: usize,
        memory_arena_in_bytes_per_thread: usize,
        directory_lock: DirectoryLock,
    ) -> crate::Result<IndexWriter> {
        if heap_size_in_bytes_per_thread < HEAP_SIZE_MIN {
        if memory_arena_in_bytes_per_thread < MEMORY_ARENA_NUM_BYTES_MIN {
            let err_msg = format!(
                "The heap size per thread needs to be at least {}.",
                HEAP_SIZE_MIN
                "The memory arena in bytes per thread needs to be at least {}.",
                MEMORY_ARENA_NUM_BYTES_MIN
            );
            return Err(TantivyError::InvalidArgument(err_msg));
        }
        if heap_size_in_bytes_per_thread >= HEAP_SIZE_MAX {
            let err_msg = format!("The heap size per thread cannot exceed {}", HEAP_SIZE_MAX);
        if memory_arena_in_bytes_per_thread >= MEMORY_ARENA_NUM_BYTES_MAX {
            let err_msg = format!(
                "The memory arena in bytes per thread cannot exceed {}",
                MEMORY_ARENA_NUM_BYTES_MAX
            );
            return Err(TantivyError::InvalidArgument(err_msg));
        }
        let (document_sender, document_receiver): (AddBatchSender, AddBatchReceiver) =
@@ -311,7 +305,7 @@ impl IndexWriter {
        let mut index_writer = IndexWriter {
            _directory_lock: Some(directory_lock),

            heap_size_in_bytes_per_thread,
            memory_arena_in_bytes_per_thread,
            index: index.clone(),

            index_writer_status: IndexWriterStatus::from(document_receiver),
@@ -392,7 +386,13 @@ impl IndexWriter {
    fn operation_receiver(&self) -> crate::Result<AddBatchReceiver> {
        self.index_writer_status
            .operation_receiver()
            .ok_or_else(|| crate::TantivyError::ErrorInThread("The index writer was killed. It can happen if an indexing worker encounterred an Io error for instance.".to_string()))
            .ok_or_else(|| {
                crate::TantivyError::ErrorInThread(
                    "The index writer was killed. It can happen if an indexing worker \
                     encounterred an Io error for instance."
                        .to_string(),
                )
            })
    }

    /// Spawns a new worker thread for indexing.
@@ -405,7 +405,7 @@ impl IndexWriter {

        let mut delete_cursor = self.delete_queue.cursor();

        let mem_budget = self.heap_size_in_bytes_per_thread;
        let mem_budget = self.memory_arena_in_bytes_per_thread;
        let index = self.index.clone();
        let join_handle: JoinHandle<crate::Result<()>> = thread::Builder::new()
            .name(format!("thrd-tantivy-index{}", self.worker_id))
@@ -564,7 +564,7 @@ impl IndexWriter {
        let new_index_writer: IndexWriter = IndexWriter::new(
            &self.index,
            self.num_threads,
            self.heap_size_in_bytes_per_thread,
            self.memory_arena_in_bytes_per_thread,
            directory_lock,
        )?;

@@ -653,7 +653,6 @@ impl IndexWriter {
    ///
    /// Commit returns the `opstamp` of the last document
    /// that made it in the commit.
    ///
    pub fn commit(&mut self) -> crate::Result<Opstamp> {
        self.prepare_commit()?.commit()
    }
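A short usage sketch of the commit flow above, under the same era-of-API assumptions as the other sketches in this diff: `commit` is shorthand for `prepare_commit()?.commit()` and returns the opstamp of the last operation the commit covers.

use tantivy::schema::{Schema, TEXT};
use tantivy::{doc, Index};

fn main() -> tantivy::Result<()> {
    let mut schema_builder = Schema::builder();
    let body = schema_builder.add_text_field("body", TEXT);
    let index = Index::create_in_ram(schema_builder.build());

    let mut writer = index.writer(3_000_000)?;
    writer.add_document(doc!(body => "hello"))?;

    // One-liner equivalent of writer.prepare_commit()?.commit()
    let opstamp = writer.commit()?;
    println!("last committed opstamp: {}", opstamp);
    Ok(())
}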
@@ -780,8 +779,7 @@ impl Drop for IndexWriter {

#[cfg(test)]
mod tests {
    use std::collections::HashMap;
    use std::collections::HashSet;
    use std::collections::{HashMap, HashSet};

    use futures::executor::block_on;
    use proptest::prelude::*;
@@ -794,31 +792,20 @@ mod tests {
    use crate::error::*;
    use crate::fastfield::FastFieldReader;
    use crate::indexer::NoMergePolicy;
    use crate::query::QueryParser;
    use crate::query::TermQuery;
    use crate::schema::Cardinality;
    use crate::schema::Facet;
    use crate::schema::FacetOptions;
    use crate::schema::IntOptions;
    use crate::schema::TextFieldIndexing;
    use crate::schema::TextOptions;
    use crate::schema::STORED;
    use crate::schema::TEXT;
    use crate::schema::{self, IndexRecordOption, FAST, INDEXED, STRING};
    use crate::DocAddress;
    use crate::Index;
    use crate::ReloadPolicy;
    use crate::Term;
    use crate::{IndexSettings, IndexSortByField, Order};
    use crate::query::{QueryParser, TermQuery};
    use crate::schema::{
        self, Cardinality, Facet, FacetOptions, IndexRecordOption, IntOptions, TextFieldIndexing,
        TextOptions, FAST, INDEXED, STORED, STRING, TEXT,
    };
    use crate::{DocAddress, Index, IndexSettings, IndexSortByField, Order, ReloadPolicy, Term};

    const LOREM: &str = "Doc Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed \
                         do eiusmod tempor incididunt ut labore et dolore magna aliqua. \
                         Ut enim ad minim veniam, quis nostrud exercitation ullamco \
                         laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure \
                         dolor in reprehenderit in voluptate velit esse cillum dolore eu \
                         fugiat nulla pariatur. Excepteur sint occaecat cupidatat non \
                         proident, sunt in culpa qui officia deserunt mollit anim id est \
                         laborum.";
    const LOREM: &str = "Doc Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do \
                         eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad \
                         minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip \
                         ex ea commodo consequat. Duis aute irure dolor in reprehenderit in \
                         voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur \
                         sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt \
                         mollit anim id est laborum.";

    #[test]
    fn test_operations_group() {
@@ -973,8 +960,8 @@ mod tests {
        let index_writer = index.writer(3_000_000).unwrap();
        assert_eq!(
            format!("{:?}", index_writer.get_merge_policy()),
            "LogMergePolicy { min_num_segments: 8, max_docs_before_merge: 10000000, min_layer_size: 10000, \
             level_log_size: 0.75, del_docs_ratio_before_merge: 1.0 }"
            "LogMergePolicy { min_num_segments: 8, max_docs_before_merge: 10000000, \
             min_layer_size: 10000, level_log_size: 0.75, del_docs_ratio_before_merge: 1.0 }"
        );
        let merge_policy = Box::new(NoMergePolicy::default());
        index_writer.set_merge_policy(merge_policy);
@@ -1402,6 +1389,7 @@ mod tests {
    ) -> crate::Result<()> {
        let mut schema_builder = schema::Schema::builder();
        let id_field = schema_builder.add_u64_field("id", FAST | INDEXED | STORED);
        let bytes_field = schema_builder.add_bytes_field("bytes", FAST | INDEXED | STORED);
        let text_field = schema_builder.add_text_field(
            "text_field",
            TextOptions::default()
@@ -1448,8 +1436,14 @@ mod tests {
            match op {
                IndexingOp::AddDoc { id } => {
                    let facet = Facet::from(&("/cola/".to_string() + &id.to_string()));
                    index_writer
                        .add_document(doc!(id_field=>id, multi_numbers=> id, multi_numbers => id, text_field => id.to_string(), facet_field => facet, large_text_field=> LOREM))?;
                    index_writer.add_document(doc!(id_field=>id,
                        bytes_field => id.to_le_bytes().as_slice(),
                        multi_numbers=> id,
                        multi_numbers => id,
                        text_field => id.to_string(),
                        facet_field => facet,
                        large_text_field=> LOREM
                    ))?;
                }
                IndexingOp::DeleteDoc { id } => {
                    index_writer.delete_term(Term::from_field_u64(id_field, id));
@@ -1547,12 +1541,7 @@ mod tests {
        let store_reader = segment_reader.get_store_reader().unwrap();
        // test store iterator
        for doc in store_reader.iter(segment_reader.alive_bitset()) {
            let id = doc
                .unwrap()
                .get_first(id_field)
                .unwrap()
                .u64_value()
                .unwrap();
            let id = doc.unwrap().get_first(id_field).unwrap().as_u64().unwrap();
            assert!(expected_ids_and_num_occurences.contains_key(&id));
        }
        // test store random access
@@ -1562,7 +1551,7 @@ mod tests {
                .unwrap()
                .get_first(id_field)
                .unwrap()
                .u64_value()
                .as_u64()
                .unwrap();
            assert!(expected_ids_and_num_occurences.contains_key(&id));
            let id2 = store_reader
@@ -1570,7 +1559,7 @@ mod tests {
                .unwrap()
                .get_first(multi_numbers)
                .unwrap()
                .u64_value()
                .as_u64()
                .unwrap();
            assert_eq!(id, id2);
        }

@@ -90,10 +90,12 @@ impl Drop for IndexWriterBomb {

#[cfg(test)]
mod tests {
    use super::IndexWriterStatus;
    use crossbeam::channel;
    use std::mem;

    use crossbeam::channel;

    use super::IndexWriterStatus;

    #[test]
    fn test_bomb_goes_boom() {
        let (_tx, rx) = channel::bounded(10);

@@ -1,7 +1,9 @@
use std::cmp;

use itertools::Itertools;

use super::merge_policy::{MergeCandidate, MergePolicy};
use crate::core::SegmentMeta;
use itertools::Itertools;
use std::cmp;

const DEFAULT_LEVEL_LOG_SIZE: f64 = 0.75;
const DEFAULT_MIN_LAYER_SIZE: u32 = 10_000;
@@ -139,14 +141,14 @@ impl Default for LogMergePolicy {

#[cfg(test)]
mod tests {
    use super::*;
    use crate::{
        core::{SegmentId, SegmentMeta, SegmentMetaInventory},
        schema,
    };
    use crate::{indexer::merge_policy::MergePolicy, schema::INDEXED};
    use once_cell::sync::Lazy;

    use super::*;
    use crate::core::{SegmentId, SegmentMeta, SegmentMetaInventory};
    use crate::indexer::merge_policy::MergePolicy;
    use crate::schema;
    use crate::schema::INDEXED;

    static INVENTORY: Lazy<SegmentMetaInventory> = Lazy::new(SegmentMetaInventory::default);

    use crate::Index;

@@ -1,9 +1,8 @@
use crate::Opstamp;
use crate::SegmentId;
use crate::{Inventory, TrackedObject};
use std::collections::HashSet;
use std::ops::Deref;

use crate::{Inventory, Opstamp, SegmentId, TrackedObject};

#[derive(Default)]
pub(crate) struct MergeOperationInventory(Inventory<InnerMergeOperation>);

@@ -1,8 +1,8 @@
use crate::core::SegmentId;
use crate::core::SegmentMeta;
use std::fmt::Debug;
use std::marker;

use crate::core::{SegmentId, SegmentMeta};

/// Set of segment suggested for a merge.
#[derive(Debug, Clone)]
pub struct MergeCandidate(pub Vec<SegmentId>);
@@ -39,8 +39,7 @@ impl MergePolicy for NoMergePolicy {
pub mod tests {

    use super::*;
    use crate::core::SegmentId;
    use crate::core::SegmentMeta;
    use crate::core::{SegmentId, SegmentMeta};

    /// `MergePolicy` useful for test purposes.
    ///
Some files were not shown because too many files have changed in this diff.