Commit Graph

  • 73452284ae Remove unused crates from dependencies (#2018) tottoto 2023-05-02 19:34:20 +09:00
  • ba309e18a1 switch to nanosecond precision (#2016) PSeitz 2023-05-01 09:32:20 +08:00
  • cbf2bdc75b change bucket count type (#2013) PSeitz 2023-04-27 15:47:31 +08:00
  • 1f06997d04 fix single collector special case (#2014) PSeitz 2023-04-27 15:30:19 +08:00
  • c599bf3b6c chore!:drop JSON support on intermediate agg result (#1992) PSeitz 2023-04-26 19:05:16 +08:00
  • 80df1d9835 Handle error for exists on MMapDirectory (#1988) PSeitz 2023-04-25 15:20:33 +08:00
  • 2e369db936 switch to Aggregation without serde_untagged (#2003) PSeitz 2023-04-25 14:54:51 +08:00
  • 7b31100208 refactor vint (#2010) PSeitz 2023-04-25 14:49:36 +08:00
  • 9c93bfeb51 optimise warmup code path (#2007) trinity-1686a 2023-04-21 11:23:09 +02:00
  • 74f9eafefc refactor Term (#2006) PSeitz 2023-04-20 21:31:43 +08:00
  • ff3d3313c4 fix BooleanQuery document (#1999) RT_Enzyme 2023-04-20 17:37:20 +08:00
  • fbda511a1a Making more things public for quickwit. (#2005) Paul Masurel 2023-04-20 11:37:45 +09:00
  • c1defdda05 Bump aho-corasick dependency to version 1.0 and adjust to API changes (#2002) Adam Reichold 2023-04-18 07:34:30 +02:00
  • e522163a1c use json in agg tests (#1998) PSeitz 2023-04-17 20:08:48 +08:00
  • e83abbfe4a perf: faster term hash map (#1940) PSeitz 2023-04-17 15:07:33 +08:00
  • 780e26331d sstable compression (#1946) trinity-1686a 2023-04-14 16:25:50 +02:00
  • 0286ecea09 re-export a few sstable functions on dicitonary (#1996) trinity-1686a 2023-04-14 11:13:48 +02:00
  • b0ef9a6252 use crates.io dependency (#1990) PSeitz 2023-04-14 09:35:20 +08:00
  • 36138c493b Merge pull request #1994 from quickwit-oss/fmassot/expose-simple-token-stream François Massot 2023-04-13 18:55:02 +02:00
  • 64bce340b2 Expose to use it in quickwit. François Massot 2023-04-07 13:09:21 +02:00
  • 205e8a0a92 encode dictionary type in fst footer (#1968) trinity-1686a 2023-04-12 09:43:01 +02:00
  • 4b01cc4c49 Made BooleanWeight and BoostWeight public (#1991) Paul Masurel 2023-04-12 10:26:30 +09:00
  • 0ed13eeea8 add sparse to agg benchmark (#1986) PSeitz 2023-04-11 14:13:32 +08:00
  • 91a38058fe Fix typo in READEME.md (#1989) Tony-X 2023-04-10 20:07:20 -07:00
  • 41af70799d add percentiles aggregations (#1984) PSeitz 2023-04-07 13:18:28 +08:00
  • f853bf204b Align the numerical type priority order with columnar. (#1978) Paul Masurel 2023-04-07 10:07:54 +09:00
  • 11ae48d3bc Update benchmarks section in READEME.md to link to the bench repo (#1985) Tony-X 2023-04-06 18:07:06 -07:00
  • 4b3af76b74 Added failing use case issue/1981-reproduce-bm25-bug Paul Masurel 2023-04-04 18:23:03 +09:00
  • 5eb12173d6 Proptest merge columnar (#1976) Paul Masurel 2023-04-04 11:28:42 +09:00
  • 806a1e1b1e clarify tokenizer docs tokenizer_docs Pascal Seitz 2023-04-03 22:59:38 +08:00
  • 5c4ea6a708 tokenizer option on text fastfield (#1945) PSeitz 2023-03-31 16:03:38 +08:00
  • 4cf93dab7d fix build (#1973) PSeitz 2023-03-31 12:54:03 +08:00
  • 5c380b76e7 Better mixed types support in aggs and fix serialization issue (#1971) PSeitz 2023-03-31 11:52:11 +08:00
  • 571735c5f7 Fix index sort by on optional/multicolumn (#1972) PSeitz 2023-03-31 10:24:11 +08:00
  • 8e92f960d3 Fix comment: change max_merge_size to max_docs_before_merge. (#1970) zhouhui 2023-03-28 21:49:00 +08:00
  • 057211c3d8 Fixing build on arm (#1966) Paul Masurel 2023-03-27 22:42:57 +09:00
  • 059fc767ea Added ::MIN ::MAX DateTime. (#1965) Paul Masurel 2023-03-27 15:32:53 +09:00
  • 694a056255 Faster range (#1954) Paul Masurel 2023-03-27 14:56:32 +09:00
  • 2955e34452 Added proptests for building/merging columnar. (#1963) Paul Masurel 2023-03-27 14:56:02 +09:00
  • 821208480b Adding Debug/Display impl. Refining the ColumnIndex::get_cardinality Paul Masurel 2023-03-26 14:40:37 +09:00
  • a088de3079 proptests merge-proptests Paul Masurel 2023-03-25 15:53:00 +09:00
  • a2e3c2ed5b Renaming Column::idx -> Column::index (#1961) Paul Masurel 2023-03-26 13:58:50 +09:00
  • 835f228bfa fix cardinality when merging empty columns (#1960) PSeitz 2023-03-25 14:58:15 +08:00
  • 2b6a4da640 Exposing empty column builder. (#1959) Paul Masurel 2023-03-24 16:34:41 +09:00
  • d6a95381ee add memory check for term agg (#1957) PSeitz 2023-03-24 13:47:45 +08:00
  • da2804644f fetch blocks of vals in aggregation for all cardinality (#1950) PSeitz 2023-03-23 15:41:11 +08:00
  • 5504cfd012 remove IterColumn (#1955) PSeitz 2023-03-23 13:43:17 +08:00
  • 482b4155e8 fix bug with new sstable index format (#1953) trinity-1686a 2023-03-22 10:22:36 +01:00
  • 1a35f6573d Switch fs2 to fs4 as it is now unmaintained and does not support illumos (#1944) Till Wegmüller 2023-03-22 05:48:49 +01:00
  • e5e50603a8 new sstable format (#1943) trinity-1686a 2023-03-21 15:03:52 +01:00
  • ab703486aa Updated columnar todo columnar-todo Paul Masurel 2023-03-21 18:55:23 +09:00
  • 8f7f1d6be4 add Display for ByteCount (#1949) PSeitz 2023-03-21 15:02:35 +08:00
  • 6a7a1106d6 work in batches of docs (#1937) PSeitz 2023-03-21 13:57:44 +08:00
  • 9e2faecf5b add memory limit for aggregations (#1942) PSeitz 2023-03-16 13:21:07 +08:00
  • b6703f1b3c fix validation in date histogram (#1936) PSeitz 2023-03-15 13:10:43 +08:00
  • 2fb3740cb0 handle missing column for aggs (#1920) PSeitz 2023-03-15 13:09:59 +08:00
  • 7ce8a65619 fix: doc store for files larger 4GB (#1856) quickwit-0.5-rev quickwit-0.5 PSeitz 2023-02-10 21:29:43 +08:00
  • 7bf0a14041 fix: auto downgrade index record option, instead of vint error (#1857) PSeitz 2023-02-10 20:45:23 +08:00
  • c91d4e4e65 fix sort order test for term aggregation (#1858) PSeitz 2023-02-10 17:26:58 +08:00
  • 6f6f639170 fmt code, update lz4_flex (#1838) PSeitz 2023-02-10 00:42:32 +08:00
  • a022e97dc2 Bumped tantivy version Paul Masurel 2023-03-13 14:10:41 +09:00
  • 8459efa32c split term collection count and sub_agg (#1921) PSeitz 2023-03-13 11:37:41 +08:00
  • 61cfd8dc57 fix clippy (#1927) PSeitz 2023-03-13 10:12:02 +08:00
  • 6474a0f58e Created branch specifically for Quickwit 0.5 Paul Masurel 2023-03-11 12:27:20 +09:00
  • 064518156f refactor tokenization pipeline to use GATs (#1924) trinity-1686a 2023-03-09 09:39:37 +01:00
  • a42a96f470 fix panic in dict column merge (#1930) PSeitz 2023-03-08 21:04:37 +08:00
  • fcf5a25d93 use DeltaReader directly to implement Dictionnary::ord_to_term (#1928) trinity-1686a 2023-03-08 03:15:56 +01:00
  • c0a5b28fd3 Update lru requirement from 0.9.0 to 0.10.0 (#1932) dependabot[bot] 2023-03-07 15:09:02 +09:00
  • a4f7ca8309 use DeltaReader directly to implement Dictionnary::term_ord (#1925) trinity-1686a 2023-03-06 09:45:22 +01:00
  • 364e321415 Clippy fix (#1926) Paul Masurel 2023-03-06 10:37:17 +09:00
  • 72925c2bba Removed azure stuff remove-byteorder Paul Masurel 2023-03-03 21:47:31 +09:00
  • ed5a3b3172 Bumped murmurhash version Paul Masurel 2023-03-03 21:24:32 +09:00
  • 6037cdfe7e remove dynamic dispatch in collect_segment remove_dyn Pascal Seitz 2023-03-02 20:15:47 +08:00
  • ca20bfa776 add date_histogram (#1900) PSeitz 2023-03-02 12:17:35 +08:00
  • faa706d804 add coerce option for text and numbers types (#1904) PSeitz 2023-03-01 18:36:59 +08:00
  • 850a0d7ae2 add agg benchmark for optional and multi value (#1916) PSeitz 2023-03-01 16:01:52 +08:00
  • 7fae4d98d7 Adapting for quickwit2 (#1912) Paul Masurel 2023-03-01 16:27:46 +09:00
  • bc36458334 move buffer in front of dynamic dispatch (#1915) PSeitz 2023-02-28 13:07:50 +08:00
  • 8a71e00da3 allow limiting the number of matched term in range query (#1899) trinity-1686a 2023-02-27 10:44:08 +01:00
  • e510f699c8 feat: add support for u64,i64,f64 fields in term aggregation (#1883) PSeitz 2023-02-27 15:04:41 +08:00
  • d25fc155b2 Making some of the column/termdict operations async-friendly (#1902) Paul Masurel 2023-02-27 15:34:47 +09:00
  • 8ea97e7d6b Minor refactoring preparing for getting columnar integrated in quickwit. (#1911) Paul Masurel 2023-02-27 14:23:30 +09:00
  • 0a726a0897 Added Empty ColumnIndex (#1910) Paul Masurel 2023-02-27 13:59:22 +09:00
  • 66ff53b0f4 Various minor code cleanup (#1909) Paul Masurel 2023-02-27 13:48:34 +09:00
  • d002698008 Re-export of query grammar. (#1908) Paul Masurel 2023-02-27 12:26:34 +09:00
  • c838aa808b Removedc the extra nesting in unit test file (#1907) Paul Masurel 2023-02-27 12:17:52 +09:00
  • 06850719dc Renaming .values(DocId) to .values_for_doc(DocId) (#1906) Paul Masurel 2023-02-27 12:15:13 +09:00
  • 5f23bb7e65 switch to sparse collection for histogram (#1898) PSeitz 2023-02-23 14:02:58 +08:00
  • 533ad99cd5 add PhrasePrefixQuery (#1842) trinity-1686a 2023-02-22 11:18:33 +01:00
  • c7278b3258 remove schema in aggs (#1888) PSeitz 2023-02-22 11:50:28 +08:00
  • 6b403e3281 Re-export of columnar Paul Masurel 2023-02-22 11:23:20 +09:00
  • 789cc8703e Adding unit test testing docfreq after merge (#1895) Paul Masurel 2023-02-22 11:05:34 +09:00
  • e5098d9fe8 Moving test around reenabling tests that were disabled. (#1894) Paul Masurel 2023-02-22 10:31:52 +09:00
  • f537334e4f Adding a write schema to columnar's merge operations. (#1884) Paul Masurel 2023-02-21 18:25:16 +09:00
  • e2aa5af075 Clippy warnings fixes (#1885) Paul Masurel 2023-02-20 19:04:13 +09:00
  • 02bebf4ff5 Cargo fmt Paul Masurel 2023-02-20 09:40:01 +09:00
  • 0274c982d5 Refactoring. (#1881) Paul Masurel 2023-02-17 21:57:14 +09:00
  • 74bf60b4f7 implement SegmentAggregationCollector on bucket aggs (#1878) PSeitz 2023-02-17 19:53:29 +08:00
  • bf1449b22d update examples for literate docs (#1880) PSeitz 2023-02-17 18:48:22 +08:00
  • 111f25a8f7 clippy (#1879) PSeitz 2023-02-17 18:34:21 +08:00