tantivy

mirror of https://github.com/quickwit-oss/tantivy.git synced 2026-06-05 01:50:42 +00:00

Author	SHA1	Message	Date
PSeitz	c4e2708901	fix clippy, fmt (#2162 )	2023-08-30 08:04:26 +02:00
Chris Tam	e6cacc40a9	Remove outdated fast field documentation (#2145 )	2023-08-24 07:49:49 +02:00
Paul Masurel	756156beaf	Fix doc	2023-08-17 17:47:45 +09:00
trinity-1686a	b92082b748	implement lenient parser (#2129 ) * move query parser to nom * add suupport for term grouping * initial work on infallible parser * fmt * add tests and fix minor parsing bugs * address review comments * add support for lenient queries in tantivy * make lenient parser report errors * allow mixing occur and bool in query	2023-08-08 15:41:29 +02:00
Paul Masurel	7ee78bda52	Readding s in datetime precision variant names (#2065 ) There is no clear win and it change some serialization in quickwit.	2023-06-01 06:39:46 +02:00
PSeitz	e56addc63e	enable tokenizer on json fields (#2053 ) * enable tokenizer on json fields enable tokenizer on json fields for type text * Avoid making the tokenizer within the TextAnalyzer pub(crate) * Moving BoxableTokenizer to tantivy. --------- Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-05-24 10:47:39 +02:00
Adrien Guillo	a789ad9aee	Rename `DatePrecision` to `DateTimePrecision` (#2051 )	2023-05-23 17:09:11 +02:00
Paul Masurel	d7e97331e5	Minor refactoring find field (#2055 ) * Minor refactoring Moving find_field_with_default to Schema. * Clippy comments	2023-05-22 15:00:48 +09:00
Paul Masurel	4417be165d	Minor refactoring (#2054 ) Moving find_field_with_default to Schema.	2023-05-22 14:56:38 +09:00
Yuri Astrakhan	74275b76a6	Inline format arguments where makes sense (#2038 ) Applied this command to the code, making it a bit shorter and slightly more readable. ``` cargo +nightly clippy --all-features --benches --tests --workspace --fix -- -A clippy::all -W clippy::uninlined_format_args cargo +nightly fmt --all ```	2023-05-10 18:03:59 +09:00
PSeitz	4ee1b5cda0	add seperate tokenizer manager for fast fields (#2019 ) * add seperate tokenizer manager for fast fields * rename	2023-05-08 11:22:31 +02:00
PSeitz	ba309e18a1	switch to nanosecond precision (#2016 )	2023-05-01 03:32:20 +02:00
PSeitz	7b31100208	refactor vint (#2010 ) - improve performance of vint vint serialization shows up in performance profiles during indexing. It would also make sense to limit the value space to u29 and operate on 4 bytes only. - remove unused code - add missing inlines - fix regex test	2023-04-25 08:49:36 +02:00
PSeitz	74f9eafefc	refactor Term (#2006 ) * refactor Term add ValueBytes for serialized term values add missing debug for ip skip unnecessary json path validation remove code duplication add DATE_TIME_PRECISION_INDEXED constant add missing Term clarification remove weird value_bytes_mut() API * fix naming	2023-04-20 15:31:43 +02:00
Paul Masurel	fbda511a1a	Making more things public for quickwit. (#2005 )	2023-04-20 11:37:45 +09:00
PSeitz	5c4ea6a708	tokenizer option on text fastfield (#1945 ) * tokenizer option on text fastfield allow to set tokenizer option on text fastfield (fixes #1901) handle PreTokenized strings in fast field * change visibility * remove custom de/serialization	2023-03-31 10:03:38 +02:00
PSeitz	faa706d804	add coerce option for text and numbers types (#1904 ) * add coerce option for text and numbers types allow to coerce the field type when indexing if the type does not match * Apply suggestions from code review Co-authored-by: Paul Masurel <paul@quickwit.io> * add tests,add COERCE flag, include bool in coercion --------- Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-03-01 11:36:59 +01:00
Paul Masurel	f537334e4f	Adding a write schema to columnar's merge operations. (#1884 ) * Adding a write schema to columnar's merge operations. * Added unit test checking min/max when columns are empty. * CR comment * Rename to value_type_to_column_type	2023-02-21 18:25:16 +09:00
PSeitz	111f25a8f7	clippy (#1879 ) * fix clippy * fix clippy * fmt	2023-02-17 11:34:21 +01:00
Paul Masurel	7423f99719	Issue/columnar for json (#1876 ) Adding support for JSON fast field.	2023-02-16 20:38:32 +09:00
trinity-1686a	539ff08a79	move DateTime to tantivy_common (#1861 ) * move DateTime to tantivy_common * resolve imports of columnar::DateTime as import of common::DateTime	2023-02-11 17:03:06 +01:00
PSeitz	36c6138e7f	fix: auto downgrade index record option, instead of vint error (#1857 ) Prev: thread 'main' panicked at 'called `Result::unwrap()` on an `Err` value: IoError(Custom { kind: InvalidData, error: "Reach end of buffer while reading VInt" })', src/main.rs:46:14 Now: Automatic downgrade to next available level	2023-02-10 13:45:23 +01:00
trinity-1686a	1390834ae8	make Term::as_slice public (#1846 )	2023-02-09 15:37:07 +01:00
trinity-1686a	3ac973bea4	fix invalid endianness in documentation (#1845 ) * fix doc about term endianness * rustfmt	2023-02-09 15:36:38 +01:00
Paul Masurel	bd5eea9852	Integrated columnar work.	2023-02-09 13:14:31 +01:00
Lonre Wang	8ba333f1b4	Typo fix (#1803 ) * Update text_options.rs * Update src/schema/text_options.rs Co-authored-by: Paul Masurel <paul@quickwit.io>	2023-01-19 17:56:05 +09:00
PSeitz	f687b3a5aa	start migrate Field to &str (#1772 ) start migrate Field to &str in preparation of columnar return Result for get_field	2023-01-18 16:12:07 +09:00
Paul Masurel	7a8fce0ae7	Minor mini fixes	2023-01-10 14:15:30 +09:00
Adam Reichold	2080c370c2	Enable usage of FuzzyTermQuery for specific fields via QueryParser (#1750 ) * Make nightly Clippy mostly happy. * Document how to produce TermSetQuery queries using QueryParser. * Enable construction of queries using FuzzyTermQuery via the QueryParser * Use FxHashMap instead of HashMap in the QueryParser as these hash tables are not exposed to DoS attacks. * Use a struct instead of a tuple to improve readability.	2023-01-04 18:11:27 +09:00
PSeitz	f9171a3981	fix clippy (#1725 ) * fix clippy * fix clippy fastfield codecs * fix clippy bitpacker * fix clippy common * fix clippy stacker * fix clippy sstable * fmt	2022-12-20 07:30:06 +01:00
boraarslan	495824361a	Move `split_full_path` to `Schema` (#1692 )	2022-11-29 20:56:13 +09:00
PSeitz	ee1f2c1f28	add aggregation support for date type (#1693 ) * add aggregation support for date type fixes #1332 * serialize key_as_string as rfc3339 in date histogram * update docs * enable date for range aggregation	2022-11-28 09:12:08 +09:00
Paul Masurel	0b40a7fe43	Added a `expand_dots` JsonObjectOptions. (#1687 ) Related with quickwit#2345.	2022-11-21 23:03:00 +09:00
PSeitz	0c2bd36fe3	Panic on duplicate field names (#1647 ) fixes #1601	2022-10-26 16:17:33 +09:00
Pascal Seitz	6bb73a527f	add range query via ip fast field	2022-10-24 16:00:38 +08:00
Pascal Seitz	6800fdec9d	add indexing for ip field Closes #1595	2022-10-18 10:07:48 +08:00
Pascal Seitz	024e53a99c	remove truncate	2022-10-17 12:14:35 +08:00
Pascal Seitz	8d75e451bd	fix truncate, remove mutable access from term	2022-10-17 12:14:35 +08:00
Pascal Seitz	fcfd76ec55	refactor Term fixes some issues with Term Remove duplicate calls to truncate or resize Replace Magic Number 5 with constant Enforce minimum size of 5 for metadata Fix broken truncate docs use constructor instead new + set calls normalize constructor stack replace assert on internal behavior fixes #1585	2022-10-17 12:14:34 +08:00
Pascal Seitz	952b048341	add term aggregation clarification	2022-10-14 16:12:19 +08:00
PSeitz	8b69aab0fc	avoid prepare_doc allocation (#1610 ) avoid prepare_doc allocation, ~10% more thoughput best case	2022-10-11 14:15:55 +09:00
PSeitz	3650d1f36a	Merge pull request #1553 from quickwit-oss/ip_field ip field	2022-10-11 13:09:47 +08:00
François Massot	e443ca63aa	Merge pull request #1608 from quickwit-oss/nigel/serialise-bytes-as-b64-#2042 Serialise bytes as base64 strings instead of arrays.	2022-10-10 11:51:23 +02:00
Pascal Seitz	5c9cbee29d	handle IpV4 serialization case	2022-10-07 19:52:00 +08:00
Pascal Seitz	b2ca83a93c	switch to ipv6, add monotonic_mapping tests	2022-10-07 18:47:55 +08:00
Nigel Andrews	3b189080d4	Use raw string literals in tests	2022-10-07 12:28:25 +02:00
Nigel Andrews	00a6586efe	Replaced String::serialize for serializer.serialize_str	2022-10-07 11:55:05 +02:00
PSeitz	534b1d33c3	use ipv6 Co-authored-by: Paul Masurel <paul@quickwit.io>	2022-10-07 16:56:00 +08:00
Pascal Seitz	5171ff611b	serialize ip as u128, add test for positions_to_docid	2022-10-07 16:25:01 +08:00
Pascal Seitz	e50e74acf8	remove u128 type	2022-10-07 16:25:01 +08:00

1 2 3 4 5 ...

338 Commits