Paul Masurel
24d2e3f6c1
Switching to the stacker data structure
2016-09-05 10:27:14 +09:00
Paul Masurel
45bb1c5ab8
Working... Test passing. Indexing Wikipedia in 4min37
2016-08-30 00:52:52 +09:00
Paul Masurel
a599614a94
Code clean up.
2016-08-27 17:00:14 +09:00
Paul Masurel
5e806c88ef
Issue 20 Searcher pool implemented using a channel.
...
Operational but not really ready for merge.
2016-08-27 16:15:02 +09:00
Paul Masurel
46e2b73ea5
small refactoring
2016-08-26 10:32:49 +09:00
Paul Masurel
b2afe8592c
Erased text in README
2016-08-26 10:17:09 +09:00
Paul Masurel
59150ad802
superficial refactoring
2016-08-26 09:30:09 +09:00
Paul Masurel
ce95bfd9a5
Closes #19 . Move example to examples directory.
2016-08-24 10:58:08 +09:00
Paul Masurel
e6200e85af
Schema now:
...
- internally wrapped by an Arc, cloning aggressively is ok.
- a field in schema, redundant with metas
- read-only. Needs to be built via a SchemaBuilder.
2016-08-23 22:23:47 +09:00
Paul Masurel
054405adf1
Removing the locks from the index to keep only those in Directory
2016-08-23 11:18:19 +09:00
Paul Masurel
182d2549b9
Added commit / rollback / docstamp
2016-08-21 14:38:46 +09:00
Paul Masurel
0639911cc3
Moved index code to another module.
2016-08-20 15:29:27 +09:00
Paul Masurel
08612899eb
Change error used in directory.
2016-08-19 00:56:13 +09:00
Paul Masurel
04238d9e65
Documented the Directory behavior.
...
Removed sync calls, and moved them to file creation
and flush instead.
2016-08-18 23:41:58 +09:00
Paul Masurel
0972a1c6a0
Removing data copy in the RAMDirectory
...
The fst crate recently added support for sliced `Arc<Vec<u8>>`.
This called for a rewrite of tantivy's RAMDirectory.
Previously every single read was copying data.
In addition:
- RAMDirectory's Write object panics if someone does not flush
right before the destruction of the object.
- In the same spirit, the postings serializer panics if someone
opens a term without closing the previous one.
Closes #16
2016-08-18 10:45:34 +09:00
Paul Masurel
ed38007e9f
NOBUG made building index from directory public
2016-08-16 11:55:10 +09:00
Paul Masurel
cf50b97f8e
FieldType::Text -> Str
2016-08-14 23:29:00 +09:00
Paul Masurel
e847de1ebb
Refactoring, and making the server work with a dynamic schema
2016-08-14 14:28:36 +09:00
Paul Masurel
4c42c314e3
Changed serialization of field_entry
2016-08-13 23:45:12 +09:00
Paul Masurel
f66a48af42
Bugfix in merger
2016-08-13 20:14:00 +09:00
Paul Masurel
654769bb60
merging positions.
2016-08-13 15:59:12 +09:00
Paul Masurel
c3e3715cbd
Can request more or less functionality when opening segment postings.
2016-08-13 14:15:28 +09:00
Paul Masurel
9a8f153d2b
positions
2016-08-13 13:15:39 +09:00
Paul Masurel
cf1415a17c
Misc fixes.
2016-08-13 00:31:56 +09:00
Paul Masurel
1fba8a2b58
blop
2016-08-12 10:13:01 +09:00
Paul Masurel
e644a371d6
moving stuff around in the segment writer
2016-08-12 09:07:27 +09:00
Paul Masurel
57626cd217
indexing untokenized fields
2016-08-12 00:50:23 +09:00
Paul Masurel
e017840377
BUGFIX - every field was recording term frequency. (no position)
2016-08-12 00:46:22 +09:00
Paul Masurel
ca24daae51
Added CLI wizard for index new
2016-08-11 00:37:07 +09:00
Paul Masurel
c53a930389
Reintroduced cli command in tantivy
2016-08-10 10:08:18 +09:00
Paul Masurel
f9ee6c45bd
Added occur
2016-08-07 00:06:03 +09:00
Paul Masurel
d4bbec6631
Safer interface for union_postings
2016-08-06 18:30:05 +09:00
Paul Masurel
0ee473f474
Added tantivy::Error
2016-08-06 17:02:31 +09:00
Paul Masurel
bf0d072c2d
next() -> advance()
2016-08-06 11:50:17 +09:00
Paul Masurel
c93b9a2b64
Change benchmark
2016-08-03 21:39:02 +09:00
Paul Masurel
b39203c88c
Dividing by 0 now always returns 0 in the divider
2016-08-01 10:47:21 +09:00
Paul Masurel
f94efcf5aa
More than one default field in query parser. fst version to 1.3.1.
2016-08-01 09:22:01 +09:00
Paul Masurel
090973ff16
Added field norms.
2016-07-31 22:01:54 +09:00
Paul Masurel
97c6be1882
Added fieldnorms
2016-07-31 20:19:36 +09:00
Paul Masurel
22bc18040e
code cleaning.
2016-07-31 16:51:32 +09:00
Paul Masurel
1e9feaddf9
Syncing all components.
2016-07-31 16:29:20 +09:00
Paul Masurel
e486495cb8
Code cleaning.
2016-07-31 15:34:32 +09:00
Paul Masurel
0e5d6720ba
Fst version and SegmentId serialization
...
+ Using the master branch of fst (because of the slice issue)
+ Serializing SegmentId to the associated uid string directly
2016-07-31 12:22:26 +09:00
Paul Masurel
7281022ed1
Renamed MMapDirectory::create to MMapDirectory::open
2016-07-31 12:06:10 +09:00
Paul Masurel
bf2c95bb14
Minor changes.
2016-07-29 17:21:55 +09:00
Paul Masurel
04556c454a
added top collector
2016-07-29 16:20:57 +09:00
Paul Masurel
5f64e1ec9c
added a simple query parser
2016-07-28 17:48:43 +09:00
Paul Masurel
1080fe19dc
Added test for tfidf score, added skip
2016-07-24 17:35:33 +09:00
Paul Masurel
852ce73849
test passing.
2016-07-23 23:15:10 +09:00
Paul Masurel
e20acd4ef6
Added doc_freq to searcher. Sums the value for all readers.
2016-07-05 21:37:35 +09:00