273 Commits

Author SHA1 Message Date
Paul Masurel
24d2e3f6c1 Switching to the stacker data structure 2016-09-05 10:27:14 +09:00
Paul Masurel
45bb1c5ab8 Working... Test passing. Indexing Wikipedia in 4min37 2016-08-30 00:52:52 +09:00
Paul Masurel
a599614a94 Code clean up. 2016-08-27 17:00:14 +09:00
Paul Masurel
5e806c88ef Issue 20 Searcher pool implemented using a channel.
Operational but not really ready for merge.
2016-08-27 16:15:02 +09:00
Paul Masurel
46e2b73ea5 small refactoring 2016-08-26 10:32:49 +09:00
Paul Masurel
b2afe8592c Erased text in README 2016-08-26 10:17:09 +09:00
Paul Masurel
59150ad802 superficial refactoring 2016-08-26 09:30:09 +09:00
Paul Masurel
ce95bfd9a5 Closes #19. Move example to examples directory. 2016-08-24 10:58:08 +09:00
Paul Masurel
e6200e85af Schema now:
- internally wrapped by an Arc, cloning aggressively is ok.
- a field in schema, redundant with metas
- read-only. Needs to be built via a SchemaBuilder.
2016-08-23 22:23:47 +09:00
Paul Masurel
054405adf1 Removing the locks from the index to keep only those in Directory 2016-08-23 11:18:19 +09:00
Paul Masurel
182d2549b9 Added commit / rollback / docstamp 2016-08-21 14:38:46 +09:00
Paul Masurel
0639911cc3 Moved index code to another module. 2016-08-20 15:29:27 +09:00
Paul Masurel
08612899eb Change error used in directory. 2016-08-19 00:56:13 +09:00
Paul Masurel
04238d9e65 Documented the Directory behavior.
Removed sync calls, and moved them to file creation
and flush instead.
2016-08-18 23:41:58 +09:00
Paul Masurel
0972a1c6a0 Removing data copy in the RAMDirectory
The fst crate recently added support for sliced `Arc<Vec<u8>>`.
This called for a rewrite of tantivy's RAMDirectory.
Previously every single read was copying data.

In addition:
- RAMDirectory's Write object panics if someone does not flush
right before the destruction of the object.
- In the same spirit, the postings serializer panics if someone
opens a term without closing the previous one.

Closes #16
2016-08-18 10:45:34 +09:00
Paul Masurel
ed38007e9f NOBUG made building index from directory public 2016-08-16 11:55:10 +09:00
Paul Masurel
cf50b97f8e FieldType::Text -> Str 2016-08-14 23:29:00 +09:00
Paul Masurel
e847de1ebb Refactoring, and making the server work with a dynamic schema 2016-08-14 14:28:36 +09:00
Paul Masurel
4c42c314e3 Changed serialization of field_entry 2016-08-13 23:45:12 +09:00
Paul Masurel
f66a48af42 Bugfix in merger 2016-08-13 20:14:00 +09:00
Paul Masurel
654769bb60 merging positions. 2016-08-13 15:59:12 +09:00
Paul Masurel
c3e3715cbd Can request more or less functionality when opening segment postings. 2016-08-13 14:15:28 +09:00
Paul Masurel
9a8f153d2b positions 2016-08-13 13:15:39 +09:00
Paul Masurel
cf1415a17c Misc fixes. 2016-08-13 00:31:56 +09:00
Paul Masurel
1fba8a2b58 blop 2016-08-12 10:13:01 +09:00
Paul Masurel
e644a371d6 moving stuff around in the segment writer 2016-08-12 09:07:27 +09:00
Paul Masurel
57626cd217 indexing untokenized fields 2016-08-12 00:50:23 +09:00
Paul Masurel
e017840377 BUGFIX - every field was recording term frequency. (no position) 2016-08-12 00:46:22 +09:00
Paul Masurel
ca24daae51 Added CLI wizard for index new 2016-08-11 00:37:07 +09:00
Paul Masurel
c53a930389 Reintroduced cli command in tantivy 2016-08-10 10:08:18 +09:00
Paul Masurel
f9ee6c45bd Added occur 2016-08-07 00:06:03 +09:00
Paul Masurel
d4bbec6631 Safer interface for union_postings 2016-08-06 18:30:05 +09:00
Paul Masurel
0ee473f474 Added tantivy::Error 2016-08-06 17:02:31 +09:00
Paul Masurel
bf0d072c2d next() -> advance() 2016-08-06 11:50:17 +09:00
Paul Masurel
c93b9a2b64 Change benchmark 2016-08-03 21:39:02 +09:00
Paul Masurel
b39203c88c Added divide by 0 as returning always 0 in divider by 2016-08-01 10:47:21 +09:00
Paul Masurel
f94efcf5aa More than one default field in query parser. fst version to 1.3.1. 2016-08-01 09:22:01 +09:00
Paul Masurel
090973ff16 Added field norms. 2016-07-31 22:01:54 +09:00
Paul Masurel
97c6be1882 Added fieldnorms 2016-07-31 20:19:36 +09:00
Paul Masurel
22bc18040e code cleaning. 2016-07-31 16:51:32 +09:00
Paul Masurel
1e9feaddf9 Syncing all components. 2016-07-31 16:29:20 +09:00
Paul Masurel
e486495cb8 Code cleaning. 2016-07-31 15:34:32 +09:00
Paul Masurel
0e5d6720ba Fst version and SegmentId serialization
+ Using the master branch of fst (because of the slice issue)
+ Serializing SegmentId to the associated uid string directly
2016-07-31 12:22:26 +09:00
Paul Masurel
7281022ed1 Renamed MMapDirectory::create to MMapDirectory::open 2016-07-31 12:06:10 +09:00
Paul Masurel
bf2c95bb14 Minor changes. 2016-07-29 17:21:55 +09:00
Paul Masurel
04556c454a added top collector 2016-07-29 16:20:57 +09:00
Paul Masurel
5f64e1ec9c added a simple query parser 2016-07-28 17:48:43 +09:00
Paul Masurel
1080fe19dc Added test for tfidf score, added skip 2016-07-24 17:35:33 +09:00
Paul Masurel
852ce73849 test passing. 2016-07-23 23:15:10 +09:00
Paul Masurel
e20acd4ef6 Added doc_freq to searcher. Sums the value for all readers. 2016-07-05 21:37:35 +09:00