
106 Commits

Author SHA1 Message Date
Paul Masurel
fe905ff18b bug/4 Bugfix, and made unit test way faster 2016-10-11 08:59:41 +09:00
Paul Masurel
9298a6ad9e bug/4 2016-10-01 19:03:36 +09:00
Paul Masurel
1a08ca4f95 Doc 2016-09-22 15:45:37 +09:00
Paul Masurel
de7880bde9 NOBUG Added comments. 2016-09-21 10:38:50 +09:00
Paul Masurel
0b11ed517a NOBUG Added documents. 2016-09-20 11:07:40 +09:00
Paul Masurel
17715fe84c Added documentation 2016-09-20 00:43:52 +09:00
Paul Masurel
578b8df288 NOBUG Added documentation 2016-09-19 18:45:50 +09:00
Paul Masurel
f3a24f5b3c NOBUG Code cleaning, cargo clippy 2016-09-19 17:01:37 +09:00
Paul Masurel
7969fb3a71 Use logging. 2016-09-15 00:00:14 +09:00
Paul Masurel
346fc31ac2 Chaining heaps.
We commit (close) segments when the indexer heap is close to its capacity
(currently we use a limit of 10_000_000).

Because we do this check before indexing a document, and also because
serialization starts by closing the postingswriter, and therefore all of
the recorders open for the last document, we may still overflow the heap.

We don't want to resize the heap because we may have references to objects
in the current heap.

Because of that, heaps are actually chained lists.
In an ideal setting, the limit should work fine and this overflow behavior should
never be activated.
2016-09-14 10:27:55 +09:00
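The chaining idea in the message above can be illustrated with a minimal sketch. It is not the code from this commit: `ChainedHeap`, `Addr`, and every method name are hypothetical, and the real heap may be organized quite differently. The sketch only shows why chaining a new block, instead of resizing the existing one, keeps previously handed-out addresses valid.

```rust
/// Hypothetical address of an allocation: which block, and where inside it.
#[derive(Clone, Copy, Debug, PartialEq)]
struct Addr {
    block: usize,
    offset: usize,
}

/// Hypothetical bump allocator made of fixed-capacity blocks.
/// A full block is never resized; a fresh block is chained after it.
struct ChainedHeap {
    block_capacity: usize,
    blocks: Vec<Vec<u8>>,
}

impl ChainedHeap {
    fn with_capacity(block_capacity: usize) -> ChainedHeap {
        ChainedHeap {
            block_capacity,
            blocks: vec![Vec::with_capacity(block_capacity)],
        }
    }

    /// Number of blocks in the chain; 1 means the overflow path was never taken.
    fn num_blocks(&self) -> usize {
        self.blocks.len()
    }

    /// Copies `data` into the heap and returns its address.
    /// If the current block cannot hold `data`, a new block is chained.
    fn allocate(&mut self, data: &[u8]) -> Addr {
        assert!(data.len() <= self.block_capacity, "allocation larger than a block");
        let used = self.blocks.last().map_or(0, |block| block.len());
        if used + data.len() > self.block_capacity {
            self.blocks.push(Vec::with_capacity(self.block_capacity));
        }
        let block = self.blocks.len() - 1;
        let buffer = &mut self.blocks[block];
        let offset = buffer.len();
        buffer.extend_from_slice(data);
        Addr { block, offset }
    }

    /// Reads back `len` bytes stored at `addr`.
    fn get(&self, addr: Addr, len: usize) -> &[u8] {
        &self.blocks[addr.block][addr.offset..addr.offset + len]
    }
}
```

With a block capacity of 8, allocating two 5-byte values puts the second one in a newly chained block, while the address of the first one still resolves to the same bytes.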
Paul Masurel
385794419e Bigger heap in unit test 2016-09-07 10:04:02 +09:00
Paul Masurel
24d2e3f6c1 Switching to the stacker data structure 2016-09-05 10:27:14 +09:00
Paul Masurel
619b65b0f3 ISSUE-8 Replacing Vec by an Unrolled Linked List. 2016-08-28 15:38:56 +09:00
Paul Masurel
a599614a94 Code clean up. 2016-08-27 17:00:14 +09:00
Paul Masurel
5e806c88ef Issue 20 Searcher pool implemented using a channel.
Operational but not really ready for merge.
2016-08-27 16:15:02 +09:00
Paul Masurel
b2afe8592c Erased text in README 2016-08-26 10:17:09 +09:00
Paul Masurel
59150ad802 superficial refactoring 2016-08-26 09:30:09 +09:00
Paul Masurel
ce95bfd9a5 Closes #19. Move example to examples directory. 2016-08-24 10:58:08 +09:00
Paul Masurel
e6200e85af Schema now:
- internally wrapped by an Arc, cloning aggressively is ok.
- a field in schema, redundant with metas
- read-only. Needs to be built via a SchemaBuilder.
2016-08-23 22:23:47 +09:00
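The schema design described in the message above (a read-only value behind an Arc, produced by a builder) might look roughly like the sketch below. The types here are stand-ins written for illustration only. Their fields and methods are assumptions and do not reproduce tantivy's actual `Schema` or `SchemaBuilder` definitions.

```rust
use std::sync::Arc;

/// Hypothetical inner data; the real schema holds richer field entries.
#[derive(Debug)]
struct InnerSchema {
    field_names: Vec<String>,
}

/// Read-only handle. Cloning only increments the Arc reference count,
/// so passing copies around freely is cheap.
#[derive(Clone, Debug)]
struct Schema(Arc<InnerSchema>);

impl Schema {
    fn field_names(&self) -> &[String] {
        &self.0.field_names
    }
}

/// The only way to obtain a Schema in this sketch: mutate the builder,
/// then freeze it with `build`.
#[derive(Default)]
struct SchemaBuilder {
    field_names: Vec<String>,
}

impl SchemaBuilder {
    fn new() -> SchemaBuilder {
        SchemaBuilder::default()
    }

    /// Registers a field and returns its index (a hypothetical field id).
    fn add_field(&mut self, name: &str) -> usize {
        self.field_names.push(name.to_string());
        self.field_names.len() - 1
    }

    /// Consumes the builder, so no further mutation is possible.
    fn build(self) -> Schema {
        Schema(Arc::new(InnerSchema {
            field_names: self.field_names,
        }))
    }
}
```

Because `build` takes the builder by value and `Schema` exposes no mutating method, every clone observes the same immutable data.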
Paul Masurel
0639911cc3 Moved index code to another module. 2016-08-20 15:29:27 +09:00
Paul Masurel
1f4fafd964 Added usage snippet for tantivy 2016-08-16 00:58:11 +09:00
Paul Masurel
f66a48af42 Bugfix in merger 2016-08-13 20:14:00 +09:00
Paul Masurel
2a8262e15e Added positions in postings. 2016-08-13 15:36:39 +09:00
Paul Masurel
e76dc40aaf hiding some inner module 2016-08-11 16:26:18 +09:00
Paul Masurel
ca24daae51 Added CLI wizard for index new 2016-08-11 00:37:07 +09:00
Paul Masurel
f9ee6c45bd Added occur 2016-08-07 00:06:03 +09:00
Paul Masurel
0ee473f474 Added tantivy::Error 2016-08-06 17:02:31 +09:00
Paul Masurel
bf0d072c2d next() -> advance() 2016-08-06 11:50:17 +09:00
Paul Masurel
bc54db6872 Explanation as a struct 2016-08-06 11:17:05 +09:00
Paul Masurel
140f6fd2b4 explanation kind of working in an ugly way. 2016-08-05 12:02:13 +09:00
Paul Masurel
66ebbae85a No influence from starting and trailing spaces. 2016-08-04 00:53:24 +09:00
Paul Masurel
f94efcf5aa More than one default field in query parser. fst version to 1.3.1. 2016-08-01 09:22:01 +09:00
Paul Masurel
090973ff16 Added field norms. 2016-07-31 22:01:54 +09:00
Paul Masurel
97c6be1882 Added fieldnorms 2016-07-31 20:19:36 +09:00
Paul Masurel
e486495cb8 Code cleaning. 2016-07-31 15:34:32 +09:00
Paul Masurel
a6b2d94119 Added unit tests 2016-07-29 18:38:18 +09:00
Paul Masurel
04556c454a added top collector 2016-07-29 16:20:57 +09:00
Paul Masurel
5f64e1ec9c added a simple query parser 2016-07-28 17:48:43 +09:00
Paul Masurel
852ce73849 test passing. 2016-07-23 23:15:10 +09:00
Paul Masurel
68f8c1877e Attempt to add a tfidf multitermscorer. 2016-07-20 21:49:48 +09:00
Paul Masurel
e20acd4ef6 Added doc_freq to searcher. Sums the value for all readers. 2016-07-05 21:37:35 +09:00
Paul Masurel
0c230c9ebf fixed reading docfreq 2016-06-21 10:07:12 +09:00
Paul Masurel
36684d76c5 Added a bunch of tests, changed read_postings to work on a term 2016-06-20 21:27:02 +09:00
Paul Masurel
b93fa39910 Added test for field entry json ser/deser 2016-06-19 23:01:24 +09:00
Paul Masurel
6d30546517 Switch to copy semantics for Field 2016-06-19 00:37:03 +09:00
Paul Masurel
3a8d566563 Started refactoring 2016-06-19 00:19:17 +09:00
Paul Masurel
3d8cd078d1 delegating the search logic to a query trait. 2016-06-12 16:22:40 +09:00
Paul Masurel
cdcc72a0c9 test passing 2016-05-19 11:08:50 +09:00
Paul Masurel
7ede24a957 block encoding is handled in rust. 2016-05-02 18:26:50 +09:00
Paul Masurel
2eb3195904 blop 2016-05-01 15:23:35 +09:00