Paul Masurel
644d8a3a10
Added snippet generator
2018-09-10 16:39:45 +09:00
Paul Masurel
e32dba1a97
Phrase weight
2018-09-10 09:26:33 +09:00
Paul Masurel
a78f4cca37
Merge branch 'issue/368' into issue/368b
2018-09-09 16:04:20 +09:00
Paul Masurel
2e44f0f099
blop
2018-09-09 14:23:24 +09:00
Vignesh Sarma K
9ccba9f864
Merge branch 'master' into issue/368
2018-09-07 20:27:38 +05:30
Paul Masurel
9101bf5753
Fragments
2018-09-07 09:57:12 +09:00
Paul Masurel
23e97da9f6
Merge branch 'master' of github.com:tantivy-search/tantivy
2018-09-07 08:44:14 +09:00
Paul Masurel
1d439e96f5
Using sort unstable by key.
2018-09-07 08:43:44 +09:00
Paul Masurel
934933582e
Closes #402 ( #403 )
2018-09-06 10:12:26 +09:00
Paul Masurel
98c7fbdc6f
Issue/378 ( #392 )
...
* Added failing unit test
* Closes #378 . Handling queries that end up empty after going through the analyzer.
* Fixed stop word example
2018-09-06 10:11:54 +09:00
Paul Masurel
cec9956a01
Issue/389 ( #405 )
...
* Setting up the dependency.
* Completed README
2018-09-06 10:10:40 +09:00
Paul Masurel
c64972e039
Apply unicode lowercasing. ( #408 )
...
Checks if the str is ASCII, and uses a fast track if it is the case.
If not, uses the std's definition of a lowercase character.
Closes #406
2018-09-05 09:43:56 +09:00
Paul Masurel
b3b2421e8a
Issue/367 ( #404 )
...
* First stab
* Closes #367
2018-09-04 09:17:00 +09:00
Paul Masurel
f570fe37d4
small changes
2018-08-31 09:03:44 +09:00
Paul Masurel
6704ab6987
Added methods to extract the matching terms. First stab
2018-08-30 09:47:19 +09:00
Paul Masurel
a12d211330
Extracting terms matching query in the document
2018-08-30 09:23:34 +09:00
petr-tik
d15efd6635
Closes #235 - adds a new error type ( #398 )
...
error message suggests possible causes
Addressed code review 1 thread + smaller heap size
2018-08-29 08:26:59 +09:00
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
18814ba0c1
add a test for second fragment having higher score
2018-08-28 22:27:56 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
f247935bb9
Use HighlightSection::new rather than just directly creating the object
2018-08-28 22:16:22 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
6a197e023e
ran rustfmt
2018-08-28 20:41:58 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
96a313c6dd
add more tests
2018-08-28 20:41:58 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
fb9b1c1f41
add a test and fix the bug of not calculating first token
2018-08-28 20:41:58 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
e1bca6db9d
update calculate_score to try_add_token
...
`try_add_token` will now update the stop_offset as well.
`FragmentCandidate::new` now just takes `start_offset`,
it expects `try_add_token` to be called to add a token.
2018-08-28 20:41:58 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
8438eda01a
use while let instead of loop and if.
...
as per CR comment
2018-08-28 20:41:57 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
b373f00840
add htmlescape and update to_html fn to use it.
...
tests and imports also updated.
2018-08-28 20:41:57 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
46decdb0ea
compare against accumulator rather than init value
2018-08-28 20:41:41 +05:30
Vignesh Sarma K (വിഘ്നേഷ് ശ൪മ കെ)
835cdc2fe8
Initial version of snippet
...
refer #368
2018-08-28 20:41:41 +05:30
Paul Masurel
19756bb7d6
Getting started on #368
2018-08-28 20:41:41 +05:30
Paul Masurel
2649c8a715
Issue/246 ( #393 )
...
* Moving Range and All to Leaves
* Parsing OR/AND
* Simplify user input ast
* AND and OR supported. Returning an error when mixing syntax
Closes #246
* Added support for NOT
* Updated changelog
2018-08-28 11:03:54 +09:00
Paul Masurel
ede97eded6
Removed use
2018-08-28 09:54:04 +09:00
Dru Sellers
af593b1116
Add default EN stopwords to the default analyzer ( #381 )
...
* Add a default list of en stopwords
* Add the default en stopword filter to the standard tokenizers
* code review feedback
2018-08-22 10:49:39 +09:00
Paul Masurel
3a8e524f77
Added example to show how to access the inverted list directly
2018-08-21 09:36:13 +09:00
Dru Sellers
ef3a16a129
Switch from error-chain to failure crate ( #376 )
...
* Switch from error-chain to failure crate
* Added deprecated alias for
* Started editing the changelog
2018-08-20 09:40:45 +09:00
Paul Masurel
a0a284fe91
Added a full-fledged empty query and relying on it in QueryParser, instead of using an empty clause.
2018-08-20 09:21:32 +09:00
Paul Masurel
60a9a7f837
Added example showing how to delete/update documents
2018-08-17 09:43:55 +09:00
Paul Masurel
3e14a76623
Update regex_query.rs
2018-08-15 16:38:32 +09:00
Vignesh Sarma K
09e00f1d42
add position_length to Token ( #337 )
...
* add position_length to Token
refer #291
* Add term offset to `PhraseQuery`
ref #291
* Add new constructor for `PhraseQuery` that allows custom offset
* fix the method name as per pr comment
* Closes #291
Added unit test.
Using offsets from the analyzer in QueryParser.
2018-08-13 10:14:50 +09:00
Paul Masurel
290620fdee
Added slashes
2018-08-13 09:13:01 +09:00
petr-tik
f0d1b85bd8
N370 pr fix num searchers ( #371 )
...
* Change ordering to Acquire
* set_num_searchers now uses AtomicUsize.store
2018-08-13 08:56:30 +09:00
petr-tik
aaef546f91
Moved NUM_SEARCHERS into a local variable ( #369 )
...
* Moved NUM_SEARCHERS into a local variable
dynamically determined as the number of available cpus.
var name in lowercase (not a constant anymore).
updated it in docstring
* lowercased the varnames
* User can set number of logical cores in create_from_metas
* cargo fmt
* Num_searchers as Arc<AtomicUsize>
Retrieving the value with Relaxed ordering
Reverted create_from_metas signature. However, it calls num_cpus and
sets the Arc val
2018-08-12 20:08:14 +09:00
Paul Masurel
811ddf2226
Closes #364 ( #365 )
...
* Closes #364
* Trying to raise the recursion limit
* Better unit test and bug fix on token offsets
2018-08-08 11:15:20 +09:00
Paul Masurel
79a339d353
Removing env_logger dependency
2018-08-02 19:29:09 +09:00
Paul Masurel
e45e4c79d9
update crossbeam
2018-08-02 19:24:08 +09:00
Paul Masurel
848bf41bc9
Updating rand to 0.5 ( #363 )
2018-08-02 19:19:04 +09:00
Paul Masurel
d11cb087a7
Updated to combine-0.3 ( #362 )
2018-08-02 18:29:58 +09:00
Jacob Brown
2dd7422f42
replace chan with crossbeam-channel ( #361 )
...
* replace chan with crossbeam-channel
* Update Cargo.toml
2018-08-02 12:47:22 +09:00
Paul Masurel
e8707c02c0
Issue/333 ( #335 )
...
* Add skip information for posting list (skip to doc ids)
* Separate num bits from data for positions (skip n positions)
* Address in the position using an n-position offset
* Added a long skip structure to allow efficient opening of the position for a given term.
2018-07-31 10:51:53 +09:00
Paul Masurel
52b4575245
Issue/355 ( #358 )
...
* issue with top_k sorting (#356 )
* Closes #355
2018-07-31 08:24:55 +09:00
Paul Masurel
190e60a41c
Closes #339 . ( #340 )
...
As required per the FacetCollector,
facet values need to be sorted before being encoded in the
multivalued field.
2018-07-25 18:21:48 +09:00
Vignesh Sarma K
b9558801a1
Declare and implement separate Clone Traits ( #336 )
...
For traits, `Directory` and `MergePolicy`.
refer #306
2018-07-18 12:36:43 +09:00