Commit Graph

212 Commits

Author SHA1 Message Date
Chang She
f485378ea4 Basic full text search capabilities (#62)
This is v1 of integrating full text search index into LanceDB.

# API
The query API is roughly the same as before, except if the input is text
instead of a vector we assume that its fts search.

## Example
If `table` is a LanceDB LanceTable, then:

Build index: `table.create_fts_index("text")`

Query: `df = table.search("puppy").limit(10).select(["text"]).to_df()`

# Implementation
Here we use the tantivy-py package to build the index. We then use the
row id's as the full-text-search index's doc id then we just do a Take
operation to fetch the rows.

# Limitations

1. don't support incremental row appends yet. New data won't show up in
search
2. local filesystem only 
3. requires building tantivy explicitly

---------

Co-authored-by: Chang She <chang@lancedb.com>
2023-05-24 22:25:31 -06:00
Chang She
59014a01e0 bump version for v0.1.2 2023-05-05 11:27:09 -07:00
Chang She
afa7fe19e6 bump version for v0.1.1 2023-04-26 16:55:25 -07:00
Chang She
7a375185a1 increment lance version to include cosine distance fix 2023-04-25 19:57:58 -07:00
Chang She
342b726ed7 bump version for v0.1 2023-04-19 23:26:46 -07:00
Chang She
3ba7fa15a4 bump version for v0.0.4 2023-04-18 09:20:53 -07:00
Chang She
c32b6880e7 bump version for v0.0.3 2023-03-30 19:19:29 -07:00
Chang She
5af64d72f3 bump version to v0.0.2 2023-03-24 12:04:20 -07:00
Chang She
b91139d3c7 Add tutorial notebook
Convert contextualization and embeddings functionality.
And use it with converted notebook for video search
2023-03-23 15:07:58 -07:00
Chang She
b3e7673c35 enable source dist and bdist builds 2023-03-22 19:46:15 -07:00
Chang She
01db9417fa add ruff and black pre-commit hook 2023-03-22 17:59:15 -07:00
Chang She
8800c0129f add js package 2023-03-17 19:15:05 -07:00