Will Jones
cd32944e54
feat: upgrade lance to v0.17.0 ( #1608 )
...
Changelog: https://github.com/lancedb/lance/releases/tag/v0.17.0
Highlights:
* You can do "phrase queries" by adding double quotes around phrases
(multiple tokens) in FTS.
Added follow ups in: https://github.com/lancedb/lancedb/issues/1611
2024-09-06 14:10:02 -07:00
Jon X
7eb3b52297
docs: added a blank line between a paragraph and a list block ( #1604 )
...
Though the markdown can be rendered well on GitHub (GFM style?), but it
seems that it's required to insert a blank line between a paragraph and
a list block to make it render well with `mkdocs`?
see also the web page:
https://lancedb.github.io/lancedb/concepts/index_hnsw/
2024-09-06 09:38:19 +05:30
BubbleCal
8dcd328dce
feat: support to create table from record batch iterator ( #1593 )
2024-09-06 10:41:38 +08:00
Philip Zeyliger
1d61717d0e
docs: fix get_registry() usage ( #1601 )
...
Docs used `get_registry.get(...)` whereas what works is
`get_registry().get(...)`. Fixing the two instances I found. I tested
the open clip version by trying it locally in a Jupyter notebook.
2024-09-06 01:48:24 +05:30
Lei Xu
4ee7225e91
ci: public java package ( #1485 )
...
Co-authored-by: Lu Qiu <luqiujob@gmail.com >
2024-09-05 11:48:48 -07:00
Rithik Kumar
2bc7dca3ca
docs: add changes to Embeddings-> Available models-> overview page ( #1596 )
...
adding features and improvements to - Manage Embeddings page
Before:

After:



2024-09-05 22:19:08 +05:30
Gagan Bhullar
b24810a011
feat(python, rust): expose offset in query ( #1556 )
...
PR is part of #1555
2024-09-05 08:33:07 -07:00
Jon X
2b8e872be0
docs: removed the unnecessary fence code tag ( #1599 )
2024-09-05 14:40:38 +05:30
Ayush Chaurasia
03ef1dc081
feat: update default reranker to RRF ( #1580 )
...
- Both LinearCombination (the current default) and RRF are pretty fast
compared to model based rerankers. RRF is slightly faster.
- In our tests RRF has also been slightly more accurate.
This PR:
- Makes RRF the default reranker
- Removed duplicate docs for rerankers
2024-09-03 14:00:13 +05:30
Rithik Kumar
fde636ca2e
docs: fix links - quick start to embedding ( #1591 )
2024-09-02 21:55:35 +05:30
Ayush Chaurasia
51966a84f5
docs: add multi-vector reranking, answerdotai and studies section ( #1579 )
2024-08-31 04:09:14 +05:30
Rithik Kumar
38015ffa7c
docs: improve overall language on all example pages ( #1582 )
...
Refine and improve the language clarity and quality across all example
pages in the documentation to ensure better understanding and
readability.
---------
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-31 03:48:11 +05:30
Ayush Chaurasia
dc72ece847
feat!: better api for manual hybrid queries ( #1575 )
...
Currently, the only documented way of performing hybrid search is by
using embedding API and passing string queries that get automatically
embedded. There are use cases where users might like to pass vectors and
text manually instead.
This ticket contains more information and historical context -
https://github.com/lancedb/lancedb/issues/937
This breaks a undocumented pathway that allowed passing (vector, text)
tuple queries which was intended to be temporary, so this is marked as a
breaking change. For all practical purposes, this should not really
impact most users
### usage
```
results = table.search(query_type="hybrid")
.vector(vector_query)
.text(text_query)
.limit(5)
.to_pandas()
```
2024-08-30 17:37:58 +05:30
BubbleCal
1521435193
fix: specify column to search for FTS ( #1572 )
...
Before this we ignored the `fts_columns` parameter, and for now we
support to search on only one column, it could lead to an error if we
have multiple indexed columns for FTS
---------
Signed-off-by: BubbleCal <bubble-cal@outlook.com >
2024-08-29 23:43:46 +08:00
Ayush Chaurasia
bfe8fccfab
docs: add hnsw docs ( #1570 )
2024-08-29 15:16:27 +05:30
Rithik Kumar
6f6eb170a9
docs: revamp Python example: Overview page and remove redundant examples and notebooks ( #1574 )
...
before:

After:

---------
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-29 13:48:10 +05:30
Rithik Kumar
dd1c16bbaf
docs: fix links, convert backslash to forward slash in mkdocs.yml ( #1571 )
...
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-28 16:07:57 +05:30
Gagan Bhullar
a76186ee83
fix(node): read consistency level fix ( #1567 )
...
PR fixes #1565
2024-08-27 17:03:42 -07:00
Rithik Kumar
ae85008714
docs: revamp embedding models ( #1568 )
...
before:

After:

---------
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-27 17:14:35 +05:30
Gagan Bhullar
a85f039352
fix(bug): limit fix ( #1548 )
...
PR fixes #1151
2024-08-26 14:25:14 -07:00
Bill Chambers
9c25998110
docs: update serverless_lancedb_with_s3_and_lambda.md ( #1559 )
2024-08-26 14:55:28 +05:30
Ayush Chaurasia
549ca51a8a
feat: add answerdotai rerankers support and minor improvements ( #1560 )
...
This PR:
- Adds missing license headers
- Integrates with answerdotai Rerankers package
- Updates ColbertReranker to subclass answerdotai package. This is done
to keep backwards compatibility as some users might be used to importing
ColbertReranker directly
- Set `trust_remote_code` to ` True` by default in CrossEncoder and
sentence-transformer based rerankers
2024-08-26 13:25:10 +05:30
Rithik Kumar
632007d0e2
docs: add recommender system example ( #1561 )
...
before:

After:

---------
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-25 12:30:30 +05:30
Lance Release
02d85a4ea4
Updating package-lock.json
2024-08-23 13:56:54 +00:00
Lance Release
a9d0625e2b
Bump version: 0.10.0-beta.0 → 0.10.0-beta.1
v0.10.0-beta.1
2024-08-23 13:56:34 +00:00
Lance Release
89bcc1b2e7
Bump version: 0.13.0-beta.0 → 0.13.0-beta.1
python-v0.13.0-beta.1
2024-08-23 13:56:30 +00:00
rahuljo
6ad5553eca
docs: add dlt-lancedb integration page ( #1551 )
...
Co-authored-by: Akela Drissner-Schmid <32450038+akelad@users.noreply.github.com >
2024-08-22 15:18:49 +05:30
Gagan Bhullar
6eb7ccfdee
fix: rerank attribute unknown ( #1554 )
...
PR fixes #1550
2024-08-22 11:46:36 +05:30
Rithik Kumar
758c82858f
docs: add AI agent example ( #1553 )
...
before:

After:

---------
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-22 00:54:05 +05:30
Rithik Kumar
0cbc9cd551
docs: add evaluation example ( #1552 )
...
before:

After:

---------
Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com >
2024-08-21 20:37:04 +05:30
Ayush Chaurasia
7d65dd97cf
chore(python): update Colbert architecture and minor improvements ( #1547 )
...
- Update ColBertReranker architecture: The current implementation
doesn't use the right arch. This PR uses the implementation in Rerankers
library. Fixes https://github.com/lancedb/lancedb/issues/1546
Benchmark diff (hit rate):
Hybrid - 91 vs 87
reranked vector - 85 vs 80
- Reranking in FTS is basically disabled in main after last week's FTS
updates. I think there's no blocker in supporting that?
- Allow overriding accelerators: Most transformer based Rerankers and
Embedding automatically select device. This PR allows overriding those
settings by passing `device`. Fixes:
https://github.com/lancedb/lancedb/issues/1487
---------
Co-authored-by: BubbleCal <bubble-cal@outlook.com >
2024-08-21 12:26:52 +05:30
Ayush Chaurasia
85bb7e54e4
docs: missing griffe dependency for mkdocs deployment ( #1545 )
2024-08-19 07:48:23 +05:30
Rithik Kumar
21014cab45
docs: add chatbot example and improve quality of other examples ( #1544 )
2024-08-17 12:35:33 +05:30
Lei Xu
5857cb4c6e
docs: add a section to describe scalar index ( #1495 )
2024-08-16 18:48:29 -07:00
Rithik Kumar
09ce6c5bb5
docs: add vector search example ( #1543 )
2024-08-16 21:30:45 +05:30
BubbleCal
0fa50775d6
feat: support to query/index FTS on RemoteTable/AsyncTable ( #1537 )
...
Signed-off-by: BubbleCal <bubble-cal@outlook.com >
2024-08-16 12:01:05 +08:00
Gagan Bhullar
20faa4424b
feat(python): add delete unverified parameter ( #1542 )
...
PR fixes #1527
2024-08-15 09:01:32 -07:00
BubbleCal
b624fc59eb
docs: add create_fts_index doc in Python API Reference ( #1533 )
...
resolve #1313
---------
Signed-off-by: BubbleCal <bubble-cal@outlook.com >
2024-08-15 11:35:16 +08:00
Gagan Bhullar
d2caa5e202
feat(nodejs): add delete unverified ( #1530 )
...
PR fixes part of #1527
2024-08-14 08:53:53 -07:00
BubbleCal
501817cfac
chore: bump the required python version to 3.9 ( #1541 )
...
Signed-off-by: BubbleCal <bubble-cal@outlook.com >
2024-08-14 08:44:31 -07:00
Ryan Green
b3daa25f46
feat: allow new scalar index types to be created in remote table ( #1538 )
2024-08-13 16:05:42 -02:30
Matt Basta
6008a8257b
fix: remove native.d.ts from .npmignore ( #1531 )
...
This removes the type definitions for a number of important TypeScript
interfaces from `.npmignore` so that the package is not incorrectly
typed `any` in a number of places.
---
Presently the `opts` argument to `lancedb.connect` is typed `any`, even
though it shouldn't be.
<img width="560" alt="image"
src="https://github.com/user-attachments/assets/5c974ce8-5a59-44a1-935d-cbb808f0ea24 ">
Clicking into the type definitions for the published package, it has the
correct type signature:
<img width="831" alt="image"
src="https://github.com/user-attachments/assets/6e39a519-13ff-4ca8-95ae-85538ac59d5d ">
However, `ConnectionOptions` is imported from `native.js` (along with a
number of other imports a bit further down):
<img width="384" alt="image"
src="https://github.com/user-attachments/assets/10c1b055-ae78-4088-922e-2816af64c23c ">
This is not otherwise an issue, except that the type definitions for
`native.js` are not included in the published package:
<img width="217" alt="image"
src="https://github.com/user-attachments/assets/f15cd3b6-a8de-4011-9fa2-391858da20ec ">
I haven't compiled the Rust code and run the build script, but I
strongly suspect that disincluding the type definitions in `.npmignore`
is ultimately the root cause here.
2024-08-13 10:06:15 -07:00
Lance Release
aaff43d304
Updating package-lock.json
2024-08-12 19:48:18 +00:00
Lance Release
d4c3a8ca87
Bump version: 0.9.0 → 0.10.0-beta.0
v0.10.0-beta.0
2024-08-12 19:48:02 +00:00
Lance Release
ff5bbfdd4c
Bump version: 0.12.0 → 0.13.0-beta.0
python-v0.13.0-beta.0
2024-08-12 19:47:57 +00:00
Lei Xu
694ca30c7c
feat(nodejs): add bitmap and label list index types in nodejs ( #1532 )
2024-08-11 12:06:02 -07:00
Lei Xu
b2317c904d
feat: create bitmap and label list scalar index using python async api ( #1529 )
...
* Expose `bitmap` and `LabelList` scalar index type via Rust and Async
Python API
* Add documents
2024-08-11 09:16:11 -07:00
BubbleCal
613f3063b9
chore: upgrade lance to 0.16.1 ( #1524 )
...
Signed-off-by: BubbleCal <bubble-cal@outlook.com >
2024-08-09 19:18:05 +08:00
BubbleCal
5d2cd7fb2e
chore: upgrade object_store to 0.10.2 ( #1523 )
...
To use the same version with lance
Signed-off-by: BubbleCal <bubble-cal@outlook.com >
2024-08-09 12:03:46 +08:00
Ayush Chaurasia
a88e9bb134
docs: add lancedb embedding fcn on cloud docs ( #1521 )
2024-08-09 07:21:04 +05:30