Commit Graph

523 Commits

Author SHA1 Message Date
Ayush Chaurasia
d59dbf8230 fix: Pydantic 1.x compat for weak_lru caching in embeddings API (#643)
Colab has pydantic 1.x by default and pydantic 1.x BaseModel objects
don't support weakref creation by default that we use to cache embedding
models
https://github.com/lancedb/lancedb/blob/main/python/lancedb/embeddings/utils.py#L206
. It needs to be added to slot.
2024-04-05 16:23:49 -07:00
Ayush Chaurasia
c0a49a9a5b Multi-task instructor model with quantization support & weak_lru cache for embedding function models (#612)
resolves #608
2024-04-05 16:23:49 -07:00
QianZhu
2f2964a645 fix saas open_table and table_names issues (#640)
- added check whether a table exists in SaaS open_table
- remove prefilter not supported warning in SaaS search
- fixed issues for SaaS table_names
2024-04-05 16:23:49 -07:00
Rob Meng
3d50c9cdfe upgrade lance to 0.8.14 (#636)
upgrade lance
2024-04-05 16:23:49 -07:00
Rob Meng
bdb3b46f7e skip missing file on mirrored dir when deleting (#635)
mirrored store is not garueeteed to have all the files. Ignore the ones
that doesn't exist.
2024-04-05 16:23:49 -07:00
Lei Xu
49306a99ba chore: apple silicon runner (#633)
Close #632
2024-04-05 16:23:49 -07:00
Lei Xu
86efd36689 chore: improve create_table API consistency between local and remote SDK (#627) 2024-04-05 16:23:47 -07:00
Bert
20ab85171b fix: node remote connection handles non http errors (#624)
https://github.com/lancedb/lancedb/issues/623

Fixes issue trying to print response status when using remote client. If
the error is not an HTTP error (e.g. dns/network failure), there won't
be a response.
2024-04-05 16:23:14 -07:00
Ayush Chaurasia
159ecbac5a Exponential standoff retry support for handling rate limited embedding functions (#614)
Users ingesting data using rate limited apis don't need to manually make
the process sleep for counter rate limits
resolves #579
2024-04-05 16:23:14 -07:00
Lance Release
148f6d7283 Updating package-lock.json 2024-04-05 16:23:14 -07:00
Lance Release
c604912139 Updating package-lock.json 2024-04-05 16:23:14 -07:00
Lance Release
178af0c2b8 Bump version: 0.3.5 → 0.3.6 2024-04-05 16:23:14 -07:00
Lance Release
c1b037f0a5 [python] Bump version: 0.3.2 → 0.3.3 2024-04-05 16:23:14 -07:00
Lei Xu
3855bdf986 chore: bump lance to 8.10 (#622) 2024-04-05 16:23:14 -07:00
Ayush Chaurasia
07ab4cd14c Disable posthog on docs & reduce sentry trace factor (#607)
- posthog charges per event and docs events are registered very
frequently. We can keep tracking them on GA
- Reduced sentry trace factor
2024-04-05 16:23:13 -07:00
Chang She
531c947fc1 doc: node sdk now supports windows (#616) 2024-04-05 16:22:59 -07:00
Bert
4e9aab9e8b ci: cancel in progress runs on new push (#620) 2024-04-05 16:22:59 -07:00
Bert
cd7a4dd251 fix!: sort table names (#619)
https://github.com/lancedb/lance/issues/1385
2024-04-05 16:22:59 -07:00
QianZhu
3c139c2ee5 Qian/query option doc (#615)
- API documentation improvement for queries (table.search)
- a small bug fix for the remote API on create_table

![image](https://github.com/lancedb/lancedb/assets/1305083/712e9bd3-deb8-4d81-8cd0-d8e98ef68f4e)

![image](https://github.com/lancedb/lancedb/assets/1305083/ba22125a-8c36-4e34-a07f-e39f0136e62c)
2024-04-05 16:22:59 -07:00
Will Jones
166b281d66 increment pylance (#618) 2024-04-05 16:22:59 -07:00
Bert
c9fee0faed added api docs for prefilter flag (#617)
Added the prefilter flag argument to the `LanceQueryBuilder.where`.

This should make it display here:

https://lancedb.github.io/lancedb/python/python/#lancedb.query.LanceQueryBuilder.select

And also in intellisense like this:
<img width="848" alt="image"
src="https://github.com/lancedb/lancedb/assets/5846846/e0c53f4f-96bc-411b-9159-680a6c4d0070">

Also adds some improved documentation about the `where` argument to this
method.

---------

Co-authored-by: Weston Pace <weston.pace@gmail.com>
2024-04-05 16:22:59 -07:00
Weston Pace
301e08f30e feat: allow prefiltering with index (#610)
Support for prefiltering with an index was added in lance version 0.8.7.
We can remove the lancedb check that prevents this. Closes #261
2024-04-05 16:22:59 -07:00
Lei Xu
b5e57ebce3 doc: add doc to use GPU for indexing (#611) 2024-04-05 16:22:59 -07:00
Lance Release
87364532bf Updating package-lock.json 2024-04-05 16:22:59 -07:00
Lance Release
c275ec006f Updating package-lock.json 2024-04-05 16:22:59 -07:00
Lance Release
53b0375e6d Bump version: 0.3.4 → 0.3.5 2024-04-05 16:22:59 -07:00
Bert
6881c50866 fix conv version (#605) 2024-04-05 16:22:59 -07:00
Lance Release
a174832d61 Updating package-lock.json 2024-04-05 16:22:59 -07:00
Lance Release
722cede32b Bump version: 0.3.3 → 0.3.4 2024-04-05 16:22:59 -07:00
Bert
4d086d63eb feat: added dataset stats api to node (#604) 2024-04-05 16:22:59 -07:00
Bert
f5e9c073f0 feat: added data stats apis (#596) 2024-04-05 16:22:59 -07:00
Rob Meng
178e016ff2 expose remap index api (#603)
expose index remap options in `compact_files`
2024-04-05 16:22:59 -07:00
Rob Meng
3c998b020f feat: expose optimize index api (#602)
expose `optimize_index` api.
2024-04-05 16:22:59 -07:00
Lance Release
a3c955070e [python] Bump version: 0.3.1 → 0.3.2 2024-04-05 16:22:59 -07:00
Bert
edeecd3d9f update lance to 0.8.7 (#598) 2024-04-05 16:22:59 -07:00
Chang She
2861f33982 fix(python): fix multiple embedding functions bug (#597)
Closes #594

The embedding functions are pydantic models so multiple instances with
the same parameters are considered ==, which means that if you have
multiple embedding columns it's possible for the embeddings to get
overwritten. Instead we use `is` instead of == to avoid this problem.

testing: modified unit test to include this case
2024-04-05 16:22:59 -07:00
Rob Meng
0036ca9de7 feat: add checkout method to table to reuse existing store and connections (#593)
Prior to this PR, to get a new version of a table, we need to re-open
the table. This has a few downsides w.r.t. performance:
* Object store is recreated, which takes time and throws away existing
warm connections
* Commit handler is thrown aways as well, which also may contain warm
connections
2024-04-05 16:22:59 -07:00
Rob Meng
2826bc7f1a feat: include manifest files in mirrow store (#589) 2024-04-05 16:22:59 -07:00
Will Jones
e37a0566e0 Revert "[python] Bump version: 0.3.2 → 0.3.3"
This reverts commit c30faf6083.
2024-04-05 16:22:59 -07:00
Will Jones
48999ffc27 [python] Bump version: 0.3.2 → 0.3.3 2024-04-05 16:22:59 -07:00
Ayush Chaurasia
0dc893993f [Docs]: Minor Fixes (#587)
* Filename typo
* Remove rick_morty csv as users won't really be able to use it.. We can
create a an executable colab and download it from a bucket or smth.
2024-04-05 16:22:59 -07:00
Ayush Chaurasia
12de39612e [Docs] Embeddings API: Add multi-lingual semantic search example (#582) 2024-04-05 16:22:59 -07:00
Ayush Chaurasia
05509bfb03 [Docs]Versioning docs (#586)
closes #564

---------

Co-authored-by: Chang She <chang@lancedb.com>
2024-04-05 16:22:59 -07:00
Lance Release
fa702f992e Updating package-lock.json 2024-04-05 16:22:59 -07:00
Lance Release
7f707205de Updating package-lock.json 2024-04-05 16:22:59 -07:00
Lance Release
2394ff14d0 Bump version: 0.3.2 → 0.3.3 2024-04-05 16:22:59 -07:00
Chang She
31334b05df chore: bump lance version in python/rust lancedb (#584)
To include latest v0.8.6

Co-authored-by: Chang She <chang@lancedb.com>
2024-04-05 16:22:59 -07:00
Ayush Chaurasia
942976f49f [Docs] Update embedding function docs (#581) 2024-04-05 16:22:59 -07:00
Ayush Chaurasia
507f6087c2 [Python]Embeddings API refactor (#580)
Sets things up for this -> https://github.com/lancedb/lancedb/issues/579
- Just separates out the registry/ingestion code from the function
implementation code
- adds a `get_registry` util
- package name "open-clip" -> "open-clip-torch"
2024-04-05 16:22:59 -07:00
Ayush Chaurasia
39c1cb87ad [Docs] Add posthog telemetry to docs (#577)
Allows creation of funnels and user journeys
2024-04-05 16:22:59 -07:00