Lance Release
33ab68c790
[python] Bump version: 0.4.4 → 0.5.0
2024-04-05 16:26:36 -07:00
Chang She
dbc3515d96
chore(python): turn off lazy frame ingestion ( #821 )
2024-04-05 16:26:36 -07:00
Chang She
ac3d95ec34
feat(python): allow the entire table to be converted a polars dataframe ( #814 )
2024-04-05 16:26:36 -07:00
Chang She
72b39432e8
feat(python): add exist_ok option to create table ( #813 )
...
This mimics CREATE TABLE IF NOT EXISTS behavior.
We add `db.create_table(..., exist_ok=True)` parameter.
By default it is set to False, so trying to create
a table with the same name will raise an exception.
If set to True, then it only opens the table if it
already exists. If you pass in a schema, it will
be checked against the existing table to make sure
you get what you want. If you pass in data, it will
NOT be added to the existing table.
2024-04-05 16:26:35 -07:00
Ayush Chaurasia
340fd98b42
chore(python): get rid of Pydantic deprication warning in embedding fcn ( #816 )
...
```
UserWarning: Valid config keys have changed in V2:
* 'keep_untouched' has been renamed to 'ignored_types' warnings.warn(message, UserWarning)
```
2024-04-05 16:26:20 -07:00
Anton Shevtsov
dc0b11a86a
Add openai api key not found help ( #815 )
...
This pull request adds check for the presence of an environment variable
`OPENAI_API_KEY` and removes an unused parameter in
`retry_with_exponential_backoff` function.
2024-04-05 16:26:20 -07:00
Chang She
17dcb70076
feat(python): basic polars integration ( #811 )
...
We should now be able to directly ingest polars dataframes and return
results as polars dataframes

2024-04-05 16:26:19 -07:00
Andrew Miracle
8daed93a91
eslint fix
2024-04-05 16:25:52 -07:00
Ayush Chaurasia
2f72d5138e
feat(python): Add gemini text embedding function ( #806 )
...
Named it Gemini-text for now. Not sure how complicated it will be to
support both text and multimodal embeddings under the same class
"gemini"..But its not something to worry about for now I guess.
2024-04-05 16:25:52 -07:00
Andrew Miracle
f1aad1afc7
Merge branch 'main' into tecmie/embeddings-openai
2024-04-05 16:25:51 -07:00
Andrew Miracle
fa13fb9392
rebase from lancedb/main
2024-04-05 16:25:14 -07:00
Lance Release
d39145c7e4
Updating package-lock.json
2024-04-05 16:25:14 -07:00
Lance Release
3463248eba
Bump version: 0.4.2 → 0.4.3
2024-04-05 16:25:14 -07:00
Lance Release
3191966ffb
[python] Bump version: 0.4.3 → 0.4.4
2024-04-05 16:25:14 -07:00
Will Jones
3b119420b2
upgrade lance ( #809 )
2024-04-05 16:25:14 -07:00
Lei Xu
6f7cb75b07
chore: remove black as dependency ( #808 )
...
We use `ruff` in CI and dev workflow now.
2024-04-05 16:25:14 -07:00
Chang She
118a11c9b3
feat(node): align incoming data to table schema ( #802 )
2024-04-05 16:25:14 -07:00
Sebastian Law
70ca6d8ea5
use requests instead of aiohttp for underlying http client ( #803 )
...
instead of starting and stopping the current thread's event loop on
every http call, just make an http call.
2024-04-05 16:25:14 -07:00
Chang She
556e01d9d9
chore(python): add docstring for limit behavior ( #800 )
...
Closes #796
2024-04-05 16:25:14 -07:00
Chang She
1060dde858
feat(python): add phrase query option for fts ( #798 )
...
addresses #797
Problem: tantivy does not expose option to explicitly
Proposed solution here:
1. Add a `.phrase_query()` option
2. Under the hood, LanceDB takes care of wrapping the input in quotes
and replace nested double quotes with single quotes
I've also filed an upstream issue, if they support phrase queries
natively then we can get rid of our manual custom processing here.
2024-04-05 16:25:14 -07:00
Chang She
950e05da81
feat(python): add count_rows with filter option ( #801 )
...
Closes #795
2024-04-05 16:25:14 -07:00
Chang She
2b7754f929
fix(rust): not sure why clippy is suddenly unhappy ( #794 )
...
should fix the error on top of main
https://github.com/lancedb/lancedb/actions/runs/7457190471/job/20288985725
2024-04-05 16:25:14 -07:00
Chang She
d0bff7b78e
feat(python): support new style optional syntax ( #793 )
2024-04-05 16:25:14 -07:00
Chang She
85f3f8793c
chore(python): document phrase queries in fts ( #788 )
...
closes #769
Add unit test and documentation on using quotes to perform a phrase
query
2024-04-05 16:25:14 -07:00
Chang She
a758876a65
feat(node): support table.schema for LocalTable ( #789 )
...
Close #773
we pass an empty table over IPC so we don't need to manually deal with
serde. Then we just return the schema attribute from the empty table.
---------
Co-authored-by: albertlockett <albert.lockett@gmail.com >
2024-04-05 16:25:14 -07:00
Lei Xu
073a2a1b28
chore: bump lance to 0.9.5 ( #790 )
2024-04-05 16:25:14 -07:00
Chang She
195c106242
feat(python): Set heap size to get faster fts indexing performance ( #762 )
...
By default tantivy-py uses 128MB heapsize. We change the default to 1GB
and we allow the user to customize this
locally this makes `test_fts.py` run 10x faster
2024-04-05 16:25:13 -07:00
Lance Release
f0a654036e
Updating package-lock.json
2024-04-05 16:25:02 -07:00
lucasiscovici
792830ccb5
raise exception if fts index does not exist ( #776 )
...
raise exception if fts index does not exist
---------
Co-authored-by: Chang She <759245+changhiskhan@users.noreply.github.com >
2024-04-05 16:25:02 -07:00
Lance Release
162f8536d1
Updating package-lock.json
2024-04-05 16:25:02 -07:00
sudhir
5d198327bb
Make examples work with current version of Openai api's ( #779 )
...
These examples don't work because of changes in openai api from version
1+
2024-04-05 16:25:02 -07:00
Lance Release
55cc3ed5a2
Bump version: 0.4.2 → 0.4.3
2024-04-05 16:25:02 -07:00
Chris
b11428dddb
Minor Fixes to Ingest Embedding Functions Docs ( #777 )
...
Addressed minor typos and grammatical issues to improve readability
---------
Co-authored-by: Christopher Correa <chris.correa@gmail.com >
2024-04-05 16:25:02 -07:00
Lance Release
1387dc6e48
[python] Bump version: 0.4.3 → 0.4.4
2024-04-05 16:25:02 -07:00
Vladimir Varankin
84c6c8f08c
Minor corrections for docs of embedding_functions ( #780 )
...
In addition to #777 , this pull request fixes more typos in the
documentation for "Ingest Embedding Functions".
2024-04-05 16:25:02 -07:00
Will Jones
63e273606e
upgrade lance ( #809 )
2024-04-05 16:25:02 -07:00
QianZhu
35f83694be
small bug fix for example code in SaaS JS doc ( #770 )
2024-04-05 16:25:02 -07:00
Lei Xu
45b006d68c
chore: remove black as dependency ( #808 )
...
We use `ruff` in CI and dev workflow now.
2024-04-05 16:25:02 -07:00
Chang She
20208b9efb
chore(python): handle NaN input in fts ingestion ( #763 )
...
If the input text is None, Tantivy raises an error
complaining it cannot add a NoneType. We handle this
upstream so None's are not added to the document.
If all of the indexed fields are None then we skip
this document.
2024-04-05 16:25:02 -07:00
Bengsoon Chuah
c00af75d63
Add relevant imports for each step ( #764 )
...
I found that it was quite incoherent to have to read through the
documentation and having to search which submodule that each class
should be imported from.
For example, it is cumbersome to have to navigate to another
documentation page to find out that `EmbeddingFunctionRegistry` is from
`lancedb.embeddings`
2024-04-05 16:25:02 -07:00
QianZhu
21245dfb9d
SaaS JS API sdk doc ( #740 )
...
Co-authored-by: Aidan <64613310+aidangomar@users.noreply.github.com >
2024-04-05 16:25:02 -07:00
Chang She
81487f10fe
feat(js): support list of string input ( #755 )
...
Add support for adding lists of string input (e.g., list of categorical
labels)
Follow-up items: #757 #758
2024-04-05 16:25:02 -07:00
Lance Release
3aa233f38a
Updating package-lock.json
2024-04-05 16:25:02 -07:00
Lance Release
3278fa75d1
Bump version: 0.4.1 → 0.4.2
2024-04-05 16:25:02 -07:00
Lance Release
549f2bf396
[python] Bump version: 0.4.2 → 0.4.3
2024-04-05 16:25:02 -07:00
Lei Xu
138760bc6e
chore: bump pylance to 0.9.2 ( #754 )
2024-04-05 16:25:02 -07:00
Xin Hao
0bddf77a73
docs: fix link ( #752 )
2024-04-05 16:25:02 -07:00
Chang She
154dc508ba
feat(python): first cut batch queries for remote api ( #753 )
...
issue separate requests under the hood and concatenate results
2024-04-05 16:25:02 -07:00
Lance Release
0b8fe76590
[python] Bump version: 0.4.1 → 0.4.2
2024-04-05 16:25:02 -07:00
Chang She
c22eacb8b6
chore(python): update embedding API to use openai 1.6.1 ( #751 )
...
API has changed significantly, namely `openai.Embedding.create` no
longer exists.
https://github.com/openai/openai-python/discussions/742
Update the OpenAI embedding function and put a minimum on the openai sdk
version.
2024-04-05 16:25:02 -07:00