This PR adds an overview of embeddings to the docs:
- two ways to vectorize your data using lancedb - explicit & implicit
- explicit - manually vectorize your data with the `with_embeddings`
function (see the sketch after this list)
- implicit - automatically vectorize your data as it is ingested, by
storing your embedding function details as table metadata
- Multi-modal example w/ disappearing embedding function
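A minimal sketch of the explicit path, assuming `with_embeddings` appends
the computed vectors as a `vector` column (the `embed` function here is a
hypothetical stand-in for a real model):
```
import lancedb
import pandas as pd
from lancedb.embeddings import with_embeddings

def embed(batch):
    # Hypothetical embedding: returns one 2-d vector per input string.
    return [[float(len(s)), 1.0] for s in batch]

df = pd.DataFrame({"text": ["hello world", "goodbye world"]})
data = with_embeddings(embed, df, column="text")

db = lancedb.connect("/tmp/demo-db")
tbl = db.create_table("demo", data)
```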
Add `to_list` to return query results as a list of Python dicts (so we're
not too pandas-centric). Closes #555
Add a `to_pandas` API and a deprecation warning on `to_df`. Closes #545
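For example (hypothetical query vector; `tbl` is any LanceDB table):
```
results = tbl.search([0.1, 0.2]).limit(2).to_list()  # list of Python dicts
df = tbl.search([0.1, 0.2]).limit(2).to_pandas()     # pandas DataFrame
old = tbl.search([0.1, 0.2]).limit(2).to_df()        # still works, now warns
```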
Co-authored-by: Chang She <chang@lancedb.com>
We have experimental support for prefiltering (without ANN) in pylance.
This means that we can now apply a filter BEFORE vector search is
performed. This can be done via the `.where(filter_string,
prefilter=True)` kwarg of the query.
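A minimal sketch (hypothetical data and filter; see the limitations below):
```
results = (
    tbl.search([0.5, 0.2])
    .where("price > 10.0", prefilter=True)  # filter first, then vector search
    .limit(5)
    .to_list()
)
```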
Limitations:
- When connecting to LanceDB Cloud, `prefilter=True` raises
`NotImplementedError`
- When an ANN index is present, `prefilter=True` raises
`NotImplementedError`
- This option is not available for full-text search queries
- This option is not available for empty search queries (just
filter/project)
Additional changes in this PR:
- Bump pylance version to v0.8.0 which supports the experimental
prefiltering.
---------
Co-authored-by: Chang She <chang@lancedb.com>
The `attr` project is unrelated to `attrs`, which also provides the `attr`
namespace (see also <https://hynek.me/articles/import-attrs/>).
It used to _usually_ work because `attrs` is a dependency of aiohttp and
somehow took precedence over the `attr` project's own `attr` module.
Yes, sorry, it's a mess.
1. Support persistent embedding functions so users can search with just a
query string (see the sketch after this list)
2. Add fixed size list conversion for multiple vector columns
3. Add support for empty queries (just apply select/where/limit).
4. Refactor and simplify some of the data prep code
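A minimal sketch of item 1, assuming `tbl` was created with an embedding
function persisted in its metadata (hypothetical query text):
```
# The stored embedding function vectorizes the query string automatically,
# so no explicit embedding call is needed at search time.
results = tbl.search("pasta recipes").limit(3).to_list()
```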
---------
Co-authored-by: Chang She <chang@lancedb.com>
Co-authored-by: Weston Pace <weston.pace@gmail.com>
Combine delete and append to make a temporary update feature that is
only enabled for the local Python lancedb.
The reason this is temporary is that it first has to load the
data matching the where clause into memory, which is technically
unbounded.
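A minimal sketch, assuming the feature is exposed as `LanceTable.update`
(hypothetical column names and values):
```
# Matching rows are loaded into memory, deleted, and re-appended with the
# new values, so keep the where clause selective.
tbl.update(where="item = 'foo'", values={"price": 12.0})
```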
---------
Co-authored-by: Chang She <chang@lancedb.com>
Previously, if you needed to add a column to a table, you'd have to
rewrite the whole table. Instead,
we use the merge functionality from the Lance format
to incrementally add columns from another table
or dataframe.
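A minimal sketch, assuming the feature surfaces as `LanceTable.merge`
(hypothetical column and dataframe names):
```
import pandas as pd

# Join on `item` to add an `in_stock` column without rewriting `tbl`.
extra = pd.DataFrame({"item": ["foo", "bar"], "in_stock": [True, False]})
tbl.merge(extra, left_on="item")
```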
---------
Co-authored-by: Chang She <chang@lancedb.com>
Co-authored-by: Weston Pace <weston.pace@gmail.com>
Previously the temporary restore feature required copying data. The new
feature in pylance does not.
---------
Co-authored-by: Chang She <chang@lancedb.com>
Co-authored-by: Weston Pace <weston.pace@gmail.com>
It improves the UX: iterators can now yield any data type supported by the
table (plus RecordBatch), with no separate requirements.
Also expands the test cases for pydantic & arrow schemas.
If this looks good I'll update the docs.
Example usage:
```
import lancedb
import pandas as pd
import pyarrow as pa

from lancedb.pydantic import LanceModel, vector


class Content(LanceModel):
    vector: vector(2)
    item: str
    price: float


def make_batches():
    for _ in range(5):
        yield from [
            # pandas
            pd.DataFrame({
                "vector": [[3.1, 4.1], [1, 1]],
                "item": ["foo", "bar"],
                "price": [10.0, 20.0],
            }),
            # pylist
            [
                {"vector": [3.1, 4.1], "item": "foo", "price": 10.0},
                {"vector": [5.9, 26.5], "item": "bar", "price": 20.0},
            ],
            # recordbatch
            pa.RecordBatch.from_arrays(
                [
                    pa.array([[3.1, 4.1], [5.9, 26.5]], pa.list_(pa.float32(), 2)),
                    pa.array(["foo", "bar"]),
                    pa.array([10.0, 20.0]),
                ],
                ["vector", "item", "price"],
            ),
            # pydantic list
            [
                Content(vector=[3.1, 4.1], item="foo", price=10.0),
                Content(vector=[5.9, 26.5], item="bar", price=20.0),
            ],
        ]


db = lancedb.connect("db")
tbl = db.create_table("tabley", make_batches(), schema=Content, mode="overwrite")
tbl.add(make_batches())
```
The same should work with an Arrow schema.
---------
Co-authored-by: Weston Pace <weston.pace@gmail.com>
This adds LanceTable.restore as a temporary feature. It reads data from
a previous version and creates
a new snapshot version using that data. This makes the version writeable,
unlike checkout. This should be replaced once the feature is implemented
in pylance.
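A minimal sketch, assuming checkout and restore on the synchronous table API
(hypothetical version number):
```
tbl.checkout(1)  # read-only view of version 1
tbl.restore()    # writes version 1's data back as a new, writeable version
```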
Co-authored-by: Chang She <chang@lancedb.com>
BREAKING CHANGE: The `score` column has been renamed to `_distance` to
more accurately describe the semantics (smaller means closer / better).
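For example (hypothetical values):
```
hits = tbl.search([3.0, 4.0]).limit(1).to_list()
# [{"vector": [3.1, 4.1], "item": "foo", "_distance": 0.04}]  # was "score"
```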
---------
Co-authored-by: Lei Xu <lei@lancedb.com>
Fixes #416.
Added a `drop_database()` method, which deletes all the tables from the
database with a single command.
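For example:
```
import lancedb

db = lancedb.connect("/tmp/demo-db")
db.create_table("t1", [{"vector": [1.0, 2.0]}])
db.create_table("t2", [{"vector": [3.0, 4.0]}])
db.drop_database()  # removes every table in one call
```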
---------
Signed-off-by: Ashis Kumar Naik <ashishami2002@gmail.com>
Saves users from having to explicitly call
`LanceModel.to_arrow_schema()` when creating an empty table.
See new docs for full details.
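A minimal sketch (hypothetical model; `db` is an open connection):
```
from lancedb.pydantic import LanceModel, vector

class Item(LanceModel):
    vector: vector(2)
    name: str

# Previously: db.create_table("items", schema=Item.to_arrow_schema())
tbl = db.create_table("items", schema=Item)
```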
---------
Co-authored-by: Chang She <chang@lancedb.com>
Makes the following work, so that all the formats accepted by `create_table()`
are also accepted by `add()`:
```
import lancedb
import pyarrow as pa

db = lancedb.connect("/tmp")


def make_batches():
    for _ in range(5):
        yield pa.RecordBatch.from_arrays(
            [
                pa.array([[3.1, 4.1], [5.9, 26.5]]),
                pa.array(["foo", "bar"]),
                pa.array([10.0, 20.0]),
            ],
            ["vector", "item", "price"],
        )


schema = pa.schema([
    pa.field("vector", pa.list_(pa.float32())),
    pa.field("item", pa.utf8()),
    pa.field("price", pa.float32()),
])
tbl = db.create_table("table4", make_batches(), schema=schema)
tbl.add(make_batches())
```