lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2026-05-13 18:10:41 +00:00

Author	SHA1	Message	Date
Weston Pace	5c759505b8	feat: upgrade lance 0.22.1b1 (#2029 ) Now the version actually exists :)	2025-01-15 07:37:37 -08:00
BubbleCal	d57bed90e5	docs: add missing example code (#2025 )	2025-01-14 21:17:05 -08:00
BubbleCal	648327e90c	docs: show how to pack bits for binary vector (#2020 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2025-01-14 09:00:57 -08:00
Lance Release	995bd9bf37	Bump version: 0.18.0-beta.1 → 0.18.0	2025-01-14 01:02:26 +00:00
Lance Release	36cc06697f	Bump version: 0.18.0-beta.0 → 0.18.0-beta.1	2025-01-14 01:02:25 +00:00
Will Jones	92dcf24b0c	feat: upgrade Lance to v0.22.0 (#2017 ) Upstream changelog: https://github.com/lancedb/lance/releases/tag/v0.22.0	2025-01-13 15:06:01 -08:00
BubbleCal	66cbf6b6c5	feat: support multivector type (#2005 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2025-01-13 14:10:40 -08:00
Prashant Dixit	b66cd943a7	fix: broken voyageai embedding API (#2013 ) This PR fixes the broken Embedding API for Voyageai.	2025-01-13 08:52:38 -08:00
Weston Pace	d8d11f48e7	feat: upgrade to lance 0.22.0b1 (#2011 )	2025-01-10 12:51:52 -08:00
Lance Release	fbffe532a8	Bump version: 0.17.2-beta.2 → 0.18.0-beta.0	2025-01-10 19:01:20 +00:00
Will Jones	6eacae18c4	test: fix test failure from merge (#2007 )	2025-01-09 11:27:24 -08:00
Bert	f4afe456e8	feat!: change default from postfiltering to prefiltering for sync python (#2000 ) BREAKING CHANGE: prefiltering is now the default in the synchronous python SDK resolves: #1872	2025-01-08 19:13:58 -05:00
Renato Marroquin	ea5c2266b8	feat(python): support .rerank() on non-hybrid queries in Async API (WIP) (#1972 ) Fixes https://github.com/lancedb/lancedb/issues/1950 --------- Co-authored-by: Renato Marroquin <renato.marroquin@oracle.com>	2025-01-08 16:42:47 -05:00
Will Jones	c557e77f09	feat(python)!: support inserting and upserting subschemas (#1965 ) BREAKING CHANGE: For a field "vector", list of integers will now be converted to binary (uint8) vectors instead of f32 vectors. Use float values instead for f32 vectors. * Adds proper support for inserting and upserting subsets of the full schema. I thought I had previously implemented this in #1827, but it turns out I had not tested carefully enough. * Refactors `_santize_data` and other utility functions to be simpler and not require `numpy` or `combine_chunks()`. * Added a new suite of unit tests to validate sanitization utilities. ## Examples ```python import pandas as pd import lancedb db = lancedb.connect("memory://demo") intial_data = pd.DataFrame({ "a": [1, 2, 3], "b": [4, 5, 6], "c": [7, 8, 9] }) table = db.create_table("demo", intial_data) # Insert a subschema new_data = pd.DataFrame({"a": [10, 11]}) table.add(new_data) table.to_pandas() ``` ``` a b c 0 1 4.0 7.0 1 2 5.0 8.0 2 3 6.0 9.0 3 10 NaN NaN 4 11 NaN NaN ``` ```python # Upsert a subschema upsert_data = pd.DataFrame({ "a": [3, 10, 15], "b": [6, 7, 8], }) table.merge_insert(on="a").when_matched_update_all().when_not_matched_insert_all().execute(upsert_data) table.to_pandas() ``` ``` a b c 0 1 4.0 7.0 1 2 5.0 8.0 2 3 6.0 9.0 3 10 7.0 NaN 4 11 NaN NaN 5 15 8.0 NaN ```	2025-01-08 10:11:10 -08:00
BubbleCal	3c0a64be8f	feat: support distance range in queries (#1999 ) this also updates the docs --------- Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2025-01-08 11:03:27 +08:00
Will Jones	0e496ed3b5	docs: contributing guide (#1970 ) * Adds basic contributing guides. * Simplifies Python development with a Makefile.	2025-01-07 15:11:16 -08:00
QianZhu	17c9e9afea	docs: add async examples to doc (#1941 ) - added sync and async tabs for python examples - moved python code to tests/docs --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2025-01-07 15:10:25 -08:00
Gagan Bhullar	b474f98049	feat(python): `flatten` in `AsyncQuery` (#1967 ) PR fixes #1949 --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2025-01-06 10:52:03 -08:00
Takahiro Ebato	2c05ffed52	feat(python): add `to_polars` to `AsyncQueryBase` (#1986 ) Fixes https://github.com/lancedb/lancedb/issues/1952 Added `to_polars` method to `AsyncQueryBase`.	2025-01-06 09:35:28 -08:00
Lance Release	a27c5cf12b	Bump version: 0.17.2-beta.1 → 0.17.2-beta.2	2025-01-06 05:34:27 +00:00
BubbleCal	f4dea72cc5	feat: support vector search with distance thresholds (#1993 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2025-01-06 13:23:39 +08:00
Lei Xu	f76c4a5ce1	chore: add pyright static type checking and fix some of the table interface (#1996 ) * Enable `pyright` in the project * Fixed some pyright typing errors in `table.py`	2025-01-04 15:24:58 -08:00
BubbleCal	445a312667	fix: selecting columns failed on FTS and hybrid search (#1991 ) it reports error `AttributeError: 'builtins.FTSQuery' object has no attribute 'select_columns'` because we missed `select_columns` method in rust Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2025-01-03 13:08:12 +08:00
Lance Release	92d845fa72	Bump version: 0.17.2-beta.0 → 0.17.2-beta.1	2024-12-31 23:36:18 +00:00
Lei Xu	397813f6a4	chore: bump pylance to 0.21.1b1 (#1989 )	2024-12-31 15:34:27 -08:00
Lei Xu	50c30c5d34	chore(python): fix typo of the synchronized checkout API (#1988 )	2024-12-30 18:54:31 -08:00
Renato Marroquin	0cb6da6b7e	docs: add new indexes to python docs (#1945 ) closes issue #1855 Co-authored-by: Renato Marroquin <renato.marroquin@oracle.com>	2024-12-28 15:35:10 -08:00
BubbleCal	aec8332eb5	chore: add `dynamic = ["version"]` to pass build check (#1977 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2024-12-28 10:45:23 -08:00
Lance Release	dae8334d0b	Bump version: 0.17.1 → 0.17.2-beta.0	2024-12-25 08:28:59 +00:00
BubbleCal	16cf2990f3	feat: create IVF_FLAT on remote table (#1978 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2024-12-25 14:57:07 +08:00
Lance Release	27404c8623	Bump version: 0.17.1-beta.7 → 0.17.1	2024-12-24 18:37:28 +00:00
Lance Release	f181c7e77f	Bump version: 0.17.1-beta.6 → 0.17.1-beta.7	2024-12-24 18:37:27 +00:00
BubbleCal	e70fd4fecc	feat: support IVF_FLAT, binary vectors and hamming distance (#1955 ) binary vectors and hamming distance can work on only IVF_FLAT, so introduce them all in this PR. --------- Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2024-12-24 10:36:20 -08:00
verma nakul	ac0068b80e	feat(python): add `ignore_missing` to the async drop_table() method (#1953 ) - feat(db): add `ignore_missing` to async `drop_table` method Fixes #1951 --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2024-12-24 10:33:47 -08:00
Hezi Zisman	ebac960571	feat(python): add `bypass_vector_index` to sync api (#1947 ) Hi lancedb team, This PR adds the `bypass_vector_index` logic to the sync API, as described in [Issue #535](https://github.com/lancedb/lancedb/issues/535). (Closes #535). Iv'e implemented it only for the regular vector search. If you think it should also be supported for FTS, Hybrid, or Empty queries and for the cloud solution, please let me know, and I’ll be happy to extend it. Since there’s no `CONTRIBUTING.md` or contribution guidelines, I opted for the simplest implementation to get this started. Looking forward to your feedback! Thanks! --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2024-12-24 10:33:26 -08:00
Lance Release	cf8c2edaf4	Bump version: 0.17.1-beta.5 → 0.17.1-beta.6	2024-12-19 19:39:08 +00:00
Will Jones	61a714a459	docs: improve optimization docs (#1957 ) * Add `See Also` section to `cleanup_old_files` and `compact_files` so they know it's linked to `optimize`. * Fixes link to `compact_files` arguments * Improves formatting of note.	2024-12-19 10:55:11 -08:00
Will Jones	5ddd84cec0	feat: upgrade lance to 0.21.0-beta.5 (#1961 )	2024-12-19 10:54:59 -08:00
Lance Release	144b7f5d54	Bump version: 0.17.1-beta.4 → 0.17.1-beta.5	2024-12-13 22:37:13 +00:00
LuQQiu	edc9b9adec	chore: bump Lance version to v0.21.0-beta.4 (#1939 )	2024-12-13 14:36:13 -08:00
Will Jones	980aa70e2d	feat(python): async-sync feature parity on Table (#1914 ) ### Changes to sync API * Updated `LanceTable` and `LanceDBConnection` reprs * Add `storage_options`, `data_storage_version`, and `enable_v2_manifest_paths` to sync create table API. * Add `storage_options` to `open_table` in sync API. * Add `list_indices()` and `index_stats()` to sync API * `create_table()` will now create only 1 version when data is passed. Previously it would always create two versions: 1 to create an empty table and 1 to add data to it. ### Changes to async API * Add `embedding_functions` to async `create_table()` API. * Added `head()` to async API ### Refactors * Refactor index parameters into dataclasses so they are easier to use from Python * Moved most tests to use an in-memory DB so we don't need to create so many temp directories Closes #1792 Closes #1932 --------- Co-authored-by: Weston Pace <weston.pace@gmail.com>	2024-12-13 12:56:44 -08:00
Lance Release	e3c6213333	Bump version: 0.17.1-beta.3 → 0.17.1-beta.4	2024-12-13 05:33:34 +00:00
Weston Pace	00552439d9	feat: upgrade lance to 0.21.0b3 (#1936 )	2024-12-12 21:32:59 -08:00
QianZhu	c0ee370f83	docs: improve schema evolution api examples (#1929 )	2024-12-12 10:52:06 -08:00
Lance Release	bcbbeb7a00	Bump version: 0.17.1-beta.2 → 0.17.1-beta.3	2024-12-11 19:17:54 +00:00
Weston Pace	d6c0f75078	feat: upgrade to lance prerelease 0.21.0b2 (#1933 )	2024-12-11 11:17:10 -08:00
Lance Release	f9789ec962	Bump version: 0.17.1-beta.1 → 0.17.1-beta.2	2024-12-11 17:57:18 +00:00
Lei Xu	347515aa51	fix: support list of numpy f16 floats as query vector (#1931 ) User reported on Discord, when using `table.vector_search([np.float16(1.0), np.float16(2.0), ...])`, it yields `TypeError: 'numpy.float16' object is not iterable`	2024-12-10 16:17:28 -08:00
BubbleCal	3324e7d525	feat: support 4bit PQ (#1916 )	2024-12-10 10:36:03 +08:00
Will Jones	ab5316b4fa	feat: support offset in remote client (#1923 ) Closes https://github.com/lancedb/lancedb/issues/1876	2024-12-09 17:04:18 -08:00

1 2 3 4 5 ...

581 Commits