lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2026-07-03 11:00:40 +00:00

Author	SHA1	Message	Date
Jack Ye	7bf020b3d5	chore: fix clippy when remote flag is not set (#2943 ) Also add a step in CI to ensure this does not happen in the future	2026-01-26 13:59:31 -08:00
Jack Ye	e4552e577a	chore(revert): revert update lance dependency to v2.0.0-rc.1 (#2936 ) (#2941 ) This reverts commit `bd84bba14d`, so that we can bump version to 1.0.4-rc.1	2026-01-26 11:13:59 -08:00
Will Jones	f979a902ad	ci(rust): fix MSRV check (#2940 ) Realized our MSRV check was inert because `rust-toolchain.toml` was overriding the Rust version. We set the `RUSTUP_TOOLCHAIN` environment variable, which overrides that. Also needed to update to MSRV 1.88 (due to dependencies like Lance and DataFusion) and fix some clippy warnings.	2026-01-23 15:57:09 -08:00
Colin Patrick McCabe	5a7a8da567	feat: check AZURE_STORAGE_ACCOUNT_NAME in remote conns (#2918 ) Unlike in Amazon S3, in Azure bucket names are not globally unique. Instead, the combination of (storage_account_name, bucket_name) is unique. Therefore, when using Azure blob store, we always need a way to configure the storage account name. One way is to use the storage_options hash map and set azure_storage_account_name. Another way is to set an environment variable, AZURE_STORAGE_ACCOUNT_NAME. Prior to this PR, the second way (environment variable) did not work with remote connections. This is because the existing code that checks for these environment variables happens inside the Azure object store implementation itself, which does not run locally when using remote connections. This PR addresses that situation by adding a check of the environment variable. This functions as a default if the relevant storage option is not set in the storage_options hash map.	2026-01-22 13:36:05 -08:00
Jack Ye	0db8176445	test: fix failing remote doctest reference to aws feature (#2935 ) Closes https://github.com/lancedb/lancedb/issues/2933	2026-01-22 13:17:03 -08:00
LanceDB Robot	bd84bba14d	chore: update lance dependency to v2.0.0-rc.1 (#2936 ) ## Summary - bump Lance dependencies to v2.0.0-rc.1 (git tag) - align Arrow/DataFusion/PyO3 versions for the new Lance release - update Python bindings for PyO3 0.26 (attach API + Py<PyAny>) ## Verification - `cargo clippy --workspace --tests --all-features -- -D warnings` - `cargo fmt --all` ## Reference - https://github.com/lance-format/lance/releases/tag/v2.0.0-rc.1 --------- Co-authored-by: Jack Ye <yezhaoqin@gmail.com> Co-authored-by: Will Jones <willjones127@gmail.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: BubbleCal <bubble_cal@outlook.com>	2026-01-22 13:14:38 -08:00
Lance Release	ac07f8068c	Bump version: 0.24.0-beta.1 → 0.24.0	2026-01-22 01:10:15 +00:00
Lance Release	bba362d372	Bump version: 0.24.0-beta.0 → 0.24.0-beta.1	2026-01-22 01:09:53 +00:00
Jack Ye	4e65748abf	chore: update lance dependency to v1.0.3-rc.1 (#2927 ) Supercedes https://github.com/lancedb/lancedb/pull/2925 We accidentally upgraded lance to 2.0.0-beta.8. This PR reverts that first and then bump to 1.0.3-rc.1 --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-21 11:52:07 -08:00
Colin Patrick McCabe	e897f3edab	test: assert remote behavior of drop_table (#2926 ) Add support for testing remote connections in drop_table in `rust/lancedb/src/connection.rs`.	2026-01-21 08:42:40 -08:00
Lance Release	790ba7115b	Bump version: 0.23.1 → 0.24.0-beta.0	2026-01-21 12:21:53 +00:00
Ryan Green	cd5f91bb7d	feat: expose table uri (#2922 ) * Expose `table.uri` property for all tables, including remote tables * Fix bug in path calculation on windows file systems	2026-01-20 19:56:46 -03:30
LanceDB Robot	4da01a0e65	chore: update lance dependency to v2.0.0-beta.8 (#2907 ) ## Summary - bump Lance crates to v2.0.0-beta.8 and align arrow/datafusion/regex/half and PyO3 dependencies - update Rust/Python bindings for upstream API changes (namespace/table requests, query select columns, storage option providers) - verified with cargo clippy --workspace --tests --all-features -D warnings and cargo fmt --all Triggered by refs/tags/v2.0.0-beta.8. --------- Signed-off-by: BubbleCal <bubble-cal@outlook.com> Co-authored-by: BubbleCal <bubble-cal@outlook.com>	2026-01-16 01:46:52 +08:00
Will Jones	1840aa7edc	feat(rust)!: remove default features (#2912 ) BREAKING CHANGE: removes `aws`, `dynamodb`, `azure`, `gcs`, `oss`, `huggingface` from default Rust features. They can be enabled by users as needed. They are still enabled for Python and NodeJS, since those users don't control the compilation of artifacts. Closes #2911	2026-01-13 11:23:14 -08:00
Xuanwo	489c91c5d6	feat: enable huggingface feature by default (#2910 )	2026-01-13 20:42:11 +05:30
Qichao Chu	4494eb9e56	feat: parallelize embedding computations (#2896 ) Implement parallel execution of multiple embedding functions using std:🧵:scope to improve performance when a table has multiple embedding columns. Key changes: - Add compute_embeddings_parallel() helper method to WithEmbeddings - Use fast path for single embeddings (no threading overhead) - Use scoped threads for parallel execution of multiple embeddings - Add comprehensive tests including parallelization timing verification - Update WithEmbeddings documentation Performance improvements: - I/O-bound embeddings (OpenAI, Bedrock): High benefit from concurrent API calls - CPU-bound embeddings (sentence-transformers): Medium benefit from core utilization - Single embedding: No overhead (fast path) Closes TODO on line 266 in rust/lancedb/src/embeddings.rs	2026-01-06 14:35:56 -08:00
LuQQiu	d67a8743ba	feat: support remote ivf rq (#2863 )	2026-01-02 15:35:33 -08:00
Colin Patrick McCabe	ac164c352b	test: convert test_table_names to test both remote and local (#2888 ) Convert test_table_names to test both remote and local connections. This PR also includes some miscellaneous improvements in src/test_utils/connection.rs. It starts a thread to drain stdout from the server process. It adds the PRINT_LANCEDB_TEST_CONNECTION_SCRIPT_OUTPUT environment variable, which optionally displays server stdout. Fix a bash conditional in run_with_test_connection.sh.	2026-01-02 15:08:44 -08:00
Lance Release	8bcac7e372	Bump version: 0.23.1-beta.2 → 0.23.1	2026-01-02 17:39:19 +00:00
Lance Release	e496184ab2	Bump version: 0.23.1-beta.1 → 0.23.1-beta.2	2026-01-02 17:38:54 +00:00
Lance Release	0667fa38d4	Bump version: 0.23.1-beta.0 → 0.23.1-beta.1	2025-12-17 06:59:29 +00:00
Jack Ye	1628f7e3f3	fix: pass namespace storage options provider into native table (#2873 ) Previously the native table is created with static credentials and could not auto-refresh credentials when expired.	2025-12-16 22:58:04 -08:00
Lance Release	2fd712312f	Bump version: 0.23.0 → 0.23.1-beta.0	2025-12-17 03:30:51 +00:00
Jack Ye	9e60fda0ec	fix: use post for describe_namespace and allow access to underlying client (#2871 ) Issues found during integration tests: 1. describe_namespace should use POST 2. service needs to access the underlying namespace to be able to do operations like create_empty_table directly, or get credentials in isolated paths like a remote take	2025-12-16 19:29:27 -08:00
Lance Release	94bdffe13c	Bump version: 0.23.0-beta.2 → 0.23.0	2025-12-16 16:58:35 +00:00
Lance Release	b93ea3a388	Bump version: 0.23.0-beta.1 → 0.23.0-beta.2	2025-12-16 16:57:55 +00:00
Lance Release	0960e19559	Bump version: 0.23.0-beta.0 → 0.23.0-beta.1	2025-12-05 00:36:39 +00:00
Lance Release	6f79770248	Bump version: 0.22.4-beta.3 → 0.23.0-beta.0	2025-12-04 19:33:37 +00:00
BubbleCal	a61461331c	feat: add IVF SQ index support and HNSW aliases (#2832 ) Adds IVF_SQ index config through Rust core and Python bindings, plus alias names IvfHnswSq/Pq for backward compatibility. Updates remote/table helpers and types to accept the new index type. Includes tests covering IVF SQ creation and alias usage.	2025-12-04 00:25:44 +08:00
Jack Ye	b0170ea86a	fix: table_names error at root namespace (#2842 ) Root namepace should be passed in as an empty vector, not None.	2025-12-02 23:53:29 -08:00
Jack Ye	d1efc6ad8a	refactor!: use namespace models directly for namespace operations (#2806 ) 1. Use generated models in lance-namespace for request response models to avoid multiple layers of conversions 2. Make sure the API is consistent with the namespace spec 3. Deprecate the table_names API in favor of the list_tables API in namespace that allows full pagination support without the need to have sorted table names 4. Add describe_namespace API which was a miss in the original implementation	2025-12-02 22:41:04 -08:00
Jack Ye	9d638cb3c7	feat: support namespace server side query (#2811 ) Currently a table in a namespace is still backed with a `NativeTable`, which means after getting the location of the table and optional storage options override from `namespace.describe_table`, all things work like a normal local table. However, namespace also supports `query_table`, which is exactly the same API as remote table. This PR adds a `server_side_query` capability, when enabled, it runs the query by calling `namespace.query_table`. For namespace that implements the operation (e.g. REST namespace), this could hit a backend server that could execute the query faster (e.g. using a distributed engine).	2025-12-02 21:04:12 -08:00
Lance Release	b2d06a3a73	Bump version: 0.22.4-beta.2 → 0.22.4-beta.3	2025-12-02 22:01:59 +00:00
Jonathan Hsieh	44878dd9a5	feat: support stable row IDs via storage_options (#2831 ) Add support for enabling stable row IDs when creating tables via the `new_table_enable_stable_row_ids` storage option. Stable row IDs ensure that row identifiers remain constant after compaction, update, delete, and merge operations. This is useful for materialized views and other use cases that need to track source rows across these operations. The option can be set at two levels: - Connection level: applies to all tables created with that connection - Table level: per-table override via create_table storage_options 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-12-02 13:57:00 -08:00
Rudi Floren	03a1a99270	feat: remove remote default features on lance-namespace-impls (#2828 ) This tries to fix #2771. It is not a complete fix because `lance-namespace-impls` uses `lance` which has its default features enabled. Thus, to close #2771, the lance repo also needs an update. The `dir-*` features are enabled by the respective remote feature (`aws`, `gcp`, `azure`, `oss`). The `rest` feature is enabled via `remote`.	2025-12-01 10:53:22 -08:00
Xuanwo	0110e3b6f8	chore: clippy::string_to_string has been replaced by implicit_clone (#2817 ) clippy::string_to_string has been replaced by implicit_clone, so lancedb will raise a build error in Rust 1.91. This PR suppresses it. --- This PR was primarily authored with Codex using GPT-5-Codex and then hand-reviewed by me. I AM responsible for every change made in this PR. I aimed to keep it aligned with our goals, though I may have missed minor issues. Please flag anything that feels off, I'll fix it quickly. Signed-off-by: Xuanwo <github@xuanwo.io>	2025-11-26 16:30:35 +08:00
Prashanth Rao	135dfdc7ec	docs: 404 and outdated URLs should now work (#2800 ) Did a full scan of all URLs that used to point to the old mkdocs pages, and now links to the appropriate pages on lancedb.com/docs or lance.org docs. --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-11-20 11:14:20 -08:00
Lance Release	3531393523	Bump version: 0.22.4-beta.1 → 0.22.4-beta.2	2025-11-19 20:25:41 +00:00
Wyatt Alt	386fc9e466	feat: add num_attempts to merge insert result (#2795 ) This pipes the num_attempts field from lance's merge insert result through lancedb. This allows callers of merge_insert to get a better idea of whether transaction conflicts are occurring.	2025-11-19 09:32:57 -08:00
Lance Release	ce1bafec1a	Bump version: 0.22.4-beta.0 → 0.22.4-beta.1	2025-11-19 12:58:59 +00:00
Will Jones	1cf3917a87	ci: make rust ci faster, get ci green (#2782 ) * Add `ci` profile for smaller build caches. This had a meaningful impact in Lance, and I expect a similar impact here. https://github.com/lancedb/lance/pull/5236 * Get caching working in Rust. Previously was not working due to `workspaces: rust`. * Get caching working in NodeJs lint job. Previously wasn't working because we installed the toolchain after we called `- uses: Swatinem/rust-cache@v2`, which invalidates the cache locally. * Fix broken pytest from async io transition (`pytest.PytestRemovedIn9Warning`) * Altered `get_num_sub_vectors` to handle bug in case of 4-bit PQ. This was cause of `rust future panicked: unknown error`. Raised an issue upstream to change panic to error: https://github.com/lancedb/lance/issues/5257 * Call `npm run docs` to fix doc issue. * Disable flakey Windows test for consistency. It's just an OS-specific timer issue, not our fault. * Fix Windows absolute path handling in namespaces. Was causing CI failure `OSError: [WinError 123] The filename, directory name, or volume label syntax is incorrect: `	2025-11-18 09:04:56 -08:00
Lance Release	e2d7640021	Bump version: 0.22.3 → 0.22.4-beta.0	2025-11-17 08:43:51 +00:00
Jack Ye	e47f552a86	feat: support namespace credentials vending (#2778 ) Based on https://github.com/lancedb/lance/pull/4984 1. Bump to 1.0.0-beta.2 2. Use DirectoryNamespace in lance to perform all testing in python and rust for much better coverage 3. Refactor `ListingDatabase` to be able to accept location and namespace. This is because we have to leverage listing database (local lancedb connection) for using namespace, namespace only resolves the location and storage options but we don't want to bind all the way to rust since user will plug-in namespace from python side. And thus `ListingDatabase` needs to be able to accept location and namespace that are created from namespace connection. 4. For credentials vending, we also pass storage options provider all the way to rust layer, and the rust layer calls back to the python function to fetch next storage option. This is exactly the same thing we did in pylance.	2025-11-17 00:42:24 -08:00
BubbleCal	3e42a43bbf	feat: let lance determine the default num_partitions param (#2775 )	2025-11-12 09:43:19 +08:00
Colin Patrick McCabe	1ff594a6a4	feat: bump lance version to 0.40-0-beta.2 (#2772 ) Bump the bump lance version to 0.40-0-beta.2.	2025-11-10 14:36:37 -08:00
Lance Release	e34f51713a	Bump version: 0.22.3-beta.6 → 0.22.3	2025-11-07 04:59:18 +00:00
Lance Release	abaf5ac27f	Bump version: 0.22.3-beta.5 → 0.22.3-beta.6	2025-11-07 04:58:38 +00:00
Weston Pace	aeac9c7644	feat: add python Permutation class to mimic hugging face dataset and provide pytorch dataloader (#2725 )	2025-11-06 16:15:33 -08:00
Mark	6ddd271627	fix: relax bytemuck and crunchy version pins (#2768 ) Closes #2767	2025-11-05 14:07:35 -08:00
Will Jones	7ef8bafd51	feat: add `source` to TableNotFound errors (#2765 ) This will make it easier to see if there are underlying problems. We should see the actual object store HTTP request error within the error chain after this.	2025-11-04 15:31:45 -08:00

1 2 3 4 5 ...

612 Commits