mirror of https://github.com/lancedb/lancedb.git synced 2026-07-03 11:00:40 +00:00

Files

Will Jones 0e486511fa feat: hook up new writer for insert (#3029)

This hooks up a new writer implementation for the `add()` method. The
main immediate benefit is that it allows streaming requests to remote
tables while still allowing retries for most inputs.

In Node.js, we always convert the data to `Vec<RecordBatch>`, so it's
always retry-able.

In Python, all input types are retry-able except `Iterator` and
`pa.RecordBatchReader`, which can only be consumed once. Some, like
`pa.dataset.Dataset`, are retry-able *and* streaming.
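
The difference between retry-able and single-use inputs can be illustrated in plain Python. This is a minimal sketch, not the writer added in this commit: `is_retryable`, `send_with_retry`, and `make_flaky_upload` are hypothetical helpers, a list stands in for a re-readable source, and a generator stands in for `Iterator` or `pa.RecordBatchReader`.

```python
from typing import Callable, Iterable, Iterator, List


def is_retryable(data: Iterable) -> bool:
    """Return True if the source can be iterated more than once.

    Iterators (including generators) return themselves from iter(),
    so they are exhausted after a single pass.
    """
    return iter(data) is not data


def send_with_retry(
    data: Iterable,
    upload: Callable[[Iterator], List],
    max_attempts: int = 3,
) -> List:
    """Stream `data` through `upload`, retrying only when replay is possible."""
    attempts = max_attempts if is_retryable(data) else 1
    last_error = None
    for _ in range(attempts):
        try:
            # A fresh iterator is created per attempt for re-readable sources.
            return upload(iter(data))
        except ConnectionError as err:
            last_error = err
    raise last_error


def make_flaky_upload(failures: int) -> Callable[[Iterator], List]:
    """Build a hypothetical upload function that fails `failures` times."""
    state = {"remaining": failures}

    def upload(rows: Iterator) -> List:
        received = list(rows)  # the stream is consumed here
        if state["remaining"] > 0:
            state["remaining"] -= 1
            raise ConnectionError("simulated network failure")
        return received

    return upload
```

With one simulated failure, a list input succeeds on the second attempt, while a generator input gets a single attempt and the error propagates, because the consumed rows cannot be replayed.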

A lot of the changes here are to make the new DataFusion write pipeline
maintain the same behavior as the existing Python-based preprocessing,
such as:

* casting input data to target schema
* rejecting NaN values if `on_bad_vectors="error"`
* applying embedding functions.
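
The first two behaviors above can be sketched in plain Python. This is an illustration under stated assumptions, not the DataFusion pipeline itself: `cast_row`, `has_nan`, and `preprocess` are hypothetical names, the schema is modeled as a mapping from column name to a cast function, and embedding functions are left out. Only the `"error"` mode comes from the commit message; the `"drop"` branch is included as an assumed alternative for contrast.

```python
import math
from typing import Any, Callable, Dict, Iterable, List

Row = Dict[str, Any]
Schema = Dict[str, Callable[[Any], Any]]


def cast_row(row: Row, schema: Schema) -> Row:
    """Cast each column of the row to the type the target schema expects."""
    return {name: cast(row[name]) for name, cast in schema.items()}


def has_nan(vector: Iterable[float]) -> bool:
    """Return True if any element of the vector is NaN."""
    return any(math.isnan(x) for x in vector)


def preprocess(
    rows: Iterable[Row],
    schema: Schema,
    vector_column: str = "vector",
    on_bad_vectors: str = "error",
) -> List[Row]:
    """Cast rows to the schema and handle vectors containing NaN."""
    out = []
    for row in rows:
        casted = cast_row(row, schema)
        if has_nan(casted[vector_column]):
            if on_bad_vectors == "error":
                raise ValueError(f"NaN found in column {vector_column!r}")
            if on_bad_vectors == "drop":
                continue  # skip the offending row
            raise ValueError(f"unsupported mode: {on_bad_vectors!r}")
        out.append(casted)
    return out


# Hypothetical target schema: integer id plus a float vector.
EXAMPLE_SCHEMA: Schema = {
    "id": int,
    "vector": lambda values: [float(v) for v in values],
}
```

Calling `preprocess([{"id": "1", "vector": [1, 2]}], EXAMPLE_SCHEMA)` casts the string id to an integer and the vector elements to floats; a row whose vector contains NaN raises `ValueError` under the default mode.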

In future PRs, we'll enhance these by moving the embedding calls into
DataFusion and making sure we parallelize them. See:
https://github.com/lancedb/lancedb/issues/3048

---------

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-23 14:43:31 -08:00

python

feat: hook up new writer for insert (#3029)

2026-02-23 14:43:31 -08:00

src

feat: hook up new writer for insert (#3029)

2026-02-23 14:43:31 -08:00

.bumpversion.toml

Bump version: 0.30.0-beta.0 → 0.30.0-beta.1

2026-02-23 18:33:28 +00:00

.gitignore

feat(embeddings): add siglip embedding support to lancedb (#2499)

2025-08-04 11:42:39 -07:00

AGENTS.md

ci: add agents and add reviewing instructions (#2754)

2025-10-29 17:28:26 -07:00

ASYNC_MIGRATION.md

feat: add support for add to async python API (#1037)

2024-04-05 16:31:36 -07:00

build.rs

ci: check license headers (#2076)

2025-01-29 08:27:07 -08:00

Cargo.toml

Bump version: 0.30.0-beta.0 → 0.30.0-beta.1

2026-02-23 18:33:28 +00:00

CLAUDE.md

ci: add agents and add reviewing instructions (#2754)

2025-10-29 17:28:26 -07:00

CONTRIBUTING.md

chore!: change support python version from 3.10 to 3.13 (#2955)

2026-01-30 01:47:50 +08:00

LICENSE

chore: bump lance to 0.8.5 (#561)

2024-04-05 16:22:59 -07:00

license_header.txt

ci: check license headers (#2076)

2025-01-29 08:27:07 -08:00

Makefile

ci: add support for lance-format fury index for downloading pylance (#2804)

2025-11-20 23:29:36 -08:00

pyproject.toml

chore!: change support python version from 3.10 to 3.13 (#2955)

2026-01-30 01:47:50 +08:00

PYTHON_THIRD_PARTY_LICENSES.md

feat: add third party licenses lists (#3010)

2026-02-09 16:16:46 -08:00

README.md

docs: contributing guide (#1970)

2025-01-07 15:11:16 -08:00

RUST_THIRD_PARTY_LICENSES.html

feat: add third party licenses lists (#3010)

2026-02-09 16:16:46 -08:00

uv.lock

feat: add third party licenses lists (#3010)

2026-02-09 16:16:46 -08:00

README.md

LanceDB

A Python library for LanceDB.

Installation

pip install lancedb

Preview Releases

Stable releases are created about every 2 weeks. For the latest features and bug fixes, you can install the preview release. These releases receive the same level of testing as stable releases, but are not guaranteed to be available for more than 6 months after they are released. Once your application is stable, we recommend switching to stable releases.

pip install --pre --extra-index-url https://pypi.fury.io/lancedb/ lancedb

Usage

Basic Example

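import lancedb

# Connect to the LanceDB database at the given path
db = lancedb.connect('<PATH_TO_LANCEDB_DATASET>')
# Open an existing table by name
table = db.open_table('my_table')
# Run a vector search and return up to 20 matching rows
results = table.search([0.1, 0.3]).limit(20).to_list()
print(results)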

Development

See CONTRIBUTING.md for information on how to contribute to LanceDB.