lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2026-07-03 11:00:40 +00:00

Go to file

Ayush Chaurasia bbfadfe58d [python] Allow adding via iterators (#391 )

Makes the following work so all the formats accepted by `create_table()`
are also accepted by `add()`
```
import lancedb
import pyarrow as pa

db = lancedb.connect("/tmp")

def make_batches():
    for i in range(5):
        yield pa.RecordBatch.from_arrays(
            [
                pa.array([[3.1, 4.1], [5.9, 26.5]]),
                pa.array(["foo", "bar"]),
                pa.array([10.0, 20.0]),
            ],
            ["vector", "item", "price"],
        )

schema = pa.schema([
    pa.field("vector", pa.list_(pa.float32())),
    pa.field("item", pa.utf8()),
    pa.field("price", pa.float32()),
])

tbl = db.create_table("table4", make_batches(), schema=schema)
tbl.add(make_batches())
```

2023-08-04 12:49:44 -07:00

.github/workflows

[WIP] Workflow to trigger vectordb-recipes workflow (#371 )

2023-08-02 11:27:08 -07:00

ci: build node in manylinux docker container (#350 )

2023-07-24 11:31:47 -07:00

docs

add LanceModel to docs (#386 )

2023-07-31 15:12:02 -04:00

node

fix(node) Give preference to local index.node lib (#393 )

2023-08-01 15:29:15 -07:00

python

[python] Allow adding via iterators (#391 )

2023-08-04 12:49:44 -07:00

rust

feat(node): Improve concurrency (#376 )

2023-08-01 14:22:04 -07:00

.bumpversion.cfg

Bump version: 0.1.18 → 0.1.19

2023-07-27 21:06:17 +00:00

.gitignore

feat(node): pull node binaries into separate packages (3) (#285 )

2023-07-12 16:52:04 -07:00

.pre-commit-config.yaml

Handle NaN input data (#241 )

2023-07-04 20:00:46 -07:00

Cargo.toml

fix(node) Replace panic errors with friendlier ones (#366 )

2023-07-26 13:44:58 -07:00

LICENSE

initial commit

2023-03-17 18:15:19 -07:00

README.md

Add docs code testing & documentation syntax changes (#196 )

2023-06-28 11:07:26 -07:00

README.md

Developer-friendly, serverless vector database for AI applications

Documentation • Blog • Discord • Twitter

LanceDB is an open-source database for vector-search built with persistent storage, which greatly simplifies retrevial, filtering and management of embeddings.

The key features of LanceDB include:

Production-scale vector search with no servers to manage.
Store, query and filter vectors, metadata and multi-modal data (text, images, videos, point clouds, and more).
Support for vector similarity search, full-text search and SQL.
Native Python and Javascript/Typescript support.
Zero-copy, automatic versioning, manage versions of your data without needing extra infrastructure.
Ecosystem integrations with LangChain 🦜️🔗, LlamaIndex 🦙, Apache-Arrow, Pandas, Polars, DuckDB and more on the way.

LanceDB's core is written in Rust 🦀 and is built using Lance, an open-source columnar format designed for performant ML workloads.

Quick Start

Javascript

npm install vectordb

const lancedb = require('vectordb');
const db = await lancedb.connect('data/sample-lancedb');

const table = await db.createTable('vectors', 
      [{ id: 1, vector: [0.1, 0.2], item: "foo", price: 10 },
       { id: 2, vector: [1.1, 1.2], item: "bar", price: 50 }])

const query = table.search([0.1, 0.3]);
query.limit = 20;
const results = await query.execute();

Python

pip install lancedb

import lancedb

uri = "data/sample-lancedb"
db = lancedb.connect(uri)
table = db.create_table("my_table",
                         data=[{"vector": [3.1, 4.1], "item": "foo", "price": 10.0},
                               {"vector": [5.9, 26.5], "item": "bar", "price": 20.0}])
result = table.search([100, 100]).limit(2).to_df()

Blogs, Tutorials & Videos

Languages

HTML 35.2%

Rust 32.7%

Python 23.8%

TypeScript 7.8%

Shell 0.3%

Other 0.1%

README.md Unescape Escape

Quick Start

Blogs, Tutorials & Videos

README.md