lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2026-01-05 19:32:56 +00:00

Go to file

Bert c9f248b058 feat: add hybrid search to node and rust SDKs (#1940 )

Support hybrid search in both rust and node SDKs.

- Adds a new rerankers package to rust LanceDB, with the implementation
of the default RRF reranker
- Adds a new hybrid package to lancedb, with some helper methods related
to hybrid search such as normalizing scores and converting score column
to rank columns
- Adds capability to LanceDB VectorQuery to perform hybrid search if it
has both a nearest vector and full text search parameters.
- Adds wrappers for reranker implementations to nodejs SDK.

Additional rerankers will be added in followup PRs

https://github.com/lancedb/lancedb/issues/1921

---
Notes about how the rust rerankers are wrapped for calling from JS:

I wanted to keep the core reranker logic, and the invocation of the
reranker by the query code, in Rust. This aligns with the philosophy of
the new node SDK where it's just a thin wrapper around Rust. However, I
also wanted to have support for users who want to add custom rerankers
written in Javascript.

When we add a reranker to the query from Javascript, it adds a special
Rust reranker that has a callback to the Javascript code (which could
then turn around and call an underlying Rust reranker implementation if
desired). This adds a bit of complexity, but overall I think it moves us
in the right direction of having the majority of the query logic in the
underlying Rust SDK while keeping the option open to support custom
Javascript Rerankers.

2024-12-30 09:03:41 -05:00

.cargo

ci: musl x64,arm64 (#1853 )

2024-11-20 10:53:19 -08:00

.github

chore: fix typos (#1976 )

2024-12-24 12:50:54 -08:00

ci: aarch64-pc-windows-msvc (#1890 )

2024-12-02 11:17:37 -08:00

dockerfiles

A simple base usage that install the dependencies necessary to use FT… (#1036 )

2024-04-05 16:31:36 -07:00

docs

docs: add new indexes to python docs (#1945 )

2024-12-28 15:35:10 -08:00

java

Bump version: 0.14.1-beta.7 → 0.14.1

2024-12-24 18:38:11 +00:00

node

Updating package-lock.json

2024-12-26 07:40:12 +00:00

nodejs

feat: add hybrid search to node and rust SDKs (#1940 )

2024-12-30 09:03:41 -05:00

python

docs: add new indexes to python docs (#1945 )

2024-12-28 15:35:10 -08:00

rust

feat: add hybrid search to node and rust SDKs (#1940 )

2024-12-30 09:03:41 -05:00

.bumpversion.toml

Bump version: 0.14.1-beta.7 → 0.14.1

2024-12-24 18:38:11 +00:00

.gitignore

docs(nodejs): add @lancedb/lancedb examples everywhere (#1411 )

2024-07-10 13:29:03 -05:00

.pre-commit-config.yaml

docs(nodejs): add @lancedb/lancedb examples everywhere (#1411 )

2024-07-10 13:29:03 -05:00

Cargo.toml

feat: upgrade lance to 0.21.0-beta.5 (#1961 )

2024-12-19 10:54:59 -08:00

docker-compose.yml

feat: expose storage options in LanceDB (#1204 )

2024-04-10 10:12:04 -07:00

LICENSE

initial commit

2023-03-17 18:15:19 -07:00

README.md

docs: introducing LanceDB Guru on Gurubase.io (#1797 )

2024-11-08 10:55:22 -08:00

release_process.md

ci: enable java auto release (#1602 )

2024-09-19 10:51:03 -07:00

rust-toolchain.toml

ci(rust): check MSRV and upgrade toolchain (#1960 )

2024-12-19 08:43:25 -08:00

README.md

Developer-friendly, database for multimodal AI

LanceDB is an open-source database for vector-search built with persistent storage, which greatly simplifies retrieval, filtering and management of embeddings.

The key features of LanceDB include:

Production-scale vector search with no servers to manage.
Store, query and filter vectors, metadata and multi-modal data (text, images, videos, point clouds, and more).
Support for vector similarity search, full-text search and SQL.
Native Python and Javascript/Typescript support.
Zero-copy, automatic versioning, manage versions of your data without needing extra infrastructure.
GPU support in building vector index(*).
Ecosystem integrations with LangChain 🦜️🔗, LlamaIndex 🦙, Apache-Arrow, Pandas, Polars, DuckDB and more on the way.

LanceDB's core is written in Rust 🦀 and is built using Lance, an open-source columnar format designed for performant ML workloads.

Quick Start

Javascript

npm install @lancedb/lancedb

import * as lancedb from "@lancedb/lancedb";

const db = await lancedb.connect("data/sample-lancedb");
const table = await db.createTable("vectors", [
	{ id: 1, vector: [0.1, 0.2], item: "foo", price: 10 },
	{ id: 2, vector: [1.1, 1.2], item: "bar", price: 50 },
], {mode: 'overwrite'});


const query = table.vectorSearch([0.1, 0.3]).limit(2);
const results = await query.toArray();

// You can also search for rows by specific criteria without involving a vector search.
const rowsByCriteria = await table.query().where("price >= 10").toArray();

Python

pip install lancedb

import lancedb

uri = "data/sample-lancedb"
db = lancedb.connect(uri)
table = db.create_table("my_table",
                         data=[{"vector": [3.1, 4.1], "item": "foo", "price": 10.0},
                               {"vector": [5.9, 26.5], "item": "bar", "price": 20.0}])
result = table.search([100, 100]).limit(2).to_pandas()

Blogs, Tutorials & Videos

Languages

Rust 42.7%

Python 42%

TypeScript 14.2%

Shell 0.6%

Java 0.3%

README.md Unescape Escape

Quick Start

Blogs, Tutorials & Videos

README.md