lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2025-12-24 22:09:58 +00:00

Author	SHA1	Message	Date
rmeng	24526bda4c	patch	2024-05-15 13:44:27 -04:00
Cory Grinstead	055efdcdb6	refactor(nodejs): use biomejs instead of eslint & prettier (#1304 ) I've been noticing a lot of friction with the current toolchain for '/nodejs'. Particularly with the usage of eslint and prettier. [Biome](https://biomejs.dev/) is an all in one formatter & linter that replaces the need for two different ones that can potentially clash with one another. I've been using it in the [nodejs-polars](https://github.com/pola-rs/nodejs-polars) repo for quite some time & have found it much more pleasant to work with. --- One other small change included in this PR: use [ts-jest](https://www.npmjs.com/package/ts-jest) so we can run our tests without having to rebuild typescript code first	2024-05-14 11:11:18 -05:00
Cory Grinstead	bc582bb702	fix(nodejs): add better error handling when missing embedding functions (#1290 ) note: running the default lint command `npm run lint -- --fix` seems to have made a lot of unrelated changes.	2024-05-14 08:43:39 -05:00
Will Jones	df9c41f342	ci: write down breaking change policy (#1294 ) * Enforce conventional commit PR titles * Add automatic labelling of PRs * Write down breaking change policy. Left for another PR: * Validation of breaking change version bumps. (This is complicated due to separate releases for Python and other package.)	2024-05-13 10:25:55 -07:00
Raghav Dixit	0bd6ac945e	Documentation : Langchain doc bug fix (#1301 ) nav bar update	2024-05-13 20:56:34 +05:30
Raghav Dixit	c9d5475333	Documentation: Langchain Integration (#1297 ) Integration doc update	2024-05-13 10:19:33 -04:00
asmith26	3850d5fb35	Add ollama embeddings function (#1263 ) Following the docs [here](https://lancedb.github.io/lancedb/python/python/#lancedb.embeddings.openai.OpenAIEmbeddings) I've been trying to use ollama embedding via the OpenAI API interface, but unfortunately I couldn't get it to work (possibly related to https://github.com/ollama/ollama/issues/2416) Given the popularity of ollama I thought it could be helpful to have a dedicated Ollama Embedding function in lancedb. Very much welcome any thought on this or my code etc. Thanks!	2024-05-13 13:09:19 +05:30
Lance Release	b37c58342e	[python] Bump version: 0.6.12 → 0.6.13 python-v0.6.13	2024-05-10 16:15:13 +00:00
Lance Release	a06e64f22d	Updating package-lock.json	2024-05-09 22:46:19 +00:00
Lance Release	e983198f0e	Updating package-lock.json	2024-05-09 22:12:17 +00:00
Lance Release	76e7b4abf8	Updating package-lock.json	2024-05-09 21:14:47 +00:00
Lance Release	5f6eb4651e	Bump version: 0.4.19 → 0.4.20 v0.4.20	2024-05-09 21:14:30 +00:00
Bert	805c78bb20	chore: bump lance to v0.10.18 (#1287 ) https://github.com/lancedb/lance/releases/tag/v0.10.18	2024-05-09 17:06:26 -03:00
QianZhu	4746281b21	fix rename_table api and cache pop (#1283 )	2024-05-08 13:41:18 -07:00
Aman Kishore	7b3b6bdccd	Remove semvar strict dependancy (#1253 )	2024-05-08 11:16:15 -07:00
Ryan Green	37e1124c0f	chore: upgrade lance to 0.10.17 (#1280 )	2024-05-08 09:56:48 -02:30
Lance Release	93f037ee41	Updating package-lock.json	2024-05-07 20:50:44 +00:00
Lance Release	e4fc06825a	Updating package-lock.json	2024-05-07 20:09:25 +00:00
Lance Release	fe89a373a2	[python] Bump version: 0.6.11 → 0.6.12 python-v0.6.12	2024-05-07 19:27:17 +00:00
Lance Release	3d3915edef	Updating package-lock.json	2024-05-07 19:04:42 +00:00
Lance Release	e2e8b6aee4	Bump version: 0.4.18 → 0.4.19 v0.4.19	2024-05-07 19:04:31 +00:00
Will Jones	12dbca5248	ci: better test for test_syntax (#1278 ) The syntax error was fixed in tantivy 0.22.0, so I changed the test case to something more wrong.	2024-05-07 11:52:39 -07:00
Will Jones	a6babfa651	fix(node/vectordb): parse value not key (#1276 )	2024-05-07 10:16:05 -07:00
Will Jones	75ede86fab	fix: clearer error that FTS is not supported on object stores (#1273 ) Closes #1272	2024-05-07 10:15:53 -07:00
Will Jones	becd649130	docs: add tip about using allow_http on local servers (#1277 ) Based on user question https://discord.com/channels/1030247538198061086/1197630499926057021/1237350091191222293	2024-05-07 10:15:26 -07:00
Cory Grinstead	9d2fb7d602	feat: rust embedding registry (#1259 ) Todo: - [x] add proper documentation - [x] add unit tests - [x] better handling of the registry1 - [x] allow user defined registry2 1 The python implementation just uses a global registry so it makes things a bit easier. I attached it to the db/connection to prevent future conflicts if running multiple connections/databases. I mostly modeled the registry & pattern off of datafusion's [FunctionRegistry](https://docs.rs/datafusion/latest/datafusion/execution/trait.FunctionRegistry.html). 2 Ideally, the user should be able to provide it's own registry entirely, but currently it just uses an in memory registry by default (_which isn't configurable_) `rust/lancedb/examples/embedding_registry.rs` provides a thorough example of expected usage. --- Some additional notes: This does not provide any of the out of box functionality that the python registry does. _i.e there are no built-in embedding functions._ You can think of this as the ground work for adding those built in functions, So while this is part of https://github.com/lancedb/lancedb/issues/994, it does not yet offer feature parity.	2024-05-06 18:39:07 -05:00
Ben Poulson	fdb5d6fdf1	Update README.md to correct LangChain URL (#1262 ) URL in the README for LangChain is currently 404ing. Here's the new URL.	2024-05-06 11:50:34 +05:30
Ayush Chaurasia	2f13fa225f	Chore (python): Better retry loop logging when embedding api fails (#1267 ) https://github.com/lancedb/lancedb/issues/1266#event-12703166915 This happens because openai API errors out with None values. The current log level didn't really print out the msg on screen. Changed the log level to warning, which better suits this case. Also, retry loop can be disabled by setting `max_retries=0` (I'm not sure if we should also set this as the default behaviour as hitting api rate is quite common when ingesting large corpus) ``` func = get_registry().get("openai").create(max_retries=0) ````	2024-05-06 11:49:11 +05:30
Nehil Jain	e933de003d	fix: Docs for embed_func fixed in youtube transcript search notebook (#1269 ) Fixes issue https://github.com/lancedb/lancedb/issues/1268	2024-05-06 11:48:25 +05:30
Ikko Eltociear Ashimine	05fd387425	docs: update README.md (#1270 ) retrevial -> retrieval	2024-05-06 11:46:48 +05:30
Will Jones	82a1da554c	fix(python): return ValueError if passed unknown args to `connect()` (#1265 ) It's confusing to users that keyword arguments from the async API like `storage_options` are accepted by `connect()`, but don't do anything. We should error if unknown arguments are passed instead.	2024-05-03 17:00:08 -07:00
Rohit Rastogi	a7c0d80b9e	Implement convertors to and from Polars DataFrames in Rust SDK using convertors based on C FFI #1099 (#1260 ) https://github.com/lancedb/lancedb/issues/1099 Took the same general approach from: https://github.com/lancedb/lancedb/pull/1235. Instead of using high-level convertors implemented in polars-arrow (with the arrow-rs feature flag, which adds a dependency on arrow-rs), I used convertors based on the C FFI to avoid dependency conflicts. --------- Co-authored-by: Rohit Rastogi <rohitrastogi@Rohits-MacBook-Pro.local> Co-authored-by: Weston Pace <weston.pace@gmail.com>	2024-05-03 16:15:14 -07:00
Cory Grinstead	71323a064a	chore(nodejs): update docs on "table.ts" (#1255 ) closes https://github.com/lancedb/lancedb/issues/1007	2024-05-01 23:00:22 -05:00
asmith26	df48454b70	Update embedding_functions.md (#1250 ) `clip.ndims` seems to be a function (I installed with `pip install open_clip_torch`).	2024-05-01 09:33:42 -07:00
Lance Release	6603414885	Updating package-lock.json	2024-04-30 20:57:12 +00:00
Lance Release	c256f6c502	Updating package-lock.json	2024-04-30 19:58:49 +00:00
Lance Release	cc03f90379	Updating package-lock.json	2024-04-30 19:21:48 +00:00
Lance Release	975da09b02	Bump version: 0.4.17 → 0.4.18 v0.4.18	2024-04-30 19:21:37 +00:00
Cory Grinstead	c32e17b497	chore(nodejs): remove "optionalDependencies" (#1252 ) closes #1248 the binding specific `optionalDependencies` are added automatically as part of the `prepublishOnly` hook, and they are not supposed to be committed to `package.json`. --- npm lifecycle scripts: https://docs.npmjs.com/cli/v7/using-npm/scripts#life-cycle-scripts	2024-04-30 10:51:10 -05:00
Ryan Green	0528abdf97	fix: fix path on remote create_table and check for error response (#1244 )	2024-04-28 11:33:05 -02:30
Lance Release	1090c311e8	[python] Bump version: 0.6.10 → 0.6.11 python-v0.6.11	2024-04-27 03:54:58 +00:00
Weston Pace	e767cbb374	chore: update to Lance version 0.10.16 and Arrow version 51 (#1247 )	2024-04-26 16:26:57 -07:00
Weston Pace	3d7c48feca	feat: allow the index_cache_size to be configured when opening a table (#1245 ) This was already configurable in the rust API but it wasn't actually being passed down to the underlying dataset. I added this option to both the async python API and the new nodejs API. I also added this option to the synchronous python API. I did not add the option to vectordb.	2024-04-26 13:42:02 -07:00
Bert	08d62550bb	fix: passing data to createTable as option (#1242 ) Fixes issue where we would throw `Either data or schema needs to defined` when passing `data` to `createTable` as a property of the first argument (an object). ```ts await db.createTable({ name: 'table1', data, schema }) ```	2024-04-26 15:26:08 -04:00
Lei Xu	b272408b05	chore: fix main branch test failure (#1240 )	2024-04-24 13:49:37 -07:00
Weston Pace	46ffa87cd4	chore: disable the remote feature by default (#1239 ) The rust implementation of the remote client is not yet ready. This is understandably confusing for users since it is enabled by default. This PR disables it by default. We can re-enable it when we are ready (even then it is not clear this is something that should be a default feature). --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2024-04-24 09:28:24 -07:00
QianZhu	cd9fc37b95	add rename_table fn and more data for index_stats to return (#1234 ) 1. added rename_table fn to enable dashboard to rename a table 2. added index_type and distance_type (for vector index) to index_stats so that more detailed data can be shown on the table page.	2024-04-23 16:42:26 -07:00
Lance Release	431f94e564	[python] Bump version: 0.6.9 → 0.6.10 python-v0.6.10	2024-04-22 17:42:24 +00:00
Alex Kohler	c1a7d65473	chore: fix get_registry call in baai embeddings example (#1230 )	2024-04-20 07:25:16 +05:30
Rob Meng	1e5ccb1614	chore: upgrade lance to 0.10.15 (#1229 )	2024-04-19 10:31:39 -04:00

1 2 3 4 5 ...

974 Commits