lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2025-12-25 14:29:56 +00:00

Author	SHA1	Message	Date
Weston Pace	9c7e00eec3	Remove remote integration workflow (#1076 )	2024-03-07 12:00:04 -08:00
Weston Pace	1453bf4e7a	feat: reconfigure typescript linter / formatter for nodejs (#1042 ) The eslint rules specify some formatting requirements that are rather strict and conflict with vscode's default formatter. I was unable to get auto-formatting set up correctly. Also, eslint has quite recently [given up on formatting](https://eslint.org/blog/2023/10/deprecating-formatting-rules/) and recommends using a 3rd party formatter. This PR adds prettier as the formatter. It restores the eslint rules to their defaults. This does mean we now have the "no explicit any" check back on. I know that rule is pedantic but it did help me catch a few corner cases in type testing that weren't covered in the current code. Leaving in draft as this is dependent on other PRs.	2024-03-04 10:49:08 -08:00
Weston Pace	abaf315baf	feat: add support for add to async python API (#1037 ) In order to add support for `add` we needed to migrate the rust `Table` trait to a `Table` struct and `TableInternal` trait (similar to the way the connection is designed). While doing this we also cleaned up some inconsistencies between the SDKs: * Python and Node are garbage collected languages and it can be difficult to trigger something to be freed. The convention for these languages is to have some kind of close method. I added a close method to both the table and connection which will drop the underlying rust object. * We made significant improvements to table creation in `cc5f2136a6` for the `node` SDK. I copied these changes to the `nodejs` SDK. * The nodejs tables were using fs to create tmp directories and these were not getting cleaned up. This is mostly harmless but annoying and so I changed it up a bit to ensure we clean up tmp directories. * ~~countRows in the node SDK was returning `bigint`. I changed it to return `number`~~ (this actually happened in a previous PR) * Tables and connections now implement `std::fmt::Display` which is hooked into python's `__repr__`. Node has no concept of a regular "to string" function and so I added a `display` method. * Python method signatures are changing so that optional parameters are always `Optional[foo] = None` instead of something like `foo = False`. This is because we want those defaults to be in rust whenever possible (though we still need to mention the default in documentation). * I changed the python `AsyncConnection/AsyncTable` classes from abstract classes with a single implementation to just classes because we no longer have the remote implementation in python. Note: this does NOT add the `add` function to the remote table. This PR was already large enough, and the remote implementation is unique enough, that I am going to do all the remote stuff at a later date (we should have the structure in place and correct so there shouldn't be any refactor concerns) --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2024-03-04 09:27:41 -08:00
Chang She	62632cb90b	doc: fix docs deployment GHA (#1055 )	2024-03-03 16:04:45 -08:00
Chang She	f95402af7c	doc: fix langchain link (#1053 )	2024-03-03 15:20:48 -08:00
Rob Meng	adf1a38f4d	fix: fix columns type for pydantic 2.x (#1045 )	2024-02-29 14:47:56 -05:00
Weston Pace	294c33a42e	feat: Initial remote table implementation for rust (#1024 ) This will eventually replace the remote table implementations in python and node.	2024-02-29 10:55:49 -08:00
Weston Pace	a6bcbd007b	feat: add a basic async python client starting point (#1014 ) This changes `lancedb` from a "pure python" setuptools project to a maturin project and adds a rust lancedb dependency. The async python client is extremely minimal (only `connect` and `Connection.table_names` are supported). The purpose of this PR is to get the infrastructure in place for building out the rest of the async client. Although this is not technically a breaking change (no APIs are changing) it is still a considerable change in the way the wheels are built because they now include the native shared library.	2024-02-27 04:52:02 -08:00
Weston Pace	f1596122e6	refactor: rename the rust crate from vectordb to lancedb (#1012 ) This also renames the new experimental node package to lancedb. The classic node package remains named vectordb. The goal here is to avoid introducing piecemeal breaking changes to the vectordb crate. Instead, once the new API is stabilized, we will officially release the lancedb crate and deprecate the vectordb crate. The same pattern will eventually happen with the npm package vectordb.	2024-02-22 19:56:39 -08:00
Will Jones	aec85f7875	ci: fix Node ARM release build (#971 ) When we turned on fat LTO builds, we made the release build job much more compute and memory intensive. The ARM runners have particularly low memory per core, which makes them susceptible to OOM errors. To avoid issues, I have enabled memory swap on ARM and bumped the size of the runner.	2024-02-14 13:02:09 -08:00
Will Jones	51f92ecb3d	ci: reduce number of build jobs on aarch64 to avoid OOM (#970 )	2024-02-13 17:33:09 -08:00
Weston Pace	24e8043150	chore: use a bigger runner for NPM publish jobs on aarch64 to avoid OOM (#955 )	2024-02-10 09:57:33 -08:00
Ayush Chaurasia	d982ee934a	feat(python): Reranker DX improvements (#904 ) - Most users might not know how to use the `QueryBuilder` object. Instead we should just pass the string query. - Add new rerankers: ColBERT, OpenAI	2024-02-06 13:59:31 +05:30
Lei Xu	62f053ac92	ci: bump to new version of python action to use node 20 GitHub Actions runtime (#909 ) GitHub Actions is deprecating the old node-16 runtime.	2024-02-01 11:36:03 -08:00
Lei Xu	a42df158a3	ci: change apple silicon runner to free OSS macos-14 target (#901 )	2024-01-30 11:05:42 -08:00
Lei Xu	b9c5323265	doc: use snippet for rust code example and make sure rust examples run through CI (#885 )	2024-01-28 14:30:30 -08:00
Lei Xu	e41a52863a	fix: fix doc build to include the source snippet correctly (#883 )	2024-01-28 11:55:58 -08:00
Lei Xu	22b9eceb12	chore: convert all js doc test to use snippet. (#881 )	2024-01-28 11:39:25 -08:00
Lei Xu	5f62302614	doc: use code snippet for typescript examples (#880 ) The typescript code is in a fully functional file that will be run via CI.	2024-01-27 22:52:37 -08:00
Lei Xu	ac94b2a420	chore: upgrade lance, pylance and datafusion (#879 )	2024-01-27 12:31:38 -08:00
Lei Xu	e910809de0	chore: bump github actions to v4 due to GHA warnings of node version deprecation (#874 )	2024-01-26 15:52:47 -08:00
Lei Xu	1cd5426aea	feat: rework NodeJS SDK using napi (#847 ) Use Napi to write a Node.js SDK that follows Polars for better maintainability, while keeping most of the logic in Rust.	2024-01-23 15:14:45 -08:00
Lei Xu	ccfd043939	feat: change create table to accept Arrow table (#845 )	2024-01-23 13:25:15 -08:00
Lei Xu	9a9fc77a95	doc: improve docs for nodejs connect functions (#833 ) * improve the docstring for NodeJS connect functions and `ConnectOptions` parameters. * Simplify `npm run build` steps.	2024-01-19 16:07:53 -08:00
Will Jones	d012db24c2	ci: lint and enforce linting (#829 ) @eddyxu added instructions for linting here: `7af213801a/python/README.md (L45-L50)` However, we had a lot of failures and weren't checking this in CI. This PR fixes all lints and adds a check to CI to keep us in compliance with the lints.	2024-01-19 13:09:14 -08:00
Lei Xu	fe2fb91a8b	chore: remove black as dependency (#808 ) We use `ruff` in CI and dev workflow now.	2024-01-11 10:58:49 -08:00
QianZhu	4d8e401d34	SaaS JS API sdk doc (#740 ) Co-authored-by: Aidan <64613310+aidangomar@users.noreply.github.com>	2024-01-03 16:24:21 -08:00
Chang She	7bbb2872de	bug(python): fix path handling in windows (#724 ) Use pathlib for local paths so that pathlib can handle the correct separator on windows. Closes #703 --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2023-12-20 15:41:36 -08:00
Will Jones	e81d2975da	chore: add issue templates (#732 ) This PR adds issue templates, which help with two recurring issues: * Users forget to tell us whether they are using the Node or Python SDK * Issues don't get appropriate tags This doesn't force the use of the templates. Because we set `blank_issues_enabled: true`, users can still create a custom issue.	2023-12-20 15:15:24 -08:00
Will Jones	2c7f96ba4f	ci: check formatting and clippy (#730 )	2023-12-20 13:37:51 -08:00
Chang She	bd0034a157	feat: support nested pydantic schema (#707 )	2023-12-14 18:20:45 -08:00
Will Jones	144b3b5d83	ci: fix broken npm publication (#704 ) Most recent release failed because `release` depends on `node-macos`, but we renamed `node-macos` to `node-macos-{x86,arm64}`. This fixes that by consolidating them back to a single `node-macos` job, which also has the side effect of making the file shorter.	2023-12-14 12:09:28 -08:00
Chang She	244b6919cc	chore: Use m1 runner for npm publish (#687 ) We had some build issues with npm publish for cross-compiling arm64 macos on an x86 macos runner. Switching to m1 runner for now until someone has time to deal with the feature flags. follow-up tracked here: #688	2023-12-07 15:49:52 -08:00
Lei Xu	d43ef7f11e	chore: apple silicon runner (#633 ) Close #632	2023-11-06 21:04:32 -08:00
Lei Xu	554e068917	chore: improve create_table API consistency between local and remote SDK (#627 )	2023-11-03 13:15:11 -07:00
Bert	8068a2bbc3	ci: cancel in progress runs on new push (#620 )	2023-11-01 11:33:48 -04:00
Will Jones	b2b70ea399	increment pylance (#618 )	2023-10-31 18:07:03 -07:00
Chang She	f20f19b804	feat: improve pydantic 1.x compat (#503 )	2023-09-18 19:01:30 -07:00
Chang She	c21f9cdda0	ci: fix docs build (#496 ) python/python.md contains typos in the class references --------- Co-authored-by: Chang She <chang@lancedb.com>	2023-09-18 13:07:21 -07:00
Rob Meng	731f86e44c	add health check to wait for all services to be ready before next step (#501 ) AWS integration tests are flaky because we didn't wait for the services to become healthy. (We only waited for the localstack service; this PR adds a wait for sub-services.)	2023-09-18 15:17:45 -04:00
Chang She	31dad71c94	multi-modal embedding-function (#484 )	2023-09-16 21:23:51 -04:00
Rob Meng	0554db03b3	propagate uri query string to lance; add aws integration tests (#486 ) # WARNING: specifying engine is NOT a publicly supported feature in lancedb yet. THE API WILL CHANGE. This PR exposes dynamodb based commit to `vectordb` and JS SDK (will do python in another PR since it's on a different release track) This PR also adds AWS integration tests using `localstack` ## What? This PR adds uri parameters to the DB connection string. Users may specify `engine` in the connection string to let LanceDB know that they want to use an external store when reading and writing a table. Users may also pass any parameters required by the commitStore in the connection string; these parameters will be propagated to lance. e.g. ``` vectordb.connect("s3://my-db-bucket?engine=ddb&ddbTableName=my-commit-table") ``` will automatically convert table path to ``` s3+ddb://my-db-bucket/my_table.lance?&ddbTableName=my-commit-table ```	2023-09-09 13:33:16 -04:00
Leon Yee	cf977866d8	[WIP] Workflow to trigger vectordb-recipes workflow (#371 )	2023-08-02 11:27:08 -07:00
gsilvestrin	1daecac648	fix(python): Pin pylance and add pandas as test dependency (#373 )	2023-07-27 15:21:45 -07:00
Will Jones	8829988ada	ci: build node in manylinux docker container (#350 ) Closes #359 TODO: * [x] test in a sample of Linux distro docker containers	2023-07-24 11:31:47 -07:00
gsilvestrin	4383848d53	feat(node): Add Linux ARM build (#348 )	2023-07-21 15:33:02 -07:00
gsilvestrin	473c43860c	bugfix: Set Github token when pushing changes (#351 )	2023-07-21 15:31:44 -07:00
gsilvestrin	21b1a71a6b	bugfix(node): Don't persist credentials on make-release-commit.yml (#345 )	2023-07-20 13:24:06 -07:00
gsilvestrin	2d899675e8	bugfix(node): Make release task can't push to repo (#344 )	2023-07-20 13:15:29 -07:00
gsilvestrin	a2bb497135	feat(node) Move native packages to @lancedb NPM org (#341 ) - Move native packages to @lancedb org - Move package-lock.json update to a reusable action and created a target to run it manually.	2023-07-20 12:54:39 -07:00

85 Commits