lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2025-12-25 14:29:56 +00:00

Author	SHA1	Message	Date
Will Jones	2fc174f532	docs: add sync/async tabs to quickstart (#2087) Closes #2033	2025-01-31 15:43:54 -08:00
Will Jones	980aa70e2d	feat(python): async-sync feature parity on Table (#1914) ### Changes to sync API * Updated `LanceTable` and `LanceDBConnection` reprs * Add `storage_options`, `data_storage_version`, and `enable_v2_manifest_paths` to sync create table API. * Add `storage_options` to `open_table` in sync API. * Add `list_indices()` and `index_stats()` to sync API * `create_table()` will now create only 1 version when data is passed. Previously it would always create two versions: 1 to create an empty table and 1 to add data to it. ### Changes to async API * Add `embedding_functions` to async `create_table()` API. * Added `head()` to async API ### Refactors * Refactor index parameters into dataclasses so they are easier to use from Python * Moved most tests to use an in-memory DB so we don't need to create so many temp directories Closes #1792 Closes #1932 --------- Co-authored-by: Weston Pace <weston.pace@gmail.com>	2024-12-13 12:56:44 -08:00
Will Jones	0fd8a50bd7	ci(node): run examples in CI (#1796) This is done as setup for a PR that will fix the OpenAI dependency issue. * [x] FTS examples * [x] Setup mock openai * [x] Ran `npm audit fix` * [x] sentences embeddings test * [x] Double check formatting of docs examples	2024-11-13 11:10:56 -08:00
Rithik Kumar	fde636ca2e	docs: fix links - quick start to embedding (#1591)	2024-09-02 21:55:35 +05:30
Cory Grinstead	2276b114c5	docs: add installation note about yarn (#1459) I noticed that setting up a simple project with [Yarn](https://yarnpkg.com/) failed because, unlike other package managers (npm, pnpm, bun), Yarn does not automatically resolve peer dependencies, so I added a quick note about it in the installation guide.	2024-07-19 18:48:24 -05:00
Cory Grinstead	31be9212da	docs(nodejs): add @lancedb/lancedb examples everywhere (#1411) Co-authored-by: Will Jones <willjones127@gmail.com>	2024-07-10 13:29:03 -05:00
Ayush Chaurasia	5e30648f45	docs: fix example path (#1367)	2024-06-07 19:40:50 -07:00
Ayush Chaurasia	76fc16c7a1	docs: add retriever guide, address minor onboarding feedback & enhancement (#1326) - Tried to address some of the onboarding feedback listed in https://github.com/lancedb/lancedb/issues/1224 - Improve visibility of pydantic integration and embedding API. (Based on onboarding feedback: there are many ways of ingesting data and defining a schema, but it is not clear which to use in a specific use case) - Add a guide that takes users through testing and improving retriever performance using built-in utilities like hybrid-search and reranking - Add some benchmarks for the above - Add missing cohere docs --------- Co-authored-by: Weston Pace <weston.pace@gmail.com>	2024-06-08 06:25:31 +05:30
Will Jones	5349e8b1db	ci: make preview releases (#1302) This PR changes the release process. Some parts are more complex, and other parts I've simplified. ## Simplifications * Combined `Create Release Commit` and `Create Python Release Commit` into a single workflow. By default, it does a release of all packages, but you can still choose to make just a Python or just Node/Rust release through the arguments. This will make it rarer that we create a Node release but forget about Python or vice-versa. * Releases are automatically generated once a tag is pushed. This eliminates the manual step of creating the release. * Release notes are automatically generated and changes are categorized based on the PR labels. * Removed the use of `LANCEDB_RELEASE_TOKEN` in favor of just using `GITHUB_TOKEN` where it wasn't necessary. In the one place it is necessary, I left a comment as to why it is. * Reused the version in `python/Cargo.toml` so we don't have two different versions in Python LanceDB. ## New changes * We now can create `preview` / `beta` releases. By default `Create Release Commit` will create a preview release, but you can select a "stable" release type and it will create a full stable release. * For Python, pre-releases go to fury.io instead of PyPI * `bump2version` was deprecated, so upgraded to `bump-my-version`. This also seems to better support semantic versioning with pre-releases. * `ci` changes will now be shown in the changelog, allowing changes like this to be visible to users. `chore` is still hidden. ## Versioning NOTE: unlike how it is in lance repo right now, the version in main is the last one released, including beta versions. --------- Co-authored-by: Lance Release <lance-dev@lancedb.com> Co-authored-by: Weston Pace <weston.pace@gmail.com>	2024-05-17 11:24:38 -07:00
Weston Pace	e21b56293c	docs: add a reference to @lancedb/lance in the docs (#1166) We aren't yet ready to switch over the examples since almost all JS examples rely on embeddings and we haven't yet ported those over. However, this makes it possible for those that are interested to start using `@lancedb/lancedb`	2024-04-05 16:34:39 -07:00
Weston Pace	f97c7dad8c	docs: add the async python API to the docs (#1156)	2024-04-05 16:34:37 -07:00
Weston Pace	0fe0976a0e	docs: add links to rust SDK docs, remove references to rust SDK being unstable / experimental (#1131)	2024-04-05 16:33:05 -07:00
Prashanth Rao	f9c244e608	[docs]: Fix issues with Rust code snippets in "quick start" (#1047) The renaming of `vectordb` to `lancedb` broke the [quick start docs](https://lancedb.github.io/lancedb/basic/#__tabbed_5_3) (it's pointing to a non-existent directory). This PR fixes the code snippets and the paths in the docs page. Additionally, more fixes related to indexing docs below 👇🏽.	2024-04-05 16:31:36 -07:00
Lei Xu	911d063237	doc: fix js example of create index (#886)	2024-04-05 16:28:56 -07:00
Lei Xu	12e776821a	doc: use snippet for rust code example and make sure rust examples run through CI (#885)	2024-04-05 16:28:56 -07:00
Chang She	1d0578ce25	doc(rust): minor fixes for Rust quick start. (#878)	2024-04-05 16:28:56 -07:00
Lei Xu	d811b89de2	doc: use code snippet for typescript examples (#880) The TypeScript code is in a fully functional file that will be run via the CI.	2024-04-05 16:28:56 -07:00
Lei Xu	36dbf47d60	chore: add one rust SDK e2e example (#876) Co-authored-by: Chang She <759245+changhiskhan@users.noreply.github.com>	2024-04-05 16:28:56 -07:00
Lei Xu	fd2fd94862	doc: update quick start for full rust example (#872)	2024-04-05 16:28:56 -07:00
Prashanth Rao	4d5d748acd	docs: Updates and refactor (#683) This PR makes incremental changes to the documentation. * Closes #697 * Closes #698 - [x] Add dark mode - [x] Fix headers in navbar - [x] Add `extra.css` to customize navbar styles - [x] Customize fonts for prose/code blocks, navbar and admonitions - [x] Inspect all admonition boxes (remove redundant dropdowns) and improve clarity and readability - [x] Ensure that all images in the docs have white background (not transparent) to be viewable in dark mode - [x] Improve code formatting in code blocks to make them consistent with autoformatters (eslint/ruff) - [x] Add bolder weight to h1 headers - [x] Add diagram showing the difference between embedded (OSS) and serverless (Cloud) - [x] Fix [Creating an empty table](https://lancedb.github.io/lancedb/guides/tables/#creating-empty-table) section: right now, the subheaders are not clickable. - [x] In critical data ingestion methods like `table.add` (among others), the type signature often does not match the actual code - [x] Proof-read each documentation section and rewrite as necessary to provide more context, use cases, and explanations so it reads less like reference documentation. This is especially important for CRUD and search sections since those are so central to the user experience. - [x] The section for [Adding data](https://lancedb.github.io/lancedb/guides/tables/#adding-to-a-table) only shows examples for pandas and iterables. We should include pydantic models, arrow tables, etc. - [x] Add conceptual tutorial for IVF-PQ index - [x] Clearly separate vector search, FTS and filtering sections so that these are easier to find - [x] Add docs on refine factor to explain its importance for recall. Closes #716 - [x] Add an FAQ page showing answers to commonly asked questions about LanceDB. Closes #746 - [x] Add simple polars example to the integrations section. Closes #756 and closes #153 - [ ] Add basic docs for the Rust API (more detailed API docs can come later). Closes #781 - [x] Add a section on the various storage options on local vs. cloud (S3, EBS, EFS, local disk, etc.) and the tradeoffs involved. Closes #782 - [x] Revamp filtering docs: add pre-filtering examples and redo headers and update content for SQL filters. Closes #783 and closes #784. - [x] Add docs for data management: compaction, cleaning up old versions and incremental indexing. Closes #785 - [ ] Add a benchmark section that also discusses some best practices. Closes #787 --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com> Co-authored-by: Will Jones <willjones127@gmail.com>	2024-04-05 16:27:12 -07:00
Chang She	ac3d95ec34	feat(python): allow the entire table to be converted to a polars dataframe (#814)	2024-04-05 16:26:36 -07:00
Chang She	5376970e87	doc(javascript): minor improvement on docs for working with tables (#736) Closes #639 Closes #638	2024-04-05 16:24:47 -07:00
Chang She	8469d010f8	feat: add to_list and to_pandas APIs (#556) Add `to_list` to return query results as a list of Python dicts (so we're not too pandas-centric). Closes #555 Add `to_pandas` API and add deprecation warning on `to_df`. Closes #545 Co-authored-by: Chang She <chang@lancedb.com>	2024-04-05 16:22:59 -07:00
Prashanth Rao	1d1f8964d2	[DOCS][PYTHON] Update docs for clarity (#531) I only modified those docs pages that are untouched in existing unmerged PRs, so hopefully there are no merge conflicts! 1. The `tantivy-py` version specified in the docs doesn't work (pip install fails), but with the latest version of pip and wheel installed on my Mac, I was able to just `pip install tantivy` and FTS works great for me. I updated the docs page to include this in `7ca4b757ce` but can always modify to another specific version in case this breaks any tests. 2. The `.add()` method for Python should take in a list of dicts as the first option (to also align with the JS API), and additionally, users can pass an existing pandas DataFrame to add to a table. Hope this makes sense. 3. I've had multiple conversations with users who are unclear that the terms "exhaustive", "flat" and "KNN" are all the same kind of search, so I've updated the verbiage of this section to clarify this. 4. Fixed typos and improved clarity in the ANN indexes page.	2023-10-03 09:46:53 +05:30
Ayush Chaurasia	cc916389a6	[DOCS] Major Docs Revamp (#435)	2023-08-22 14:06:26 -07:00
Chang She	a54d1e5618	Automatically convert pydantic model (#400) Saves users from having to explicitly call `LanceModel.to_arrow_schema()` when creating an empty table. See new docs for full details. --------- Co-authored-by: Chang She <chang@lancedb.com>	2023-08-06 14:50:03 -07:00
Will Jones	3537afb2c3	docs: show how to delete rows in user guide (#309) Closes #265	2023-07-18 08:19:48 -07:00
Tevin Wang	b731a6aed9	Add docs code testing & documentation syntax changes (#196 ) - Creates testing files `md_testing.py` and `md_testing.js` for testing python and nodejs code in markdown files in the documentation This listens for HTML tags as well: `<!--[language] code code code...-->` will create a set-up file to create some mock tables or to fulfill some assumptions in the documentation. - Creates a github action workflow that triggers every push/pr to `docs/**` - Modifies documentation so tests run (mostly indentation, some small syntax errors and some missing imports) A list of excluded files that we need to take a closer look at later on: ```javascript const excludedFiles = [ "../src/fts.md", "../src/embedding.md", "../src/examples/serverless_lancedb_with_s3_and_lambda.md", "../src/examples/serverless_qa_bot_with_modal_and_langchain.md", "../src/examples/youtube_transcript_bot_with_nodejs.md", ]; ``` Many of them can't be done because we need the OpenAI API key :(. `fts.md` has some issues with the library, I believe this is still experimental? Closes #170 --------- Co-authored-by: Will Jones <willjones127@gmail.com>	2023-06-28 11:07:26 -07:00
Jai	8af5f19cc1	js docs, modal example, doc notebook integration, update doc styles (#131)	2023-06-02 15:24:16 -07:00
Chang She	d7c5793803	Add mode to overwrite table if already exists	2023-04-19 16:22:11 -07:00
Chang She	cdb534076f	[DOC] basic operations	2023-04-19 14:29:03 -07:00

31 Commits