lancedb

mirror of https://github.com/lancedb/lancedb.git synced 2025-12-27 23:12:58 +00:00

Author	SHA1	Message	Date
Rithik Kumar	aa269199ad	docs: fix archived examples links (#1751 )	2024-10-29 22:55:27 +05:30
BubbleCal	32fdcf97db	feat!: upgrade lance to 0.19.1 (#1762 ) BREAKING CHANGE: default tokenizer no longer does stemming or stop-word removal. Users should explicitly turn that option on in the future. - upgrade lance to 0.19.1 - update the FTS docs - update the FTS API Upstream change notes: https://github.com/lancedb/lance/releases/tag/v0.19.1 --------- Signed-off-by: BubbleCal <bubble-cal@outlook.com> Co-authored-by: Will Jones <willjones127@gmail.com>	2024-10-29 09:03:52 -07:00
Will Jones	48f46d4751	docs(node): update `indexStats` signature and regenerate docs (#1742 ) `indexStats` still referenced UUID even though in https://github.com/lancedb/lancedb/pull/1702 we changed it to take name instead.	2024-10-18 10:53:28 -07:00
Dominik Weckmüller	e7b56b7b2a	docs: add permanent link chain icon to headings without impacting SEO (#1746 ) I noted that there are no permanent links in the docs. Adapted the current best solution from https://github.com/squidfunk/mkdocs-material/discussions/3535. It adds a GitHub-like chain icon to the left of each heading (right on mobile) and does not impact SEO unlike the default solution with pilcrow char `¶` that might show up on google search results. <img alt="image" src="https://user-images.githubusercontent.com/182589/153004627-6df3f8e9-c747-4f43-bd62-a8dabaa96c3f.gif">	2024-10-14 11:58:23 -07:00
Olzhas Alexandrov	5ccd0edec2	docs: clarify infrastructure requirements for S3 Express One Zone (#1745 )	2024-10-11 14:06:28 -06:00
Rithik Kumar	6ceaf8b06e	docs: add langchainjs writing assistant (#1719 )	2024-10-03 00:55:00 +05:30
Prashant Dixit	e2ca8daee1	docs: saleforce's sfr rag (#1717 ) This PR adds Salesforce's newly released SFR RAG	2024-10-02 21:15:24 +05:30
Rithik Kumar	7b2cdd2269	docs: revamp Voxel51 v1 (#1714 ) Revamp Voxel51 ![image](https://github.com/user-attachments/assets/7ac34457-74ec-4654-b1d1-556e3d7357f5)	2024-10-01 11:59:03 +05:30
Akash Saravanan	d6b5054778	feat(python): add support for trust_remote_code in hf embeddings (#1712 ) Resovles #1709. Adds `trust_remote_code` as a parameter to the `TransformersEmbeddingFunction` class with a default of False. Updated relevant documentation with the same.	2024-10-01 01:06:28 +05:30
Ayush Chaurasia	86978e7588	feat!: enforce all rerankers always return relevance score & deprecate linear combination fixes (#1687 ) - Enforce all rerankers always return _relevance_score. This was already loosely done in tests before but based on user feedback its better to always have _relevance_score present in all reranked results - Deprecate LinearCombinationReranker in docs. And also fix a case where it would not return _relevance_score if one result set was missing	2024-09-23 12:12:02 +05:30
Rithik Kumar	11072b9edc	docs: phidata integration page (#1678 ) Added new integration page for phidata : ![image](https://github.com/user-attachments/assets/8cd9b420-f249-4eac-ac13-ae53983822be)	2024-09-21 00:40:47 +05:30
Rithik Kumar	dcd5f51036	docs: add understand embeddings v1 (#1643 ) Before getting started with managing embeddings. Let's understand embeddings (LanceDB way) ![Screenshot 2024-09-14 012144](https://github.com/user-attachments/assets/7c5435dc-5316-47e9-8d7d-9994ab13b93d)	2024-09-14 02:07:00 +05:30
BubbleCal	bf7d2d6fb0	docs: update FTS docs for JS SDK (#1634 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2024-09-13 05:48:29 -07:00
Prashant Dixit	b3bf6386c3	docs: rag section in guide (#1619 ) This PR adds the RAG section in the Guides. It includes all the RAGs with code snippet and some advanced techniques which improves RAG.	2024-09-11 21:13:55 +05:30
BubbleCal	4b79db72bf	docs: improve the docs and API param name (#1629 ) Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2024-09-11 10:18:29 +08:00
BubbleCal	2bde5401eb	feat: support to build FTS without positions (#1621 )	2024-09-10 22:51:32 +08:00
Jon X	7eb3b52297	docs: added a blank line between a paragraph and a list block (#1604 ) Though the markdown can be rendered well on GitHub (GFM style?), but it seems that it's required to insert a blank line between a paragraph and a list block to make it render well with `mkdocs`? see also the web page: https://lancedb.github.io/lancedb/concepts/index_hnsw/	2024-09-06 09:38:19 +05:30
Philip Zeyliger	1d61717d0e	docs: fix get_registry() usage (#1601 ) Docs used `get_registry.get(...)` whereas what works is `get_registry().get(...)`. Fixing the two instances I found. I tested the open clip version by trying it locally in a Jupyter notebook.	2024-09-06 01:48:24 +05:30
Rithik Kumar	2bc7dca3ca	docs: add changes to Embeddings-> Available models-> overview page (#1596 ) adding features and improvements to - Manage Embeddings page Before: ![Screenshot 2024-09-04 223743](https://github.com/user-attachments/assets/f1e116b5-6ebb-4d59-9d29-b20084998cd0) After: ![Screenshot 2024-09-05 214214](https://github.com/user-attachments/assets/8c94318e-68af-447e-97e1-8153860a2914) ![Screenshot 2024-09-05 213623](https://github.com/user-attachments/assets/55c82770-6df9-4bab-9c5c-1ea1552138de) ![Screenshot 2024-09-05 215931](https://github.com/user-attachments/assets/9bfac7d4-16a6-454e-801e-50789ff75261)	2024-09-05 22:19:08 +05:30
Jon X	2b8e872be0	docs: removed the unnecessary fence code tag (#1599 )	2024-09-05 14:40:38 +05:30
Ayush Chaurasia	03ef1dc081	feat: update default reranker to RRF (#1580 ) - Both LinearCombination (the current default) and RRF are pretty fast compared to model based rerankers. RRF is slightly faster. - In our tests RRF has also been slightly more accurate. This PR: - Makes RRF the default reranker - Removed duplicate docs for rerankers	2024-09-03 14:00:13 +05:30
Rithik Kumar	fde636ca2e	docs: fix links - quick start to embedding (#1591 )	2024-09-02 21:55:35 +05:30
Ayush Chaurasia	51966a84f5	docs: add multi-vector reranking, answerdotai and studies section (#1579 )	2024-08-31 04:09:14 +05:30
Rithik Kumar	38015ffa7c	docs: improve overall language on all example pages (#1582 ) Refine and improve the language clarity and quality across all example pages in the documentation to ensure better understanding and readability. --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-31 03:48:11 +05:30
Ayush Chaurasia	dc72ece847	feat!: better api for manual hybrid queries (#1575 ) Currently, the only documented way of performing hybrid search is by using embedding API and passing string queries that get automatically embedded. There are use cases where users might like to pass vectors and text manually instead. This ticket contains more information and historical context - https://github.com/lancedb/lancedb/issues/937 This breaks a undocumented pathway that allowed passing (vector, text) tuple queries which was intended to be temporary, so this is marked as a breaking change. For all practical purposes, this should not really impact most users ### usage ``` results = table.search(query_type="hybrid") .vector(vector_query) .text(text_query) .limit(5) .to_pandas() ```	2024-08-30 17:37:58 +05:30
Ayush Chaurasia	bfe8fccfab	docs: add hnsw docs (#1570 )	2024-08-29 15:16:27 +05:30
Rithik Kumar	6f6eb170a9	docs: revamp Python example: Overview page and remove redundant examples and notebooks (#1574 ) before: ![Screenshot 2024-08-29 131656](https://github.com/user-attachments/assets/81cb5d70-5dff-4e57-8bbe-3461327aed7d) After: ![Screenshot 2024-08-29 131715](https://github.com/user-attachments/assets/62109a37-7f66-4fd4-90ed-906a85472117) --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-29 13:48:10 +05:30
Rithik Kumar	dd1c16bbaf	docs: fix links, convert backslash to forward slash in mkdocs.yml (#1571 ) Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-28 16:07:57 +05:30
Rithik Kumar	ae85008714	docs: revamp embedding models (#1568 ) before: ![Screenshot 2024-08-27 151525](https://github.com/user-attachments/assets/d4f8f2b9-37e6-4a31-b144-01b804019e11) After: ![Screenshot 2024-08-27 151550](https://github.com/user-attachments/assets/79fe7d27-8f14-4d80-9b41-a1e91f8c708f) --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-27 17:14:35 +05:30
Bill Chambers	9c25998110	docs: update serverless_lancedb_with_s3_and_lambda.md (#1559 )	2024-08-26 14:55:28 +05:30
Ayush Chaurasia	549ca51a8a	feat: add answerdotai rerankers support and minor improvements (#1560 ) This PR: - Adds missing license headers - Integrates with answerdotai Rerankers package - Updates ColbertReranker to subclass answerdotai package. This is done to keep backwards compatibility as some users might be used to importing ColbertReranker directly - Set `trust_remote_code` to ` True` by default in CrossEncoder and sentence-transformer based rerankers	2024-08-26 13:25:10 +05:30
Rithik Kumar	632007d0e2	docs: add recommender system example (#1561 ) before: ![Screenshot 2024-08-24 230216](https://github.com/user-attachments/assets/cc8a810a-b032-45d7-b086-b2ef0720dc16) After: ![Screenshot 2024-08-24 230228](https://github.com/user-attachments/assets/eaa1dc31-ac7f-4b81-aa79-b4cf94f0cbd5) --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-25 12:30:30 +05:30
rahuljo	6ad5553eca	docs: add dlt-lancedb integration page (#1551 ) Co-authored-by: Akela Drissner-Schmid <32450038+akelad@users.noreply.github.com>	2024-08-22 15:18:49 +05:30
Rithik Kumar	758c82858f	docs: add AI agent example (#1553 ) before: ![Screenshot 2024-08-21 225014](https://github.com/user-attachments/assets/e5b05586-87c5-4739-a4df-2d6cd0704ba5) After: ![Screenshot 2024-08-21 225029](https://github.com/user-attachments/assets/504959db-f560-49b2-9492-557e9846a793) --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-22 00:54:05 +05:30
Rithik Kumar	0cbc9cd551	docs: add evaluation example (#1552 ) before: ![Screenshot 2024-08-21 194228](https://github.com/user-attachments/assets/68d96658-7579-4934-85af-e8c898b64660) After: ![Screenshot 2024-08-21 195258](https://github.com/user-attachments/assets/81ddb9cd-cb93-47fc-a121-ff82701fd11f) --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-21 20:37:04 +05:30
Ayush Chaurasia	85bb7e54e4	docs: missing griffe dependency for mkdocs deployment (#1545 )	2024-08-19 07:48:23 +05:30
Rithik Kumar	21014cab45	docs: add chatbot example and improve quality of other examples (#1544 )	2024-08-17 12:35:33 +05:30
Lei Xu	5857cb4c6e	docs: add a section to describe scalar index (#1495 )	2024-08-16 18:48:29 -07:00
Rithik Kumar	09ce6c5bb5	docs: add vector search example (#1543 )	2024-08-16 21:30:45 +05:30
Lei Xu	b2317c904d	feat: create bitmap and label list scalar index using python async api (#1529 ) * Expose `bitmap` and `LabelList` scalar index type via Rust and Async Python API * Add documents	2024-08-11 09:16:11 -07:00
Ayush Chaurasia	a88e9bb134	docs: add lancedb embedding fcn on cloud docs (#1521 )	2024-08-09 07:21:04 +05:30
BubbleCal	f9d5fa88a1	feat!: migrate FTS from tantivy to lance-index (#1483 ) Lance now supports FTS, so add it into lancedb Python, TypeScript and Rust SDKs. For Python, we still use tantivy based FTS by default because the lance FTS index now misses some features of tantivy. For Python: - Support to create lance based FTS index - Support to specify columns for full text search (only available for lance based FTS index) For TypeScript: - Change the search method so that it can accept both string and vector - Support full text search For Rust - Support full text search The others: - Update the FTS doc BREAKING CHANGE: - for Python, this renames the attached score column of FTS from "score" to "_score", this could be a breaking change for users that rely the scores --------- Signed-off-by: BubbleCal <bubble-cal@outlook.com>	2024-08-08 15:33:15 +08:00
Rithik Kumar	a62f661d90	docs: revamp example docs (#1512 ) Before: ![Screenshot 2024-08-07 015834](https://github.com/user-attachments/assets/b817f846-78b3-4d6f-b4a0-dfa3f4d6be87) After: ![Screenshot 2024-08-07 015852](https://github.com/user-attachments/assets/53370301-8c40-45f8-abe3-32f9d051597e) ![Screenshot 2024-08-07 015934](https://github.com/user-attachments/assets/63cdd038-32bb-4b3e-b9c4-1389d2754014) ![Screenshot 2024-08-07 015941](https://github.com/user-attachments/assets/70388680-9c2b-49ef-ba00-2bb015988214) ![Screenshot 2024-08-07 015949](https://github.com/user-attachments/assets/76335a33-bb6f-473c-896f-447320abcc25) --------- Co-authored-by: Ayush Chaurasia <ayush.chaurarsia@gmail.com>	2024-08-07 03:56:59 +05:30
Robby	8d2ff7b210	feat(python): add watsonx embeddings to registry (#1486 ) Related issue: https://github.com/lancedb/lancedb/issues/1412 --------- Co-authored-by: Robby <h0rv@users.noreply.github.com>	2024-08-06 10:58:33 +05:30
Rithik Kumar	d297da5a7e	docs: update examples docs (#1488 ) Testing Workflow with my first PR. Before: ![Screenshot 2024-08-01 183326](https://github.com/user-attachments/assets/83d22101-8bbf-4b18-81e4-f740e605727a) After: ![Screenshot 2024-08-01 183333](https://github.com/user-attachments/assets/a5e4cd2c-c524-4009-81d5-75b2b0361f83)	2024-08-01 18:54:45 +05:30
Cory Grinstead	a062a92f6b	docs: custom embedding function for ts (#1479 )	2024-07-30 18:19:55 -05:00
Ayush Chaurasia	513926960d	docs: add rrf docs and update reranking notebook with Jina reranker results (#1474 ) - RRF reranker - Jina Reranker results --------- Co-authored-by: Weston Pace <weston.pace@gmail.com>	2024-07-25 22:29:46 +05:30
inn-0	cc507ca766	docs: add missing whitespace before markdown table to fix rendering issue (#1471 ) ### Fix markdown table rendering issue This PR adds a missing whitespace before a markdown table in the documentation. This issue causes the table to not render properly in mkdocs, while it does render properly in GitHub's markdown viewer. #### Change Details: - Added a single line of whitespace before the markdown table to ensure proper rendering in mkdocs. #### Note: - I wasn't able to test this fix in the mkdocs environment, but it should be safe as it only involves adding whitespace which won't break anything. --- Cohere supports following input types: \| Input Type \| Description \| \|-------------------------\|---------------------------------------\| \| "`search_document`" \| Used for embeddings stored in a vector\| \| \| database for search use-cases. \| \| "`search_query`" \| Used for embeddings of search queries \| \| \| run against a vector DB \| \| "`semantic_similarity`" \| Specifies the given text will be used \| \| \| for Semantic Textual Similarity (STS) \| \| "`classification`" \| Used for embeddings passed through a \| \| \| text classifier. \| \| "`clustering`" \| Used for the embeddings run through a \| \| \| clustering algorithm \| Usage Example:	2024-07-24 22:26:28 +05:30
Lei Xu	c9c61eb060	docs: expose merge_insert doc for remote python SDK (#1464 ) `merge_insert` API is not shown up on [`RemoteTable`](https://lancedb.github.io/lancedb/python/saas-python/#lancedb.remote.table.RemoteTable) today * Also bump `ruff` version as well	2024-07-22 10:48:16 -07:00
Cory Grinstead	69295548cc	docs: minor updates for js migration guides (#1451 ) Co-authored-by: Will Jones <willjones127@gmail.com>	2024-07-22 10:26:49 -07:00

1 2 3 4 5 ...

396 Commits