feat(python): batch Ollama embed calls (#2453)

mirror of https://github.com/lancedb/lancedb.git synced 2026-01-17 01:02:59 +00:00

Other embedding integrations such as Cohere and OpenAI already send
requests in batches. We should do that for Ollama too to improve
throughput.

The Ollama [`.embed`
API](63ca747622/ollama/_client.py (L359-L378))
was added in version 0.3.0 (almost a year ago) so I updated the version
requirement in pyproject.

<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->

## Summary by CodeRabbit

- **Bug Fixes**
- Improved compatibility with newer versions of the "ollama" package by
requiring version 0.3.0 or higher.
- Enhanced embedding generation to process batches of texts more
efficiently and reliably.
- **Refactor**
	- Improved type consistency and clarity for embedding-related methods.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->

This commit is contained in:

Haoyu Weng

2025-06-30 11:28:14 -04:00

committed by

GitHub

parent ba755626cc

commit eb9784d7f2

2 changed files with 16 additions and 17 deletions

									
										2

python/pyproject.toml
									
												View File
												
				@@ -85,7 +85,7 @@ embeddings = [

				    "boto3>=1.28.57",

				    "awscli>=1.29.57",

				    "botocore>=1.31.57",

				    "ollama",

				    "ollama>=0.3.0",

				    "ibm-watsonx-ai>=1.1.2",

				]

				azure = ["adlfs>=2024.2.0"]

feat(python): batch Ollama embed calls (#2453)

2 python/pyproject.toml Unescape Escape View File

2

python/pyproject.toml

View File