mirror of
https://github.com/lancedb/lancedb.git
synced 2026-05-22 14:30:41 +00:00
This wires Lance's existing `jieba/*` and `lindera/*` native FTS tokenizers through the Python SDK instead of leaving them behind disabled features and narrow public typing. It also documents the `LANCE_LANGUAGE_MODEL_HOME` model layout and adds Python coverage for successful CJK indexing plus missing-model error guidance. Closes #2168.
4 lines
95 B
YAML
4 lines
95 B
YAML
segmenter:
|
|
mode: "normal"
|
|
dictionary:
|
|
path: "./python/tests/models/lindera/ipadic/main" |