Mirror of https://github.com/lancedb/lancedb.git (synced 2026-05-23 15:00:39 +00:00)
This wires Lance's existing `jieba/*` and `lindera/*` native FTS tokenizers through the Python SDK; until now they sat behind disabled features and overly narrow public typing. It also documents the `LANCE_LANGUAGE_MODEL_HOME` model layout and adds Python test coverage for successful CJK indexing and for the error guidance shown when a model is missing. Closes #2168.
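As a rough illustration of the missing-model guidance mentioned above, the sketch below checks for a model directory before any indexing is attempted. It assumes that a tokenizer name such as `jieba/default` maps to a same-named subdirectory under `LANCE_LANGUAGE_MODEL_HOME`; that mapping, the helper names `model_dir` and `require_model`, and the error wording are illustrative assumptions, not the SDK's actual behavior.

```python
import os
from pathlib import Path
from typing import Optional


def model_dir(tokenizer: str, home: Optional[str] = None) -> Path:
    """Return the directory assumed to hold the model for `tokenizer`.

    Hypothetical helper: assumes "jieba/default" lives at
    <LANCE_LANGUAGE_MODEL_HOME>/jieba/default.
    """
    root = home or os.environ.get("LANCE_LANGUAGE_MODEL_HOME")
    if not root:
        raise RuntimeError("LANCE_LANGUAGE_MODEL_HOME is not set")
    # Split "family/name" into path components under the model home.
    return Path(root).joinpath(*tokenizer.split("/"))


def require_model(tokenizer: str, home: Optional[str] = None) -> Path:
    """Return the model directory, or fail with a message naming the path."""
    path = model_dir(tokenizer, home)
    if not path.is_dir():
        raise FileNotFoundError(
            f"no model found for tokenizer {tokenizer!r}; expected a directory at {path}"
        )
    return path
```

Failing early with the expected path in the message is one way to make a missing-model error actionable; the real SDK may report it differently.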
9 lines
105 B
Plaintext
我们 98740 r
都 202780 d
有 423765 v
光明 1219 n
的 318825 uj
前途 1263 n
前 62779 f
途 857 n
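Each line of this file follows the jieba dictionary convention of a word, a frequency, and a part-of-speech tag separated by spaces. The sketch below parses that layout into typed records; the names `DictEntry` and `parse_dict` are hypothetical and are not part of Lance or LanceDB.

```python
from typing import List, NamedTuple


class DictEntry(NamedTuple):
    word: str
    freq: int
    tag: str  # part-of-speech tag; empty string if the line omits it


def parse_dict(text: str) -> List[DictEntry]:
    """Parse jieba-style dictionary lines of the form `word freq [tag]`."""
    entries = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        tag = parts[2] if len(parts) > 2 else ""
        entries.append(DictEntry(parts[0], int(parts[1]), tag))
    return entries


SAMPLE = """\
我们 98740 r
都 202780 d
有 423765 v
光明 1219 n
的 318825 uj
前途 1263 n
前 62779 f
途 857 n
"""

entries = parse_dict(SAMPLE)
```

Note that the file lists `前途` alongside its component characters `前` and `途`, so a segmenter can weigh the compound against the single-character split using the frequencies.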