Search results
3 packages found
Additional tokenizers for Orama
- full-text search
- search
- fuzzy search
- typo-tolerant search
- full-text
- vector search
- stemming
- tokenizers
- mandarin
- chinese
published 2.1.1 5 days ago
M
Q
P
cpp tokenizer module for fibjs.
published 1.2.1 2 months ago
M
Q
P
This repository holds the code for the TokenGeeX Rust crate and Python package. TokenGeeX is a tokenizer for [CodeGeeX](https://github.com/THUDM/Codegeex2) aimed at code and Chinese. It is based on [UnigramLM (Taku Kudo 2018)](https://arxiv.org/abs/1804.1
published 0.6.2 6 months ago
M
Q
P