Search results

460 packages found

A class Tokenizer to convert text documents into sequences of tokens

published 0.0.3 a year ago
M
Q
P

Wix Restaurants credit-cards tokenizer

published 1.2.902 4 years ago
M
Q
P

TS tokenizer for Mistral-based LLMs

published 1.2.0 5 months ago
M
Q
P

stream-json is the micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory streaming individual primitives using a SAX-inspired API. I

published 1.8.0 a year ago
M
Q
P

JavaScript implementation of Japanese morphological analyzer

published 0.1.2 6 years ago
M
Q
P

A React Native supported JavaScript implementation of Japanese morphological analyzer

published 0.2.1 5 months ago
M
Q
P

Fastly VCL tokenizer

published 0.1.0 2 months ago
M
Q
P

Isomorphic utilities for GPT-3 tokenization and prompt building.

published 1.2.0 a year ago
M
Q
P

Tokenizes a string that represents a regular expression.

published 0.5.0 a year ago
M
Q
P

Simple algorithm to tokenize Chinese texts into words using CC-CEDICT.

published 2.4.0 5 years ago
M
Q
P

A CLI tool to concatenate all text files in your CWD with headers for GPT prompt engineering.

published 1.0.2 4 months ago
M
Q
P

Parse PHP code from JS and returns its AST

published 3.1.5 a year ago
M
Q
P

TS tokenizer for Mistral-based LLMs

published 1.2.2 2 months ago
M
Q
P

Simple synchronous string tokenizer using Regex

published 2.0.0 7 years ago
M
Q
P

TDOP parser

published 10.0.2 2 months ago
M
Q
P

Simple, but powerful lexical scanner that is a more minimal implementation of X-Scanner

published 0.7.9 2 years ago
M
Q
P

A streaming JSON tokenizer

published 1.1.0 7 years ago
M
Q
P

A tokenizer for Google-like search queries

published 2.1.1 7 years ago
M
Q
P

A small ECMAScript parser, tokenizer and minifier written in JavaScript.

published 2.5.4 8 years ago
M
Q
P

Time a JavaScript tokenizer

published 1.1.0 7 years ago
M
Q
P