Search results
88 packages found
A command-line tool for processing text
Simple parser that triggers events when data is received and processed
This module provides a set of NLP tools
Feature hashing, also known as the hashing trick: a fast and space-efficient way of vectorizing features.
- machine learning
- bag of words
- feature vector
- natural language processing
- nlp
- bow
- document classification
- information retrieval
- sparse vector
- ml
- classifier
- regression
- hash
- md5
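As a rough illustration of the hashing trick described in the entry above, the following is a minimal sketch and not the listed package's actual API. The names `hashFeature` and `hashingVectorizer` and the default vector size of 16 are assumptions made for this example. MD5 is used only because it appears among the entry's keywords. Each token is hashed to a bucket index in a fixed-size vector, and one extra bit of the digest picks a sign so that collisions tend to cancel rather than accumulate.

```javascript
const crypto = require("crypto");

// Hypothetical helper: derive a bucket index and a sign from the MD5 digest of a token.
function hashFeature(token, size) {
  const digest = crypto.createHash("md5").update(token, "utf8").digest();
  // The first 4 bytes of the digest select the bucket.
  const index = digest.readUInt32BE(0) % size;
  // The lowest bit of the fifth byte selects the sign (+1 or -1).
  const sign = (digest[4] & 1) === 0 ? 1 : -1;
  return { index, sign };
}

// Hypothetical helper: turn a bag of words into a fixed-length feature vector.
function hashingVectorizer(tokens, size = 16) {
  const vector = new Array(size).fill(0);
  for (const token of tokens) {
    const { index, sign } = hashFeature(token, size);
    vector[index] += sign;
  }
  return vector;
}

const tokens = "the quick brown fox jumps over the lazy dog".split(" ");
const vec = hashingVectorizer(tokens, 16);
console.log(vec);
```

The vector length stays fixed no matter how large the vocabulary grows, which is where the space efficiency comes from. The trade-off is that distinct tokens can collide in the same bucket.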
Persian (Farsi) text preprocessing (normalization, numbers, punctuation, whitespace, stop words, ...)
- persian
- farsi
- text
- preprocess
- pre process
- normalize
- number
- punctuation
- whitespace
- white space
- stop word
- stopword
Tokenizer for Vietnamese in Node.js and JavaScript
A text description classifier for classifying arbitrary strings into provided labels
- naive
- bayes
- text
- classifier
- machine-learning
- nlp
- natural-language-processing
- bayesian
- text-classifier
- bayes-classifier
- naive-bayes-classification
- natural-language
A package for processing strings into HTML
Node.js text cleaner for data mining in JS
A zero-dependency package to parse various time expressions
Perform trim, grow, extract, scrub, deduplication, and structured splitting operations on lines of text in a chainable fashion. This ain't yo grand daddy's find & replace tool.
Splits text into tokens in as many languages as possible, using libicu
Build generic nested structures with a Teacup-like syntax
Services for Recurrent Text-related Tasks
Some classes to represent elements in a text corpus.
A dynamic text processing library.
Find Unicode TR29 word boundaries using libicu