Search results

26 packages found

Feature hashing, also known as the hashing trick, is a fast and space-efficient way of vectorizing features.

published 1.0.0 9 years ago

A CJK text tokenizer

published 0.1.0 8 years ago

Count N-Grams in a String or Array

published 0.0.0 4 years ago

A language detection library implemented in plain Node.js (aliases: language identification, language guessing). Ported from the Java langdetect project (https://code.google.com/p/language-detection/).

published 0.0.3 10 years ago

Simple near-duplicate detection using the shingling method

published 0.1.15 10 years ago