Search results

13 packages found

Spam Assassin public mail corpus.

published 0.2.2 2 months ago

An npm package with TypeScript support that calculates various statistics from text to determine the readability, complexity, and grade level of a given corpus.

published 1.0.11 7 months ago

The text of Moby Dick by Herman Melville.

published 0.2.2 2 months ago

State of the Union addresses by U.S. Presidents.

published 0.2.2 2 months ago

Spam Assassin public mail corpus.

published 0.2.2 2 months ago

The text of Moby Dick by Herman Melville.

published 0.2.2 2 months ago

State of the Union addresses by U.S. Presidents.

published 0.2.2 2 months ago

Break down a corpus of text into words/tokens.

published 1.1.1 8 years ago

Text mining library

published 1.1.2 8 years ago

Translate languages using a statistical model.

published 0.8.3 5 years ago

Feature hashing, also known as the hashing trick, a fast and space-efficient way of vectorizing features.

published 1.0.0 9 years ago

Some classes to represent elements in a text corpus.

published 0.0.2 4 years ago

A CJK text tokenizer

published 0.1.0 8 years ago