Search results

30 packages found

Spam Assassin public mail corpus.

published version 0.2.2, 10 months ago2 dependents licensed under $Apache-2.0
464

List of ~636,000 Spanish words

published version 2.0.0, 5 years ago4 dependents licensed under $MIT
190

The text of Moby Dick by Herman Melville.

published version 0.2.2, 10 months ago1 dependents licensed under $Apache-2.0
170

List of ~336,000 French words

published version 2.0.0, 5 years ago2 dependents licensed under $MIT
127

A Standard Corpus of Present-Day Edited American English, for use with Digital Computers.

published version 1.9.80, 7 years ago0 dependents licensed under $MIT
124

State of the Union addresses by U.S. Presidents.

published version 0.2.2, 10 months ago1 dependents licensed under $Apache-2.0
95

A wrapper for CETEMPúblico, an European Portuguese corpus of news extracts from the newspaper Público, with 180 million words tagged automatically using PALAVRAS.

published version 1.4.0, 5 years ago0 dependents licensed under $ISC
82

A CJK text tokenizer

published version 0.1.0, 9 years ago0 dependents licensed under $MIT
61

日本語で書かれた技術書のコーパス

published version 3.0.0, 2 years ago0 dependents licensed under $MIT
52

Text mining library

published version 1.1.2, 9 years ago0 dependents licensed under $MIT
35

translate languages using a statistical model

published version 0.8.3, 5 years ago0 dependents licensed under $MIT
39

A core type to handle CoNLL-U format

published version 0.1.5, 2 years ago0 dependents licensed under $MIT
34

Corpus CRUD API wrapper

published version 2.0.3, 10 years ago2 dependents licensed under $MIT
31

State of the Union addresses by U.S. Presidents.

published version 0.2.2, 10 months ago0 dependents licensed under $Apache-2.0
30

Corpus representaion stored in JSON and wrapped into Corpus CRUD API

published version 1.0.2, 11 years ago0 dependents licensed under $MIT
25

Merge multiple sentiment libraries for better sentiment analysis

published version 1.0.6, 10 years ago0 dependents licensed under $MIT
29

Text corpus calculation in Javascript.

published version 0.1.0-dev, 13 years ago0 dependents
26

A node.js module for generating usernames based on a specified corpus.

published version 0.1.0, 13 years ago0 dependents
22

A Node.js library for concordancing a corpus formatted according to the Data Format for Digital Linguistis (DaFoDiL)

published version 0.4.0, 6 years ago0 dependents licensed under $MIT
20

Transform a directory of conll files (treebank) into a directory of svg files.

published version 0.1.2, 8 years ago0 dependents licensed under $AGPL-3.0
20