Search results
31 packages found
Corpus representaion stored in JSON and wrapped into Corpus CRUD API
translate languages using a statistical model
Spam Assassin public mail corpus.
The text of Moby Dick by Herman Melville.
State of the Union addresses by U.S. Presidents.
- stdlib
- datasets
- dataset
- data
- speeches
- politics
- usa
- us
- president
- sotu
- state of the union
- addresses
- text
- corpus
- View more
日本語で書かれた技術書のコーパス
State of the Union addresses by U.S. Presidents.
- stdlib
- datasets
- dataset
- data
- speeches
- politics
- usa
- us
- president
- sotu
- state of the union
- addresses
- text
- corpus
- View more
Spam Assassin public mail corpus.
The text of Moby Dick by Herman Melville.
Text corpora from Project Gutenburg used by NLTK.
Calculate how many documents contain a certain term, within a list (`Array`) of text documents.
A JavaScript (Node.js) library that converts a tagged (monolinear) text to DLx JSON format
A CJK text tokenizer
List of ~636,000 Spanish words
List of ~336,000 French words
A Node.js library for concordancing a corpus formatted according to the Data Format for Digital Linguistis (DaFoDiL)
A dashboard to visualize a synthesis on a structured corpus, using several charts (pie, histogram, ...)