32 packages found

Corpus representaion stored in JSON and wrapped into Corpus CRUD API

published 1.0.2 9 years ago
M
Q
P

Corpus CRUD API wrapper

published 2.0.3 8 years ago
M
Q
P

日本語で書かれた技術書のコーパス

published 3.0.0 7 months ago
M
Q
P
M
Q
P

The text of Moby Dick by Herman Melville.

published 0.0.8 a year ago
M
Q
P

State of the Union addresses by U.S. Presidents.

published 0.0.8 a year ago
M
Q
P

Corpus CRUD API wrapper

published 1.0.0 9 years ago
M
Q
P

Text corpora from Project Gutenburg used by NLTK.

published 1.0.1 8 years ago
M
Q
P

translate languages using a statistical model

published 0.8.3 3 years ago
M
Q
P

A package that finds the frequency of a word per million words, using Chapter 1, List 1.2 from https://ucrel.lancs.ac.uk/bncfreq/flists.html as it's source of word frequency data.

published 4.4.0 3 months ago
M
Q
P

Text mining library

published 1.1.2 7 years ago
M
Q
P

A dashboard to visualize a synthesis on a structured corpus, using several charts (pie, histogram, ...)

published 6.8.5 8 years ago
M
Q
P

A Node.js library for concordancing a corpus formatted according to the Data Format for Digital Linguistis (DaFoDiL)

published 0.4.0 4 years ago
M
Q
P

Calculate how many documents contain a certain term, within a list (`Array`) of text documents.

published 0.0.1 9 years ago
M
Q
P

A CJK text tokenizer

published 0.1.0 7 years ago
M
Q
P

List of ~636,000 Spanish words

published 2.0.0 3 years ago
M
Q
P

A JavaScript (Node.js) library that converts a tagged (monolinear) text to DLx JSON format

published 0.4.0 4 years ago
M
Q
P

Merge multiple sentiment libraries for better sentiment analysis

published 1.0.6 8 years ago
M
Q
P

List of ~336,000 French words

published 2.0.0 3 years ago
M
Q
P

A wrapper for CETEMPúblico, an European Portuguese corpus of news extracts from the newspaper Público, with 180 million words tagged automatically using PALAVRAS.

published 1.4.0 3 years ago
M
Q
P