cebuano-stemmer
TypeScript icon, indicating that this package has built-in type declarations

0.1.2 • Public • Published

Cebuano dictionary and stemmer

Build Status npm version License

Live demo in browser

A dictionary and stemmer for the Cebuano language spoken in the Philippines.

Written in TypeScript, compiled to ES5 CommonJS module (for use in Node.js) and a single-file ES5 UMD module (for use in the browser).

Based on J. Hellingman's Cebuano stemmer for Java and dictionary for Android, which is in turn based on J. Wolff's seminal work A Dictionary of Cebuano Visayan.

Dictionary

The dictionary data is converted from a SQLite database to a publicly-readable CouchDB instance hosted at https://publicdomainreview.cloudant.com/cebuano_dictionary.

The primary index for an entry is the normalized root form of the Cebuano word plus an identification number.

You can use the regular CouchDB API to query. For instance, to get the first 20 entries starting with "k":

https://publicdomainreview.cloudant.com/cebuano_dictionary/_all_docs?include_docs=true&limit=20&startkey="k"

Search

The database provides full-text search for English and Cebuano. You can query using the Lucene Query Parser Syntax.

Examples:

description url
English full text search through entries for "fire" https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromEnglish?q=fire
English synonym search for "fire" https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromEnglish?q=synonym:fire
Cebuano full text search through entries for "abat" https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromCebuano?q=abat
Cebuano synonym search for "abat" https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromCebuano?q=synonym:abat

(for more information, add the query parameter include_docs=true)

Stemmer

A JSON list of word roots used in the stemmer is available at https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/show/_list/keyset/normalized_heads_where_pos_is_not_empty_string?group_level=1

A JSON list of all heads is available at https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/show/_list/keyset/heads?group_level=1

Package Sidebar

Install

npm i cebuano-stemmer

Weekly Downloads

3

Version

0.1.2

License

GPL-3.0

Last publish

Collaborators

  • digitalheir