Search results

25 packages found

A module for node.js and the browser that takes in text and returns text that is stripped of stopwords. Has pre-defined stopword lists for 62 languages and also takes lists with custom stopwords as input.

published version 3.1.4, 3 months ago · 127 dependents · licensed under MIT
384,467
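
The behaviour described for this module (text in, text without stopwords out, with an optional custom list) can be illustrated with a minimal sketch. This is not the package's API: the function name `remove_stopwords`, the tiny `ENGLISH_STOPWORDS` set, and the word-based tokenization are all assumptions made for illustration.

```python
import re

# Hypothetical, deliberately tiny stopword list; real lists are much longer.
ENGLISH_STOPWORDS = {"a", "an", "and", "is", "of", "the", "to"}


def remove_stopwords(text, stopwords=ENGLISH_STOPWORDS):
    """Return the text with stopwords removed (case-insensitive match)."""
    # Split into word tokens; punctuation is dropped by this simple pattern.
    tokens = re.findall(r"\w+", text)
    kept = [token for token in tokens if token.lower() not in stopwords]
    return " ".join(kept)
```

A custom list is passed as the second argument, for example `remove_stopwords("le chat et le chien", {"le", "et"})`.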

novel-segment segment data

published version 2.3.210, 4 months ago · 2 dependents · licensed under ISC
50,991
published version 1.0.24, 4 months ago · 1 dependent · licensed under ISC
5,513

Chinese word segmentation module for Simplified and Traditional Chinese, based on web novel samples

published version 2.7.121, 4 months ago · 9 dependents · licensed under ISC
7,776
published version 1.0.23, a year ago · 4 dependents · licensed under ISC
5,598
published version 1.0.19, a year ago · 1 dependent · licensed under ISC
5,558
published version 1.0.19, a year ago · 1 dependent · licensed under ISC
5,582
published version 1.0.13, a year ago · 7 dependents · licensed under ISC
5,512
published version 1.0.17, a year ago · 2 dependents · licensed under ISC
5,482
published version 1.0.19, a year ago · 1 dependent · licensed under ISC
5,373

Format of the original node-segment

published version 1.0.22, a year ago · 2 dependents · licensed under ISC
5,581
published version 1.0.22, a year ago · 1 dependent · licensed under ISC
5,566

A Node module that exposes the NLTK stopword corpora and provides utility functions for removing stopwords

published version 1.0.3, 8 years ago · 0 dependents · licensed under MIT
424

Persian (Farsi) text preprocessing (normalization, numbers, punctuation, whitespace, stopwords & ...)

published version 1.0.0, 6 years ago · 0 dependents · licensed under MIT
58

A module for creating stopword lists for any language, based on a set of documents.

published version 1.1.1, 3 years ago · 0 dependents · licensed under MIT
44
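
One common way to derive a stopword list from a set of documents is to keep the words that occur in most of them. The sketch below shows that idea only; it is not taken from this package, and the function name `build_stopword_list` and the `min_doc_ratio` parameter are hypothetical.

```python
import re
from collections import Counter


def build_stopword_list(documents, min_doc_ratio=0.8):
    """Return words that appear in at least min_doc_ratio of the documents."""
    doc_freq = Counter()
    for doc in documents:
        # Count each word once per document (document frequency).
        doc_freq.update(set(re.findall(r"\w+", doc.lower())))
    threshold = min_doc_ratio * len(documents)
    return sorted(word for word, n in doc_freq.items() if n >= threshold)
```

Lowering `min_doc_ratio` produces a longer, more aggressive list; a sensible value depends on the corpus.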
published version 1.0.29, 4 months ago · 0 dependents · licensed under ISC
30
published version 1.0.25, 4 months ago · 0 dependents · licensed under ISC
30
published version 1.0.26, a year ago · 2 dependents · licensed under ISC
28
published version 1.0.27, 4 months ago · 0 dependents · licensed under ISC
26
published version 1.0.12, a year ago · 0 dependents · licensed under ISC
16