Search results

29 packages found

General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.

published version 8.1.0, 23 days ago613 dependents licensed under $MIT
847,563

Calculate the similarity between posts by tf-idf and Cosine similarity.

published version 2.0.3, 3 years ago0 dependents licensed under $MIT
2,859

A simple term frequency lib

published version 0.0.15, 9 years ago2 dependents licensed under $MIT
758

Perform full-text search operations across multiple documents with ease, designed for both browser and Node.js

published version 1.0.0, a year ago1 dependents licensed under $MIT
322

Function that takes a Term-Document map; computes IDFs.

published version 0.1.2, 6 years ago0 dependents licensed under $MIT
299

Computes BM25 Vectorization of Text.

published version 0.1.1, 6 years ago0 dependents licensed under $MIT
329

Minor modifications to the original `natural` node package: General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levens

published version 0.6.3, 6 years ago0 dependents licensed under $MIT
151

Full text search engine in js. Features BM25 ranking function that can be tuned.

published version 1.7.2, 6 years ago0 dependents licensed under $MIT
61

An implementation of the tf-idf vector space model for keyword search

published version 1.1.3, 5 years ago0 dependents licensed under $MIT
74

A lightweight, zero-dependency NLP summarization package built with TypeScript. Inspired by traditional NLP techniques like those in Python's NLTK — but without any API or AI calls.

published version 1.0.0, a month ago0 dependents licensed under $MIT
73

A simple tf-idf implementation for text documents

published version 1.0.0, 11 years ago0 dependents licensed under $ISC
54

Text tokenization, transformation & analysis transducers, utilities, stop words, porter stemming, vector encodings, similarities

published version 0.2.0, 4 hours ago0 dependents licensed under $Apache-2.0
56

A package to extract important keywords from a document using TF-IDF technique

published version 1.1.1, 6 years ago0 dependents licensed under $MIT
24

General natural language (tokenizing, stemming (English, Russian, Spanish), classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.

published version 0.2.1, 9 years ago0 dependents licensed under $MIT
21

A small, fast, local-first, searchable index for client side apps written in Typescript.

published version 1.2.0, a year ago0 dependents licensed under $ISC
22

Full text search engine in js that features tunable BM25 ranking function.

published version 1.0.0, a year ago0 dependents licensed under $MIT
19

For private use.

published version 0.2.2, 11 years ago0 dependents
16

A TFIDF analysis package that allows for tokens of any word length

published version 0.2.1, 11 years ago0 dependents licensed under $ISC
17

General natural language (tokenizing, stemming (English, Russian, Spanish), classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.

published version 0.6.0, 8 years ago0 dependents licensed under $MIT
9

measure the salience / importance of words in a text document -- based on the frequency of the words in the document, versus their frequency in English

published version 0.0.12, 3 years ago0 dependents licensed under $MIT
8