Search results
20 packages found
Automatically extracts structured information from webpages
A fast language detection script for English.
Template engine plugin for dietjs based on underscorejs
Extract content from an HTML page including microdata, rdfa, html, meta tags, images and links.
Machine Learning based language detection module for Node with support for 57 languages, async/await and promises.
Promise based parser for robots.txt files.
Mature content filter using a keyword tokenizer and a banned word list.
Parse keywords from text using ngrams and stopwords.
Real-world load testing for HTTP and WebSocket based applications
Collect request data from a url. Includes status code, redirects, headers and content.
Yeoman generator for arcane development projects
SiteBot is an event driven website crawler.