201 packages found
Harvesting data at the <html> mine.
Generic utility functions (Mixin Lodash)
- utils
- util
- random
- md5
- sha256
- hash
- crawl
- random datetime
- isEmpty
- isNullOrWhiteSpace
- isNullOrUndefined
- toKb
- toHHMMSS
- parseLinks
Extract values out of deep structures using a safe and simple extraction language.
Website extensions for the Sajari API. Automatically index site content, add user profiles, render search and recommendations, etc.
Behaviour Assertion Sheets: CSS-like declarative syntax for client-side integration testing and quality assurance.
Crawl Udemy course info and save it in JSON format.
Crawl apps from Google Play & iTunes
A node roguelike
The SeoCheck module is built to check whether there are any irregularities in your HTML file. You can customise your own set of rules to check that your end requirements are met.
Crawl GitHub issues to build a dependency graph.
A node.js module to crawl product reviews from Amazon.
Simple, configurable and extensible webcrawler
Asynchronous options for crawling file system
Scrape the web asynchronously and live, reusing Node.js, all in one file, with a few lines!
- scrap
- scraping
- web
- web-scraping
- webscraping
- electron
- async
- asynchronous
- live
- browser
- automation
- web2os
- harvest
- crawl
Utils for web resources. Get a web page and save to disk (with minimal dependencies)