Crawl the web breadth-first from a seed URL, statefully
Walk a website checking for bad links
A basic Node.js crawler.
Build an npm module from a file
Small-scale webpage archiver
Web crawler driven by JSON configurations that define which data fields to scrape from visited websites (using regular expressions or DOM selectors) and how to export them as JSON
A server-side scraping web browser