crawljs

A basic Node.js crawler.

Note

Crawljs depends on jsdom, which in turn depends on contextify, a native Node.js extension. Because contextify is compiled at install time, you need a C++ compiler on your machine to install and run this crawler.

Usage

npm install -g crawljs
crawljs http://nodejs.org

To crawl only the first 500 URLs encountered, pass the limit as a second argument:

crawljs http://nodejs.org 500

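The crawler can also be used from a script:

// Adjust the path to wherever lib/Crawler lives relative to your script.
var Crawler = require("../lib/Crawler")
,   seed = "http://nodejs.org"   // URL where the crawl starts
,   limit = 500;                 // maximum number of URLs to crawl

var crawler = new Crawler(limit);
crawler.crawl(seed);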
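
To make the effect of the limit concrete, here is a minimal sketch of a limit-bounded, breadth-first crawl. It is not crawljs's actual implementation: `crawlWithLimit` and `fetchLinks` are hypothetical names, and the in-memory `graph` stands in for real HTTP fetching and HTML parsing so that the example runs without network access.

```javascript
// Hypothetical sketch, not crawljs internals: visit at most `limit` URLs,
// breadth-first, starting from `seed`.
// `fetchLinks(url)` is an assumed callback that returns the links found on a page.
function crawlWithLimit(seed, limit, fetchLinks) {
  var visited = [];                 // URLs crawled so far, in visit order
  var seen = Object.create(null);   // URLs already queued, to avoid repeats
  var queue = [seed];
  seen[seed] = true;

  while (queue.length > 0 && visited.length < limit) {
    var url = queue.shift();
    visited.push(url);

    var links = fetchLinks(url) || [];
    links.forEach(function (link) {
      if (!seen[link]) {
        seen[link] = true;
        queue.push(link);
      }
    });
  }
  return visited;
}

// Stand-in for real pages: a tiny in-memory link graph (assumed data).
var graph = {
  "http://example.com/":  ["http://example.com/a", "http://example.com/b"],
  "http://example.com/a": ["http://example.com/b", "http://example.com/c"],
  "http://example.com/b": ["http://example.com/"],
  "http://example.com/c": []
};

function fetchLinks(url) {
  return graph[url];
}

var result = crawlWithLimit("http://example.com/", 3, fetchLinks);
console.log(result);
```

With a limit of 3, the sketch stops after visiting the seed and its two direct links, even though a fourth page is reachable. The `seen` set keeps pages that link back to each other from being queued twice.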