scrape-cache

Scrape web pages, storing pages locally to lessen repetitive network requests.

Uses cheerio for scraping.

Installation

git clone https://github.com/kevinschaul/scrape-cache
npm install

Usage

scrape-cache exposes one method: scrape(url, scraper, callback). Its parameters:

url String

The URL to scrape.
scraper($) Function

A function that scrapes the HTML and returns data that will be passed to callback.

The parameter $ is a cheerio jQuery-like object with the HTML already loaded.
callback(result) Function

A function that does something with result.

Full usage example

To scrape the contents of an H1:

var scrapeCache = require('scrape-cache');

var url = 'https://github.com/';

var scrapeH1 = function($) {
    return $('h1').text();
};

scrapeCache.scrape(url, scraper, function(result) {
    console.log(result);
});

scrape-cache

scrape-cache

Installation

Usage

Full usage example

Readme

Keywords

Package Sidebar

Install

Weekly Downloads

Version

License

Unpacked Size

Total Files

Last publish

Collaborators

scrape-cache

scrape-cache

Installation

Usage

Full usage example

Readme

Keywords

Package Sidebar

Install

DownloadsWeekly Downloads

Version

License

Unpacked Size

Total Files

Last publish

Collaborators

Weekly Downloads