30 packages found

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 3.2.10 4 months ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 3.2.9 a year ago
M
Q
P

Lightweight and easy to use crawling solution for websites.

published 1.2.1 a year ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 3.2.6 a year ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 3.2.5 a year ago
M
Q
P

Scrape data from any webpage.

published 2.4.7 4 months ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 3.2.8 7 months ago
M
Q
P

Url scraper which takes the text input and finds the links/urls, scraps them using cheerio and will returns an object with original text, parsed text (using npm-text-parser) and array of objects where each object contains scraped webpage's information.

published 1.0.2 7 years ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 3.0.2 5 years ago
M
Q
P

A simple agent for performing a sequence of http requests in node.js

published 0.1.2 12 years ago
M
Q
P

This is a function that accepts 3 arguments, "url", "tag" and "output", and writes to a file, in the "output" path, the content of an html "tag", relative to a specific "url".

published 1.0.1 a year ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers. (Extended version by mastixmc)

published 3.2.0 4 years ago
M
Q
P

Webcrawler script to retrieve the daily menu of the Bern University of Applied Sciences cantina in Biel

published 1.0.3 4 years ago
M
Q
P

Download README files from GitHub repository links

published 1.0.5 2 years ago
M
Q
P

Web Crawler to create directed graph of links among connected sites. Runs with Node.js and stores data with Redis

published 1.0.0 2 years ago
M
Q
P

Simple framework for crawling/scraping web sites. The result is a tree, where each node is a single request.

published 1.1.4 6 years ago
M
Q
P

A friendly javascript pre-rendering engine - BETA (UNSTABLE)

published 1.2.3 6 years ago
M
Q
P

Crawls through provided website, checking for 200 response, content load, ssl cert errors, and more!

published 1.1.8 3 years ago
M
Q
P

Parser for XML Sitemaps to be used with Robots.txt and web crawlers

published 2.1.19 2 years ago
M
Q
P

A simple webcrawler that prints out the URLs of the pages it encounters. Runs in parallel, up to a limit you specify.

published 0.1.4 6 years ago
M
Q
P