Search results
7 packages found
simple polite crawling of the web.
published 5.1.2 8 years ago
M
Q
P
A web crawler for Nodejs.
published 0.8.2 10 years ago
M
Q
P
SyphonX is a tool that extracts data from HTML data, transforming it into JSON of any shape or size. It combines the power of CSS Selectors and jQuery, Regular Expressions, and Javascript into a declarative template format to elegantly solve the simplest
- automation
- cheerio
- crawl
- crawler
- crawling
- chrome
- dom
- headless
- html
- html2json
- jquery
- parse
- parser
- puppeteer
- View more
published 1.2.66 3 months ago
M
Q
P
Streaming pdf fetcher for academic papers.
- papers
- pdfs
- academic articles
- academic papers
- scholarly articles
- scholarly papers
- journals
- scraping
- spidering
- crawling
published 0.0.3 11 years ago
M
Q
P
plosone.org scraper
- papermonk
- plos
- plos one
- plosone.org
- public library of science
- papers
- pdfs
- academic articles
- academic papers
- scholarly articles
- scholarly papers
- journals
- scraping
- View more
published 0.0.7 11 years ago
M
Q
P
A 2nd generation spider to crawl any article site, automatic reading title and content.
published 0.0.7 8 years ago
M
Q
P
Scalable, extensible, web crawler framework.
published 0.0.0 11 years ago
M
Q
P