Search results

61 packages found

Fast dom parser based on regexps

published 1.1.5 7 months ago
M
Q
P

A web page content extractor

published 3.2.0 6 years ago
M
Q
P

A fast and minimalistic HTML/XML DOM parser with CSS selectors

published 2.1.5 8 months ago
M
Q
P

A web page content extractor

published 3.3.4 4 years ago
M
Q
P

Use Hext in a browser or with node

published 1.0.9 2 months ago
M
Q
P

Provides the HtmlRunner class for scrapping.

published 1.0.8 5 years ago
M
Q
P

A web page content extractor

published 1.1.2 7 years ago
M
Q
P

Test whether an href string is absolute, relative, protocol-relative, #fragment, mailto:, tel:, sms:, etc

published 1.0.1 7 years ago
M
Q
P

A web page content extractor

published 3.3.1 4 years ago
M
Q
P

A web page content extractor

published 3.2.3 3 years ago
M
Q
P

A web page content extractor

published 3.2.2 5 years ago
M
Q
P

Domain-specific language for extracting structured data from HTML

published 11.0.9 2 months ago
M
Q
P

Web scraping library for nodejs

published 0.4.1 5 years ago
M
Q
P

A scraping package that allows retrieving emails out of obfuscated html

published 1.0.12 4 years ago
M
Q
P

A library for converting HTML and XML into JSON

published 1.0.5 9 months ago
M
Q
P

JavaScript implementation of Lusail, a domain-specific language for extracting structured data from HTML

published 0.8.1 a year ago
M
Q
P

Harvesting data at the <html> mine.

published 1.1.1 4 years ago
M
Q
P

Access web pages programmatically with PhantomJS, for running tests or scraping information

published 3.1.0 7 years ago
M
Q
P

A web page content extractor based on https://github.com/ageitgey/node-unfluff, but ready for browserify

published 1.3.2 8 years ago
M
Q
P

A simple way to structure your web scraper.

published 1.0.1 3 years ago
M
Q
P