Search results
126 packages found
The fast, flexible & elegant library for parsing and manipulating HTML and XML.
A library to easily scrape metadata from an article on the web using Open Graph, JSON+LD, regular HTML metadata, and a series of fallbacks.
Request a url and scrape the metadata from its HTML using Node.js or the browser.
A customisable data scraper for the web based on JSON contracts
An extremely fast web scraper that parses megabytes of HTML in the blink of an eye. No dependencies. PHP5+
- HTML
- XML
- XHTML
- web
- scraper
- scraping
- crawling
- parser
- invalid
- invalid-html
- broken-html
- selectors
- css-selectors
- jquery-selectors
Tiny, fast, and elegant implementation of core jQuery designed specifically for the server
Parses the HTML and collects the requested data.
Download website to a local directory (including all css, images, js, etc.)
HTML parser for Quizlet flashcard decks
Fork of the fast, flexible & elegant library for parsing and manipulating HTML and XML.
Website scraper
Request a url and scrape the metadata from its HTML using Node.js or the browser.
An `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, oEmbed, Twitter cards and AppLinks.
SyphonX is a tool that extracts data from HTML, transforming it into JSON of any shape or size. It combines the power of CSS selectors and jQuery, regular expressions, and JavaScript into a declarative template format to elegantly solve the simplest
- automation
- cheerio
- crawl
- crawler
- crawling
- chrome
- dom
- headless
- html
- html2json
- jquery
- parse
- parser
- puppeteer
A simple npm package that scrapes m3u8 links from HTML content fetched from a given URL.
Plugin for website-scraper which returns HTML for dynamic websites using Puppeteer
A super fast HTML parser and manipulator written in Rust.
Functional web scraping in TypeScript
The og-easy package provides a simple and easy-to-use API for getting Open Graph data. It is a Node.js package for retrieving Open Graph metadata from a given URL. If no data is found, it will parse the HTML to find a proper, intuitive replacement. The pac