Search results
161 packages found
A library to recursively retrieve and serialize Notion pages with customization for machine learning applications.
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
- automation
- bot
- bot-detection
- crawler
- crawling
- chromedriver
- webdriver
- headless
- headless-chrome
- stealth
- captcha
- scraping
- web-scraping
- cloudflare
A set of shared utilities that can be used by crawlers
Distributed web crawler powered by Headless Chrome
Gracefully handle timeouts and network errors with automatic retries.
- graceful
- retry
- retries
- error
- errors
- handling
- timeout
- ERR_NETWORK
- ERR_CONNECTION
- ERR_SOCKET
- page
- crashed
- goto
- playwright
An extremely fast web scraper that parses megabytes of HTML in a blink of an eye. No dependencies. PHP5+
- HTML
- XML
- XHTML
- web
- scraper
- scraping
- crawling
- parser
- invalid
- invalid-html
- broken-html
- selectors
- css-selectors
- jquery-selectors
Quick Scraper SDK NodeJS APIs
- quickscraper
- scraper
- web scrapers
- proxies
- CAPTCHAs
- headless browsers
- crawling
- web-crawling
- crawling-websites
Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It is designed to crawl websites and extract useful information like links, images, and text. It is lightweight, fast, and easy to use.
- web crawler
- web scraping
- data extraction
- SEO analysis
- command-line tool
- Node.js
- HTML reporting
- cross-platform
- configurable options
- plugin system
- open-source
- crawling
- crawler
- scraper
Fast asynchronous NodeJS module for crawling/scraping the web through worker_threads.
Crawler for single-page applications
Build web scraping agents using AI to auto-extract the data from websites
Dependency free module for scraping and crawling websites using [Crawlbase](https://crawlbase.com) API
- scraping
- crawling
- scraper
- scrape
- crawler
- crawlbase
- scraping-websites
- scraping-framework
- crawlbase-api
- leads
- leads-api
Real transparent HTTP-Proxy-Server. Upstream your requests whatever you want!
- proxy
- tunnel
- ssl
- http-proxy
- mitm
- pinning
- proxy-authentication
- transparent
- upstream
- server
- squid
- privoxy
- tcp
- intercept
Sasori is a dynamic web crawler powered by Puppeteer, designed for lightning-fast endpoint discovery.
A JavaScript library to easily use SpeedyShot's capture service
Transform your text with dynamic typing animations! crawling-typer lets you display an array of strings one at a time, each with its own color. Customize typing speed, delete speed, and pauses between strings. Enjoy full control with loop counts, post-loo
A simple crawler made in JavaScript for Node.
A tool to get sitemaps from websites and crawl them
Easily create a scraper API with the @web/scrapper library, which includes a scraper and advanced events for your website.
- Scraper
- Scrape
- Web scrape
- Scraper library
- Scraper API
- Scraper REST API
- RESTful API
- Json API
- Node
- NodeJS
- NodeJS Scraper
- Nodejs Scraper
- Nodejs Scraper library
- NodeJS Scraper library