keywords:Crawling - npm search

notion-md-crawler

A library to recursively retrieve and serialize Notion pages with customization for machine learning applications.

tompenguin

published 1.0.0 7 months ago

M

Q

P

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.

nwebson

published 1.0.9 4 days ago

M

Q

P

@crawlee/utils

A set of shared utilities that can be used by crawlers

apify-service-account

published 3.11.3 18 days ago

M

Q

P

@popstas/headless-chrome-crawler

Distributed web crawler powered by Headless Chrome

popstas

published 1.8.4 6 months ago

M

Q

P

graceful-playwright

Gracefully handle timeout and network error with auto retry.

beenotung

published 1.2.0 2 months ago

M

Q

P

hquery.php

An extremely fast web scraper that parses megabytes of HTML in a blink of an eye. No dependencies. PHP5+

duzun

published 3.2.0 2 months ago

M

Q

P

quickscraper-sdk

Quick Scraper SDK NodeJS APIs

quickscraper

published 2.0.8 6 months ago

M

Q

P

crawlyx

Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It is designed to crawl websites and extract useful information like links, images, and text. It is lightweight, fast, and easy to use.

ritikchoure

published 2.2.3 a year ago

M

Q

P

@abilashinamdar/node-crawler

Fast asynchronous NodeJS module for crawling/scraping a web through worker_threads.

abilash_inamdar

published 1.0.3 a year ago

M

Q

P

htcrawl

crawler for single page applications

fcavallarin

published 1.2.1 9 months ago

M

Q

P

scrapingai

Build web scraping agents using AI to auto-extract the data from websites

vickyrathee

published 1.0.1 a year ago

M

Q

P

crawlbase

Dependency free module for scraping and crawling websites using [Crawlbase](https://crawlbase.com) API

crawlbase

published 1.0.2 3 months ago

M

Q

P

transparent-proxy

Real transparent HTTP-Proxy-Server. Upstream your requests whatever you want!

gr3p

published 1.15.3 4 months ago

M

Q

P

sasori-crawl

Sasori is a dynamic web crawler powered by Puppeteer, designed for lightning-fast endpoint discovery.

5up3r541y4n

published 1.0.0 2 months ago

M

Q

P

twilight

Twitter API tools

chbrown

published 1.0.5 2 months ago

M

Q

P

@speedyshot/capture

A JavaScript libary to easily use SpeedyShot's capture service

kevinvr

published 1.2.0 5 months ago

M

Q

P

crawling-typer

Transform your text with dynamic typing animations! crawling-typer lets you display an array of strings one at a time, each with its own color. Customize typing speed, delete speed, and pauses between strings. Enjoy full control with loop counts, post-loo