Search results

52 packages found

A web scraper for NodeJs

published version 6.1.3, 2 years ago4 dependents licensed under $ISC
10,533

Web crawler for Node.js

published version 0.3.21, 7 years ago9 dependents licensed under $MIT
5,679

Node tùy chỉnh cho n8n để cào dữ liệu từ trang web, trích xuất nội dung và hình ảnh

published version 1.6.2, a month ago0 dependents licensed under $MIT
1,289

gRPC tokio based web crawler

published version 0.9.9, 8 months ago1 dependents licensed under $MIT
840

A powerful web crawler that extracts content from web pages and converts them to clean Markdown format, with support for code blocks and GitHub Flavored Markdown

published version 1.0.18, 14 days ago0 dependents licensed under $MIT
591

Priority based Semantic Web Crawler.

published version 0.0.2, 7 years ago0 dependents licensed under $MIT
366

Web Crawler MCP

published version 1.0.3, a month ago0 dependents licensed under $MIT
306

Simple node worker that crawls sitemaps in order to keep an algolia index up-to-date.

published version 3.2.3, 5 years ago0 dependents licensed under $MIT
246

Model Context Protocol (MCP) server for Firecrawl Simple - provides web scraping and crawling capabilities to LLMs

published version 1.0.2, 15 days ago0 dependents licensed under $MIT
211

Run headless Chrome (aka Puppeteer) as a service, for web crawling, remote controlling and so on.

published version 0.4.8, 7 years ago1 dependents licensed under $MIT
134

CacheServer is an efficient web page extractor that uses Puppeteer to launch a headless browser and fetch web page content.

published version 2.0.8, a year ago0 dependents licensed under $MIT
149

books.com.tw crawler

published version 1.2.4, 12 days ago0 dependents licensed under $MIT
102

A simple web scraper that can scrape product details from various e-commerce platforms.

published version 1.0.10, 4 months ago0 dependents
115

Crawl website by json

published version 0.8.3, 3 years ago0 dependents licensed under $MIT
108

Unofficial eslite-com API (Plus Long Introduction)

published version 1.1.8, 5 years ago0 dependents licensed under $MIT
75

A tool for extracting structured content from web pages with customizable selectors and crawling options

published version 0.0.25, 2 months ago0 dependents licensed under $MIT
75

基于Node.js的网络爬虫

published version 0.0.25, 5 years ago0 dependents licensed under $MIT
70

A simple and fully customizable web crawler/spider for Node.js with server-side DOM. Comes with elegant and hell-simple APIs.

published version 1.5.0, 7 years ago0 dependents licensed under $MIT
63

Simple and Easy Crawler library for Node.js

published version 1.0.1, 8 years ago1 dependents
51

Web scraping/crawling framework built on top of headless Chrome

published version 1.4.0, 2 years ago0 dependents licensed under $MIT
48