@democracy-deutschland/scapacra
TypeScript icon, indicating that this package has built-in type declarations

1.0.6 • Public • Published

scapacra

Introduction

Scapactra (scraper, parser and crawler) is a framework to extract data from different data sources. The idea for scapactra bases on the ETL (extract, transform and load) process (ETL) and defines an modular design pattern providing a basic ETL workflow.

The framework is structured into three basic modules.

  1. Parser: The parser extracts the data from a defined document.
  2. Browser: The browser navigates through a structure and retrieves the desired fragments for the parser.
  3. Scraper: A scraper executes the browsers an parsers and providing their results over an centralized interface.

Parser

Parser

Browser

Browser

Scraper

Scraper

Readme

Keywords

none

Package Sidebar

Install

npm i @democracy-deutschland/scapacra

Weekly Downloads

1

Version

1.0.6

License

Apache-2.0

Unpacked Size

655 kB

Total Files

50

Last publish

Collaborators

  • mananruck
  • democracy-live