This module extract the content of a HTML document.
Install
To install this module with npm, please run the following command:
npm install --save psy-extract
Usage
const extractor = require('psy-extract'); const extract = extractor(HTML);
Result
{ title: <String> The document title from the <title> tag. lang:<String> The document language (if present in a meta tag) variant:<String> The document language variant (if present in a meta tag) text:<String> The extracted document content. <p> and <h[n]> tags are replaced by two linebreak characters, the <br> tag is replaced by a single linebreak character. }
© 2019 by PSYfiers AG
This program is the property of PSYfiers AG,
Sumpfstrasse 5
CH-6312 Steinhausen
All Rights Reserved