crawler-html-3t

1.0.10 • Public • Published

Crawler Html 3T

Đây là thư viện dùng để bóc tách dữ liệu html

Installation

npm install crawler-html-3t --save

Usage

Class ModelMongoose

  1. mod_sources
  • name_index
  • SourcesNews
  • Articles
  1. mod_baogom
  • name_index
  • mod_acticles
  • mod_links
  • mod_categories

Class HtmlParser

  1. GetHtmlDoc
  • body: html
  • $: jquery
  GetHtmlDoc(url,function(error, body, $));
 

Class HtmlExtract

  1. getTitle
 
 var title =  getTitle($);
 
  1. getDesc
 
 var description =  getDesc($);
 
  1. getImage
 
 var url_image =  getImage($);
 

Class ReadRss

  1. getListFeed
 
getListFeed(url_rss,function(error,list_feed));
 
  1. getListFeedByBodyXml
 
getListFeedByBodyXml(bodyXml,function(error,list_feed));
 

Package Sidebar

Install

npm i crawler-html-3t

Weekly Downloads

1

Version

1.0.10

License

none

Last publish

Collaborators

  • trungdang