wet

Grabbing metadata from webpages

For lagancy version without phantomjs see branch 0.0.x

New Branch 0.1.x with phantomjs & node-phantom-simple

Currently it gets slower (WIP), because:

Page rendered in phantomjs
Browser has to be created
Extra 5s delay is given to load

# Use in terminal
$ npm i -g wet
$ wet https://www.baidu.com  # okay
$ wet https://baidu.com      # also
$ wet http://baidu.com       # also
$ wet //baidu.com            # also
$ wet baidu.com              # also

// Use in node.js
var wet = require('wet')
wet('www.baidu.com', function(err, meta){
  console.log(meta)
})

// The output
{ url: 'http://baidu.com',
  finalUrl: 'https://www.baidu.com/',
  title: '百度一下，你就知道',
  description: '把百度设为主页把百度设为主页关于百度About  Baidu©2015 Baidu 
使用百度前必读 意见反馈 京ICP证030173号 ',
  image: 'http://www.baidu.com/img/bd_logo1.png' }

Todo

More accurate image detected
More accurate title/description detected
Returned html is not completely correct?
Make phantomjs server isolated & public
Render page via phantomjs

wet

wet

New Branch 0.1.x with phantomjs & node-phantom-simple

Todo

Readme

Keywords

Package Sidebar

Install

Repository

Homepage

Weekly Downloads

Version

License

Last publish

Collaborators

wet

wet

New Branch 0.1.x with phantomjs & node-phantom-simple

Todo

Readme

Keywords

Package Sidebar

Install

Repository

Homepage

DownloadsWeekly Downloads

Version

License

Last publish

Collaborators

Weekly Downloads