nty-reader

1.0.1 • Public • Published

nyt-reader

Scrape a NYT article from nytimes.com

Install

npm install nyt-reader --save

Use

   var NYTReader = require('nyt-reader');
   var nytreader = new NYTReader();

   // Promise
   nytreader.read('http://atwar.blogs.nytimes.com/2015/10/23/the-specter-of-addiction-and-suicide-among-veterans').then(function(article) {
      // Do Something with Article
   });

   // Callback
   nytreader.read('http://atwar.blogs.nytimes.com/2015/10/23/the-specter-of-addiction-and-suicide-among-veterans', function(article) {
      // Do Something with Article
   });

Article

var Article = {
   title: '',
   datetime: '',
   body: {
      clean: '',
      minimal: ''
   },
   images: [
      {
         full: ''
      }
   ],
   source: ''
};

title The title of the Article. What appears in the h1 on the page.

datetime The datetime with timezone of the last update of the article. Format: YY-mm-dd H:i:s GMT. The datetime will always be GMT+0000.

body The body of the article. Comes in two formats. clean and minimal. The clean format removes all html elements and separates paragraphs by two newlines. The minimal format uses sanitize-html to remove all html elements except for 'p', 'cite', 'b', 'i', 'em', 'strong', 'a'.

images An array of image urls found in the body. Comes in sizes full for each image.

source The url of the nyt article.

Readme

Keywords

none

Package Sidebar

Install

npm i nty-reader

Weekly Downloads

0

Version

1.0.1

License

MIT

Last publish

Collaborators

  • iamchairs