node package manager
Loosely couple your services. Use Orgs to version and reuse your code. Create a free org »

html5

HTML5 Parser for node.js

Build Status

Examples

A simple example:

var HTML5 = require('html5');
var jsdom = require('jsdom');
var core = jsdom.browserAugmentation(jsdom.level(3));
 
var impl = new core.DOMImplementation();
var document = impl.createDocument();
var parser = new HTML5.JSDOMParser(document, core);
 
parser.parse('<p>I am a very small HTML document</p>');
 
console.log(document.getElementsByTagName("p")[0].innerHTML);

Interesting features

  • Streaming parser: You can pass parser.parse an EventEmitter and the parser will keep adding data as it's received.

  • HTML5 parsing algorithm. If you find something this can't parse, I'll want to know about it. It should make sense out of anything a browser can.

Installation

Use npm, or to use the git checkout, read on.

You'll need to fetch dependencies or initialize git submodules if you're pulling this from my git repository.

npm install

and give it a run:

npm test

Git repository at http://dinhe.net/~aredridel/projects/js/html5.git/