domhandler-daily

2.3.0 • Public • Published

DOMHandler-Daily Build Status

The DOM handler (formally known as DefaultHandler) creates a tree containing all nodes of a page. The tree may be manipulated using the DOMUtils library.

Custom version to handle Twig tags in elements.

Usage

var handler = new DomHandler([ <func> callback(err, dom), ] [ <obj> options ]);
// var parser = new Parser(handler[, options]);

Example

var htmlparser = require("htmlparser2");
var rawHtml = "Xyz <script language= javascript>var foo = '<<bar>>';< /  script><!--<!-- Waah! -- -->";
var handler = new htmlparser.DomHandler(function (error, dom) {
    if (error)
     [...do something for errors...]
    else
     [...parsing done, do something...]
        console.log(dom);
});
var parser = new htmlparser.Parser(handler);
parser.write(rawHtml);
parser.done();

Output:

[{
    data: 'Xyz ',
    type: 'text'
}, {
    type: 'script',
    name: 'script',
    attribs: {
     language: 'javascript'
    },
    children: [{
     data: 'var foo = \'<bar>\';<',
     type: 'text'
    }]
}, {
    data: '<!-- Waah! -- ',
    type: 'comment'
}]

Option: normalizeWhitespace

Indicates whether the whitespace in text nodes should be normalized (= all whitespace should be replaced with single spaces). The default value is "false".

The following HTML will be used:

<font>
    <br>this is the text
<font>

Example: true

[{
    type: 'tag',
    name: 'font',
    children: [{
     data: ' ',
     type: 'text'
    }, {
     type: 'tag',
     name: 'br'
    }, {
     data: 'this is the text ',
     type: 'text'
    }, {
     type: 'tag',
     name: 'font'
    }]
}]

Example: false

[{
    type: 'tag',
    name: 'font',
    children: [{
     data: '\n\t',
     type: 'text'
    }, {
     type: 'tag',
     name: 'br'
    }, {
     data: 'this is the text\n',
     type: 'text'
    }, {
     type: 'tag',
     name: 'font'
    }]
}]

Option: withStartIndices

Indicates whether a startIndex property will be added to nodes. When the parser is used in a non-streaming fashion, startIndex is an integer indicating the position of the start of the node in the document. The default value is "false".

Dependencies (1)

Dev Dependencies (3)

Package Sidebar

Install

npm i domhandler-daily

Weekly Downloads

2

Version

2.3.0

License

none

Last publish

Collaborators

  • romainberger