htmldiff.js
Diff and markup HTML with <ins>
and <del>
tags.
Origin
Quote from the original source of this fork:
htmldiff.js
is a JavaScript port of https://github.com/myobie/htmldiff by
Keanu Lee at Inkling.
htmldiff.js is based on this fork and adds a few things:
- Diffing of video, math, widget, iframe, img and svg tags.
- Ability to set atomic tags via the API.
- A command line interface.
- TypeScript support.
- Better documentation.
See also Credits below.
Description
htmldiff takes two HTML snippets or files and marks the differences between them with
<ins>
and <del>
tags. The diffing understands HTML so it doesn't do a pure text diff,
instead it will insert the appropriate tags for changed/added/deleted text nodes, single
tags or tag hierarchies.
The module can be used as module in Node.js, with RequireJS, or even just as a script tag.
API
The module exports a single default function:
JavaScript:
diff(before, after, className, dataPrefix, atomicTags);
TypeScript:
function diff(before: string, after: string, className?: string | null, dataPrefix?: string | null, atomicTags?: string | null): string;
Parameters
-
before
(string) is the original HTML text. -
after
(string) is the HTML text after the changes have been applied.
The return value is a string with the diff result, marked by <ins>
and del
tags. The
function has three optional parameters. If an empty string or null
is used for any
of these three parameters it will be ignored:
-
className
(string) className will be added as a class attribute on every inserted<ins>
and<del>
tag. -
dataPrefix
(string) The data prefix to use for data attributes. The so called operation index data attribute will be nameddata-${dataPrefix-}operation-index
. If not used, the default attribute namedata-operation-index
will be added on every inserted<ins>
and<del>
tag. The value of this attribute is an auto incremented counter. -
atomicTags
(string) Comma separated list of tag names. The list has to be in the formtag1,tag2,...
e. g.head,script,style
. An atomic tag is one whose child nodes should not be compared - the entire tag should be treated as one token. This is useful for tags where it does not make sense to insert<ins>
and<del>
tags. If not used, the default list will be used:iframe,object,math,svg,script,video,head,style
.
Example
JavaScript:
diff = require('node-htmldiff');
console.log(diff('<p>This is some text</p>', '<p>That is some more text</p>', 'myClass'));
TypeScript:
import diff = require("node-htmldiff");
console.log(diff("<p>This is some text</p>", "<p>That is some more text</p>", "myClass"));
Please note that diff
is only an arbitrary name; since the module exports only one default
function you can use whatever name you like, e. g., diffHTML
.
Result:
<p><del data-operation-index="1" class="myClass">This</del><ins data-operation-index="1" class="myClass">That</ins> is some<ins data-operation-index="3" class="myClass"> more</ins> text.</p>
Command line interface
htmldiff beforeFile afterFile diffedFile [-c className] [-p dataPrefix] [-t atomicTags]
Parameters:
-
beforeFile
An HTML input file in its original form. -
afterFile
An HTML input file, based onbeforeFile
but with changes. -
diffedFile
Name of the diffed HTML output file. All differences betweenbeforeFile
andafterFile
will be surrounded with<ins>
and<del>
tags. If diffedFile is-
(minus) the result will be written withconsole.log()
to stdout.
Options:
-c className
, -p dataPrefix
and -t atomicTags
are all optional. For a
description please see API documentation above.
Development
After cloning the repository run npm i
or npm install
to install the necessary
dependencies. A run of npm run make
creates the JavaScript output file.
npm run lint
checks the TypeScript sources with TSLint. npm test
runs all the
tests from the test
directory. npm run testsample
diffs the HTML sample files
from the directory sample
and logs the result to the console.
The command line interface of htmldiff is developed in TypeScript so you have to run
npm run make
once to create the JavaScript output file.
Credits
This module wouldn't have been possible without code from the following projects/persons:
- Original project: The Network Inc., Github
- Massive improvements of the original code: Inkling, Github
- Support of more tags: Ian White, Github
License
MIT © idesis GmbH, Max-Keith-Straße 66 (E 11), D-45136 Essen
See the LICENSE
file for details.