/htmldiff.js

Diff and markup HTML with <ins> and <del> tags.

Primary LanguageJavaScriptMIT LicenseMIT

htmldiff.js

Diff and markup HTML with <ins> and <del> tags.

Origin

Quote from the original source of this fork:

htmldiff.js is a JavaScript port of https://github.com/myobie/htmldiff by Keanu Lee at Inkling.

htmldiff.js is based on this fork and adds a few things:

  • Diffing of video, math, widget, iframe, img and svg tags.
  • Ability to set atomic tags via the API.
  • A command line interface.
  • TypeScript support.
  • Better documentation.

See also Credits below.

Description

htmldiff takes two HTML snippets or files and marks the differences between them with <ins> and <del> tags. The diffing understands HTML so it doesn't do a pure text diff, instead it will insert the appropriate tags for changed/added/deleted text nodes, single tags or tag hierarchies.

The module can be used as module in Node.js, with RequireJS, or even just as a script tag.

API

The module exports a single default function:

JavaScript:

diff(before, after, className, dataPrefix, atomicTags);

TypeScript:

function diff(before: string, after: string, className?: string | null, dataPrefix?: string | null, atomicTags?: string | null): string;

Parameters

  • before (string) is the original HTML text.
  • after (string) is the HTML text after the changes have been applied.

The return value is a string with the diff result, marked by <ins> and del tags. The function has three optional parameters. If an empty string or null is used for any of these three parameters it will be ignored:

  • className (string) className will be added as a class attribute on every inserted <ins> and <del> tag.
  • dataPrefix (string) The data prefix to use for data attributes. The so called operation index data attribute will be named data-${dataPrefix-}operation-index. If not used, the default attribute name data-operation-index will be added on every inserted <ins> and <del> tag. The value of this attribute is an auto incremented counter.
  • atomicTags (string) Comma separated list of tag names. The list has to be in the form tag1,tag2,... e. g. head,script,style. An atomic tag is one whose child nodes should not be compared - the entire tag should be treated as one token. This is useful for tags where it does not make sense to insert <ins> and <del> tags. If not used, the default list will be used: iframe,object,math,svg,script,video,head,style.

Example

JavaScript:

  diff = require('node-htmldiff');

  console.log(diff('<p>This is some text</p>', '<p>That is some more text</p>', 'myClass'));

TypeScript:

  import diff = require("node-htmldiff");

  console.log(diff("<p>This is some text</p>", "<p>That is some more text</p>", "myClass"));

Please note that diff is only an arbitrary name; since the module exports only one default function you can use whatever name you like, e. g., diffHTML.

Result:

<p><del data-operation-index="1" class="myClass">This</del><ins data-operation-index="1" class="myClass">That</ins> is some<ins data-operation-index="3" class="myClass"> more</ins> text.</p>

Command line interface

htmldiff beforeFile afterFile diffedFile [-c className] [-p dataPrefix] [-t atomicTags]

Parameters:

  • beforeFile An HTML input file in its original form.

  • afterFile An HTML input file, based on beforeFile but with changes.

  • diffedFile Name of the diffed HTML output file. All differences between beforeFile and afterFile will be surrounded with <ins> and <del> tags. If diffedFile is - (minus) the result will be written with console.log() to stdout.

Options:

-c className, -p dataPrefix and -t atomicTags are all optional. For a description please see API documentation above.

Development

After cloning the repository run npm i or npm install to install the necessary dependencies. A run of npm run make creates the JavaScript output file. npm run lint checks the TypeScript sources with TSLint. npm test runs all the tests from the test directory. npm run testsample diffs the HTML sample files from the directory sample and logs the result to the console.

The command line interface of htmldiff is developed in TypeScript so you have to run npm run make once to create the JavaScript output file.

Credits

This module wouldn't have been possible without code from the following projects/persons:

License

MIT © idesis GmbH, Max-Keith-Straße 66 (E 11), D-45136 Essen

See the LICENSE file for details.