/tokenize-htmltext

Tokenize HTML in an array of text tokens

Primary LanguageJavaScriptApache License 2.0Apache-2.0

tokenize-htmltext

Build Status NPM version

Tokenize an html string into a list of text tokens, it keeps reference to indexes in the original string.

Installation

$ npm install tokenize-htmltext

Usage

var tokenizeHTML = require('tokenize-htmltext');


var tokens = tokenizeHTML('<b>hello</b> world');

/*
[ { value: 'hello', index: 3, offset: 5 },
  { value: ' world', index: 12, offset: 6 } ]
*/