yssk22/extractcontent

Utility for extracting title and main contents from an HTML text.

JavaScriptMIT

extractcontent

This module extracts title and main contents from an HTML text.

Algorithm is ported from the original implementation in Ruby

Usage

var ex = require('extractcontent')
ex.extractFromUrl('http://yssk22.blogspot.com/', function(error, result){ 
   console.log(result.title); 
   // -> Relaxed in Japan.
   console.log(result.content); 
   // -> last week ... 
});

Install

npm install extractcontent