/node-article-extractor

Automatically extract body content (and other cool stuff) from an html document. based on https://github.com/ageitgey/node-unfluff, but support Chinese.

Primary LanguageHTMLApache License 2.0Apache-2.0

Stargazers