/boilerpipe.net-core

The Boilerpipe text extraction library ported to .Net Core

Primary LanguageC#GNU Lesser General Public License v3.0LGPL-3.0

boilerpipe.net-core

Boilerpipe text extraction library ported to .Net Core based on rasmusjp's implementation in .NET 4.5 which you can find here https://github.com/rasmusjp/boilerpipe.net

Installation

To install Boilerpipe.Net.Core from the NuGet Gallery, run the following in the Package Manager Console

PM> Install-Package Boilerpipe.Net.Core

Usage

using Boilerpipe.Net.Extractors;
...
string html = ...
// NOTE: Use ArticleExtractor unless DefaultExtractor gives better results for you
string text = CommonExtractors.ArticleExtractor.GetText(html);

License

Boilerpipe.Net is licensed under LGPL V3.