/articletext

Golang package to extract useful text from a HTML document

Primary LanguageGo

ArticleText

Golang package with a function to extract useful text from a HTML document.

A function analyses a html code and drops everything related to navigation, advertising etc. Extracts only useful contents of a document, text of a central element.

Installation

go get github.com/gelembjuk/articletext

Example

package main

import (
	"fmt"
	"os"
	"github.com/gelembjuk/articletext"
)

func main() {

	url := os.Args[1]
	text, err := articletext.GetArticleTextFromUrl(url)
	
	fmt.Println(text)
}

Author

Roman Gelembjuk (@gelembjuk)