/extracty

a set of tools to extract metadata from HTML documents (WIP)

Primary LanguagePythonBSD 2-Clause "Simplified" LicenseBSD-2-Clause

a set of tools to extract metadata from HTML documents, currently only the
following features are supported:

  - authorship
  - title