/PDFparser_TextpropertyExtractor

A basic E-PDF parser that extracts the properties of each piece of text: the text itself and its font, style, size, and color. The parser also pre-processes the extracted text by removing stopwords and punctuation.
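The pre-processing step could look roughly like the sketch below. It is a minimal illustration using only the standard library, not the repository's actual code: the `preprocess` function name, the lowercasing step, and the tiny `STOPWORDS` set are all assumptions made for the example.

```python
import string

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch);
# the list the repository actually uses is not shown in the description.
STOPWORDS = frozenset({"a", "an", "and", "the", "is", "of", "to", "in", "that", "those"})


def preprocess(text, stopwords=STOPWORDS):
    """Lowercase the text, strip ASCII punctuation, and drop stopwords."""
    # Map every punctuation character to a space so that "size,color"
    # still splits into two separate tokens.
    table = str.maketrans(string.punctuation, " " * len(string.punctuation))
    tokens = text.lower().translate(table).split()
    return [tok for tok in tokens if tok not in stopwords]


if __name__ == "__main__":
    print(preprocess("The parser extracts the Text, the Font, and the Size of a PDF."))
    # → ['parser', 'extracts', 'text', 'font', 'size', 'pdf']
```

Replacing punctuation with spaces rather than deleting it keeps adjacent words from being glued together. A real pipeline would likely swap in a fuller stopword list, for example one from an NLP library.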

Primary Language: Python
