/extractor

extracts text from various file formats like pdf, html and docx

Primary LanguageRustApache License 2.0Apache-2.0

extractor

extracts text from various file formats like pdf, html and docx

Usage

  • to extract text from any file do cargo run . example
cargo run samples/a.html

Building

cargo build

Running the binary

./target/debug/extractor samples/a.html