/tika-fork

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
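As a rough illustration of what "detecting" a file type can mean, the sketch below checks the leading bytes of a file against a few well-known signatures. It is not Tika's implementation or API: the class `MagicSniffer`, its methods, and the label `ole2-compound-document` are hypothetical names chosen for this example, and it uses only the Java standard library. Real detection also weighs file names, container contents, and many more signatures.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper for illustration only; this is NOT part of Apache Tika.
class MagicSniffer {
    // PDF files begin with the ASCII text "%PDF-".
    private static final byte[] PDF = "%PDF-".getBytes(StandardCharsets.US_ASCII);
    // Legacy Office files (PPT, XLS, DOC) are OLE2 compound documents.
    private static final byte[] OLE2 = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };
    // ZIP local file header; also the container for PPTX, XLSX, and DOCX.
    private static final byte[] ZIP = {'P', 'K', 0x03, 0x04};

    // Returns true when data starts with the given signature.
    static boolean startsWith(byte[] data, byte[] magic) {
        if (data.length < magic.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, magic.length), magic);
    }

    // Maps the first bytes of a file to a coarse type label.
    static String detect(byte[] head) {
        if (startsWith(head, PDF)) {
            return "application/pdf";
        }
        if (startsWith(head, OLE2)) {
            return "ole2-compound-document"; // assumed label, not a registered media type
        }
        if (startsWith(head, ZIP)) {
            return "application/zip";
        }
        return "application/octet-stream"; // unknown binary data
    }

    public static void main(String[] args) {
        byte[] sample = "%PDF-1.7\n".getBytes(StandardCharsets.US_ASCII);
        System.out.println(detect(sample));
    }
}
```

A signature match like this only identifies the outer container: telling a PPT from an XLS, or a PPTX from a plain ZIP archive, requires looking inside the file, which is the kind of work a full toolkit takes on.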

Primary language: Java
License: Apache License 2.0 (Apache-2.0)

Stargazers

No one’s starred this repository yet.