planio-gmbh/plaintext
This gem wraps command line tools to extract plain text from typical files, such as PDF and common office formats.
RubyGPL-2.0
Issues
- 1
Can't parse Japanese PDF.
#8 opened by hhhhub000 - 4
- 3
Allow extension with custom extractors
#6 opened by jkraemer - 0
Support Apple iWork documents
#2 opened by yeah - 0
Support image files (via OCR)
#3 opened by yeah