mrcsparker/ruby_tika_app
A Ruby wrapper for the Tika JAR (tika-app.jar) that extracts text in a variety of formats from PDF, XLS, DOC, and other files
DIGITAL Command Language · MIT
Issues
- Allow custom config file paths (#16, opened by eliotjordan)
- RubyTikaApp::CommandFailedError: execution failed with status INFO OpenType Layout tables used in font ArialMT are not implemented in PDFBox and will be ignored: (#10, opened by washingon)
- Option #encoding (#9, opened by mr-dxdy)
- Undefined method =~ (#4, opened by jasonperrone)