This is a wrapper for PDFBox, provided by Apache
To install this gem:
(sudo) gem install rpdfbox
require 'rpdfbox'
text = RPDFBox::TextExtraction.get_text_all("somepdf")
text = RPDFBox::TextExtraction.get_text_range("somepdf",2,4)
text = RPDFBox::TextExtraction.get_text_single("somepdf",1)
- spec creation
- Save text to location
- Way more
The API before 0.0.2 breaks. Just an FYI.