manisandro/gImageReader

Setting to ignore pictures

Opened this issue · 2 comments

I have an old PDF that contains quite some pictures. In that case just the text is needed as gImageReader detect a lot of lines and so on a picture as well. I think that contributes to the fact that the pdf gets blown up from 50-80MB to about 340MB.

If i decide that i need a picture i can still take a screenshot :)

This is more a tesseract training issue that something gImageReader can handle.

Interesting, good to know, thanks. What about a setting for exporting to pdf? Ignore graphics/pictures? In the Image settings - section maybe, as a checkbox on the first line and if checked, the rest is not changeable because its not relevant for that export anymore.