schappim/macOCR

Newlines

hugows opened this issue · 2 comments

Hi,
I'm very impressed with the quality of the OCR of this little tool - and paired with Alfred it became super powerful. Thank you for writing this!

The issue: I noticed that every character is correctly detected in my tests, but no newline is ever added in the output.
Is this a known limitation?

This is the only thing missing here that would make it seamless for extracting tabular data from PDFs.

Tatsh commented

The delimiter to join strings is currently hard-coded to be a single space.

How can this be changed in the code? I might see if I can figure out how to fork this JUST for this one change. It's nowhere near as useful to me as it could be without newlines being included.