/TextLinesCounter

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Python text lines counter

Detects number of text lines for a given scanned textual document (an image or a folder).

  • Text must have only straight text lines arranged in one column (no images or tables)
  • The images may be rotated.
  • Text can be written in any language, any font or size
  • Needs about 2 seconds per picture
  • 100% accuracy of 50-70% of samples (doesn't detect accurately Arabian letters case-10 and text on a gradient background case-9)

Usage

Whole folder

python lines.py
>> samples

Single file

python lines.py
>> samples\case-0.png