/Tab-Aug

Code for augmenting document table images.

Primary LanguagePython

Dataset Augmentation

Use augment_data.py script for augmenting a dataset. You have to provide folders containing images and their corresponding XML and OCR files. If OCR files are not available you may provide an empty directory and the program will generate the missing OCR files and save them there. log_file flag can be used for logging out warnings and errors to a text file. If a file name is not provided the logging will be skipped.

Note: that the script will not over-write generated files in any case. Thus you can call the augmentation script multiple times into the same output directory without a concern for them being over-written.

Requirement:

pip install truthpy

Usage

usage: main.py [-h] -img IMAGE_DIR -xml XML_DIR -ocr OCR_DIR -n NUM_SAMPLES -o
               OUT_DIR [-log LOG_FILE] [-vis]

optional arguments:
  -h, --help            show this help message and exit
  -img IMAGE_DIR, --image_dir IMAGE_DIR
                        Directory for images
  -xml XML_DIR, --xml_dir XML_DIR
                        Directory for xmls
  -ocr OCR_DIR, --ocr_dir OCR_DIR
                        Directory for ocr files. (If an OCR file is not found,
                        it will be generated and saved in this directory for
                        future use)
  -n NUM_SAMPLES, --num_samples NUM_SAMPLES
                        Number of augmented samples to generate
  -o OUT_DIR, --out_dir OUT_DIR
                        Output directory for generated data
  -log LOG_FILE, --log_file LOG_FILE
                        Output file path for error logging.
  -vis, --visualize

Command: python augment_data.py -img data/images/ -xml data/xmls/ -ocr data/ocr/ -n 100 -o augmented_data/ -log error_logs.txt -vis