VietnameseOCR: A Python repository from talk114

VietnameseOCR - Vietnamese Optical Character Recognition

Apply Deep Learning ( CNN networks ) to train a model uses for recognizing Vietnamese characters, it works well with Latin characters.

Dataset in big image ( 10.000 samples, 2800 x 2800 pixel)

Requirements

python 3.6.5
tensorflow
PIL

Model Summary

Layer	Shape	Kernel	Stride	Padding
INPUT	[28, 28, 1]
CONV1		[3, 3, 32, 32]	[1, 1]	SAME
POOL1
CONV2		[3, 3, 32, 64]	[1, 1]	SAME
POOL2
CONV3		[3, 3, 64, 128]	[1, 1]	SAME
POOL3
FC1
FC2	[625, 190]

Results

Training...

......
Epoch: 38 cost = 0.312853018
Epoch: 39 cost = 0.298816641
Epoch: 40 cost = 0.293328794

Evaluation
------------------------------
Test Accuracy: 0.974867469544

Training

Prepare dataset for training

git clone https://github.com/miendinh/VietnameseOCR.git
cd VietnameseOCR/data/train/characters
unzip dataset.zip

Let's train.

python train.py

Create you own dataset

Prepare fonts for generating text-image

You could add more fonts

cd VietnameseOCR/data/train/characters
unzip google.zip
unzip win.zip

Create font list, then save it in fonts.list

source ./list.sh

Generate Text Image Dataset

python generate_data.py

Play with pretrained model

All pretrained weights of model is save to file vocr.brain
Let's test with random character in dataset

python predict.py

talk114/VietnameseOCR

VietnameseOCR - Vietnamese Optical Character Recognition

Dataset in big image ( 10.000 samples, 2800 x 2800 pixel)

Requirements

Model Summary

Results

Training

Prepare dataset for training

Let's train.

Create you own dataset

Prepare fonts for generating text-image

Create font list, then save it in fonts.list

Generate Text Image Dataset

Play with pretrained model

Further working

References

Author mien.hust [at] gmail [dot] com