A library for extracting information from Thai personal identity cards, implemented with EasyOCR and Tesseract.
Results of the library's region-of-interest extraction:
Field |
---|
Identification Number |
FullNameTH |
NameEN |
LastNameEN |
Birthday |
Address |
Religion |
- The image should be at least 600x350 pixels.
- Use images with minimal reflections for good results.
- The identity card should fill about 75% of the image. If it does not, crop the image so that only the identity card area is left.
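The image requirements above can be checked before running the OCR. The following is a minimal sketch, not part of the library: `check_image` is a hypothetical helper, "600x350" is read as width x height, and "about 75%" is read as a lower bound on the fraction of the image area covered by the card. The card bounding box has to come from somewhere else (for example, a manual crop or your own detector).

```python
MIN_WIDTH, MIN_HEIGHT = 600, 350  # minimum resolution from the list above
MIN_CARD_COVERAGE = 0.75          # "about 75%", treated as a lower bound (assumption)


def check_image(image_size, card_box):
    """Return a list of problems found; an empty list means the image looks usable.

    image_size: (width, height) of the image in pixels.
    card_box:   (left, top, right, bottom) of the card inside the image, in pixels.
    """
    width, height = image_size
    if width <= 0 or height <= 0:
        raise ValueError("image_size must contain positive dimensions")

    problems = []
    if width < MIN_WIDTH or height < MIN_HEIGHT:
        problems.append(f"image is {width}x{height}, smaller than {MIN_WIDTH}x{MIN_HEIGHT}")

    # Fraction of the image area covered by the card bounding box.
    left, top, right, bottom = card_box
    card_area = max(0, right - left) * max(0, bottom - top)
    coverage = card_area / (width * height)
    if coverage < MIN_CARD_COVERAGE:
        problems.append(f"card covers about {coverage:.0%} of the image; crop to the card area")

    return problems


if __name__ == "__main__":
    # A 1200x800 photo in which the card occupies only the top-left quarter.
    for problem in check_image((1200, 800), (0, 0, 600, 400)):
        print(problem)
```

Returning a list of problems instead of a single boolean makes it easy to tell the user which requirement failed.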
Install using pip
For the stable release:
pip install thai-personal-card-extract
For the latest development release:
pip install git+git://github.com/ggafiled/ThaiPersonalCardExtrac.git
Note 1: On Windows, install Tesseract first by following the instructions in this Medium article: https://medium.com/@navapat.tpb/734dae2fb4d3 and make sure the setup is complete before using the library.
Note 2: On Linux, install Tesseract by following the official instructions at https://github.com/tesseract-ocr/tesseract
from ThaiPersonalCardExtract import PersonalCard
reader = PersonalCard(tesseract_cmd="D:/Program Files/Tesseract-OCR/tesseract") # On Windows, pass tesseract_cmd to set the path to your Tesseract executable.
result = reader.extractInfo('examples/card.jpg')
print(result)
The output contains one item for each field the library was able to extract, for example:
{
"Identification_Number": "9999999999999",
"FullNameTH": "นาย อายุมฺมุราเสะ",
"NameEN": "Me Shoys",
"LastNameEN": "Hinata",
"Birthday": "421 fun. 1998",
"Religion": "พุทธ",
"Address": "ท๒ 99/1 มิชีโฮะ เขตฮานามิกาวา อำเภอชิบ;"
}
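OCR output is rarely perfect, as the sample above shows, so it can help to sanity-check the result before using it. The sketch below is an illustration only and not part of the library: `normalize_id_number` and `missing_fields` are hypothetical helpers, and the code assumes the result can be read like a dictionary with the keys shown in the sample output (convert it first if your version returns a different type). It relies on the fact that a Thai identification number has 13 digits.

```python
import re

# Keys taken from the sample output above.
EXPECTED_FIELDS = (
    "Identification_Number",
    "FullNameTH",
    "NameEN",
    "LastNameEN",
    "Birthday",
    "Religion",
    "Address",
)


def normalize_id_number(raw):
    """Strip everything except ASCII digits; return None unless 13 digits remain."""
    digits = re.sub(r"[^0-9]", "", raw or "")
    return digits if len(digits) == 13 else None


def missing_fields(result, expected=EXPECTED_FIELDS):
    """List the expected fields that are absent or empty in the result."""
    return [name for name in expected if not result.get(name)]


if __name__ == "__main__":
    # Stand-in for the value returned by reader.extractInfo(...).
    sample = {"Identification_Number": "9 9999 99999 99 9", "Religion": "พุทธ"}
    print(normalize_id_number(sample["Identification_Number"]))
    print(missing_fields(sample))
```

A field that fails these checks is a good candidate for a retry with a better-cropped image or for manual review.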
You can configure the instance with the following keyword parameters:
Parameter name | Value type | Description |
---|---|---|
lang | Enum of the PersonalCardLanguage class | PersonalCardLanguage.THAI or PersonalCardLanguage.ENGLISH. Default is 'th' |
template_threshold | Float | Threshold used when calculating similarity to the template. Default is 0.7 |
sift_rate | Int | Feature keypoint rate. Default is 25,000 |
tesseract_cmd | String | Path to your Tesseract command. For Windows only. |
save_extract_result | Boolean | Set to True to save the extracted images. Default is False |
path_to_save | String | Path where the extracted images are saved. Used together with save_extract_result=True |
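One way to keep these options consistent across platforms is to assemble them in a small function and pass the result to the constructor. This is a sketch under assumptions: `build_reader_options` is a hypothetical helper, not something the library provides, and it uses only the parameter names and defaults listed in the table. The `PersonalCard(**options)` call is left as a comment so that the snippet runs without the library or Tesseract installed.

```python
import sys


def build_reader_options(platform=sys.platform, tesseract_cmd=None, save_dir=None,
                         template_threshold=0.7, sift_rate=25000):
    """Assemble keyword arguments for PersonalCard from the documented parameters."""
    options = {
        "template_threshold": template_threshold,  # documented default: 0.7
        "sift_rate": sift_rate,                    # documented default: 25,000
    }

    # tesseract_cmd is documented as Windows-only, so it is added only there.
    if platform.startswith("win"):
        if tesseract_cmd is None:
            raise ValueError("tesseract_cmd is required on Windows")
        options["tesseract_cmd"] = tesseract_cmd

    # path_to_save only makes sense together with save_extract_result=True.
    if save_dir is not None:
        options["save_extract_result"] = True
        options["path_to_save"] = save_dir

    return options


if __name__ == "__main__":
    options = build_reader_options(
        platform="win32",
        tesseract_cmd="D:/Program Files/Tesseract-OCR/tesseract",
        save_dir="output",
    )
    print(options)
    # from ThaiPersonalCardExtract import PersonalCard
    # reader = PersonalCard(**options)
```

Setting `save_extract_result` and `path_to_save` together avoids the case where a save path is given but saving is never switched on.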