Thai ID OCR App with Django, PyTesseract, and OpenCV

Overview

Welcome to the Thai ID OCR App, a Django-based application that performs Optical Character Recognition (OCR) on Thai ID card images. PyTesseract is used for OCR, and OpenCV is employed for image preprocessing to enhance readability. The application allows users to extract information from Thai ID cards, save the results in a database, and perform operations such as updating or deleting stored ID cards.

Link

Live deployment: https://ocr-thai-production-7e36.up.railway.app/

Technologies Used

Django for web application development
PyTesseract for OCR
OpenCV for image preprocessing
Database: SQLite (default with Django)
Other dependencies: See requirements.txt

Setup Instructions

Tesseract OCR must be installed on your machine and added to your PATH before you can run the project.

For macOS:
```
brew install tesseract
```
For Windows: download the Tesseract installer from the following link and run the Windows installation wizard.
```
https://github.com/UB-Mannheim/tesseract/wiki
```
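Once Tesseract is installed, a short script along these lines can confirm that it is reachable on PATH. This is a minimal sketch: the helper names `find_tesseract` and `installed_languages` are hypothetical, and the check for the `tha` language pack assumes the app relies on Tesseract's Thai data, which the README does not state explicitly.

```python
import shutil
import subprocess
from typing import List, Optional


def find_tesseract() -> Optional[str]:
    """Return the full path of the tesseract executable, or None if not on PATH."""
    return shutil.which("tesseract")


def installed_languages(executable: str) -> List[str]:
    """Return the language codes reported by `tesseract --list-langs`."""
    result = subprocess.run(
        [executable, "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Depending on the Tesseract version the list goes to stdout or stderr,
    # so both streams are scanned and the header line is skipped.
    lines = (result.stdout + result.stderr).splitlines()
    return [
        line.strip()
        for line in lines
        if line.strip() and not line.startswith("List of")
    ]


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("tesseract was not found on PATH")
    else:
        print(f"tesseract found at {path}")
        if "tha" not in installed_languages(path):
            print("Thai language data ('tha') does not appear to be installed")
```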

Clone the repository:

git clone https://github.com/sparsh-kumar7/Qoala
cd Qoala

Create Virtual Environment:
```
python -m venv .venv
```

Activate Virtual Environment:

For macOS:

source .venv/bin/activate

For Windows:

.venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Run migrations:
```
python manage.py migrate
```
Start the Django development server:
```
python manage.py runserver
```

User Interface

Upload a Thai ID card image (PNG, JPEG, JPG) for OCR.
View the extracted information (name, last name, identification number, date of birth, date of issue, date of expiry) in JSON format.
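As an illustration of how raw OCR text could be turned into one of the JSON fields listed above, the sketch below extracts the 13-digit identification number and validates it with the standard mod-11 check digit used for Thai national ID numbers. The function names and the JSON keys `identification_number` and `checksum_valid` are hypothetical; the app's real parsing logic and output schema are not shown in this README.

```python
import json
import re
from typing import Optional

# The number is usually printed in groups, e.g. "1 2345 67890 12 1".
ID_PATTERN = re.compile(r"\d[\s-]?\d{4}[\s-]?\d{5}[\s-]?\d{2}[\s-]?\d")


def is_valid_thai_id(digits: str) -> bool:
    """Check the mod-11 check digit of a 13-digit Thai ID number."""
    if len(digits) != 13 or not digits.isdigit():
        return False
    # The first 12 digits are weighted 13, 12, ..., 2.
    total = sum(int(d) * (13 - i) for i, d in enumerate(digits[:12]))
    return (11 - total % 11) % 10 == int(digits[12])


def extract_id_number(ocr_text: str) -> Optional[str]:
    """Return the first checksum-valid ID number in the OCR text, if any."""
    for match in ID_PATTERN.finditer(ocr_text):
        digits = re.sub(r"\D", "", match.group())
        if is_valid_thai_id(digits):
            return digits
    return None


def to_json(ocr_text: str) -> str:
    """Serialize the extracted number; the key names are illustrative only."""
    number = extract_id_number(ocr_text)
    payload = {
        "identification_number": number,
        "checksum_valid": number is not None,
    }
    # ensure_ascii=False keeps Thai characters readable if more fields are added.
    return json.dumps(payload, ensure_ascii=False)
```

Rejecting numbers that fail the check digit is a cheap way to catch single-digit OCR misreads before a record is saved to the database.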