STR_Vietnam_Temple: A Python repository from HoAnhKhoaVN

Try Demo on our website

What's new

11 November 2023 - Version 0.5
- New method for determining the correct background and foreground colors of an image.
- Added support for running prediction on all images in a folder.
27 September 2023 - Version 0.4
- Rotate the text according to the rotation angle of the bbox.
- Change the method for determining background and foreground colors.
- Add inference timing.
- Add code to run on Google Colab
- Report
- Web
- Output
11 September 2023 - Version 0.3
- Skip Latin characters.
- Expand bounding boxes along polygons instead of rectangles.
22 August 2023 - Version 0.2
- Integrate OCR into the pipeline
- Recognition for vertical texts.
- Postprocessing for vertical texts.
11 August 2023 - Version 0.1
- Code the full flow for STR on Chinese temples.
- Code the backend and frontend for the demo.
- Code the postprocessing for the final output.
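The "Skip Latin characters" step from version 0.3 can be illustrated with a short filter. This is only a sketch, not the repository's implementation: the function names `is_latin_text` and `skip_latin` are hypothetical, and it assumes that a recognized string counts as Latin when every letter in it has a Unicode name starting with `LATIN` (which also covers accented Vietnamese letters).

```python
import unicodedata


def is_latin_text(text: str) -> bool:
    """Return True when the text has letters and all of them are Latin letters."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        # Digits or punctuation only: not treated as Latin text in this sketch.
        return False
    return all(unicodedata.name(ch, "").startswith("LATIN") for ch in letters)


def skip_latin(recognized: list[str]) -> list[str]:
    """Keep only the recognized strings that are not purely Latin."""
    return [text for text in recognized if not is_latin_text(text)]
```

Strings that mix Latin letters with Chinese characters are kept, on the assumption that they still need to be processed.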

What's coming next

Write log
Add debug code
Remove text
Classify style -> Define the Vietnamese font
Map colors.
What does a real Vietnamese image look like? -> Increase the height of the text
Collect Vietnamese temple data.
Train more accurate text detection and recognition models
Collect and label datasets.
Host on a server with a GPU.
Add XAI support
Add an explanatory video in the style of 3Blue1Brown
Write thesis
Write paper
Multimodel for STR

Todo

Debug the background-removal code
Text size increases when rotated
Box boundaries show a different color (cause unknown)
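One possible contributor to the "text size increases when rotating" item is purely geometric: if a rendered text patch is rotated with the canvas expanded to fit, the enclosing axis-aligned box is larger than the original patch. The sketch below only shows that arithmetic. Whether this is the actual cause in this repository is an assumption, and `expanded_size` is a hypothetical helper.

```python
import math


def expanded_size(width: float, height: float, angle_deg: float) -> tuple[float, float]:
    """Size of the axis-aligned box enclosing a width x height rectangle rotated by angle_deg."""
    theta = math.radians(angle_deg)
    cos_t = abs(math.cos(theta))
    sin_t = abs(math.sin(theta))
    new_width = width * cos_t + height * sin_t
    new_height = width * sin_t + height * cos_t
    return new_width, new_height


if __name__ == "__main__":
    # A wide, short text patch rotated by 30 degrees needs a much taller canvas.
    print(expanded_size(200, 50, 30))
```

If the rotated patch is then resized back into the original bbox, the text itself changes scale, so comparing the sizes before and after rotation may help locate the problem.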

GUI

Format log file
Delete unnecessary information

Backend

Host on server

Input and output

Input

Output

Installation

Install using pip

pip install -r requirements.txt

To use the PP-OCR method

Download the PyMuPDF wheel from Google Drive
Install PyMuPDF

pip install PyMuPDF-1.20.2-cp311-cp311-win_amd64.whl

Install PaddleOCR

pip install paddleocr

Install on an Azure server

Install paddlepaddle

Without GPU

python -m pip install paddlepaddle -i https://pypi.tuna.tsinghua.edu.cn/simple

With GPU

python -m pip install paddlepaddle-gpu -i https://pypi.tuna.tsinghua.edu.cn/simple

Source: https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.7/doc/doc_en/quickstart_en.md

Install paddleocr

pip install "paddleocr>=2.0.1" --upgrade PyMuPDF==1.21.1

Thanks: https://stackoverflow.com/questions/76379293/how-can-i-fix-the-error-in-pymupdf-when-installing-paddleocr-with-pip

Install sklearn

pip install scikit-learn

Update libgomp1

apt-get install libgomp1

Thanks: https://stackoverflow.com/questions/43764624/importerror-libgomp-so-1-cannot-open-shared-object-file-no-such-file-or-direc

Update the ffmpeg, libsm6 and libxext6 libraries

apt-get update && apt-get install ffmpeg libsm6 libxext6 -y

Thanks: https://stackoverflow.com/questions/55313610/importerror-libgl-so-1-cannot-open-shared-object-file-no-such-file-or-directo
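After the installation steps, a quick import check can confirm that the packages are visible to the interpreter. This is a minimal sketch and not part of the repository; `check_modules` is a hypothetical helper, and the mapping lists the usual import names (`paddle` for paddlepaddle, `fitz` for PyMuPDF, `sklearn` for scikit-learn).

```python
import importlib.util

# pip package name -> module name used in "import ..."
REQUIRED_MODULES = {
    "paddlepaddle": "paddle",
    "paddleocr": "paddleocr",
    "PyMuPDF": "fitz",
    "scikit-learn": "sklearn",
}


def check_modules(modules: dict[str, str]) -> dict[str, bool]:
    """Report, per pip package, whether its module can be found without importing it."""
    return {
        pip_name: importlib.util.find_spec(module_name) is not None
        for pip_name, module_name in modules.items()
    }


if __name__ == "__main__":
    for pip_name, found in check_modules(REQUIRED_MODULES).items():
        print(f"{pip_name}: {'ok' if found else 'MISSING'}")
```

The check only verifies that each module can be located. Missing system libraries such as libgomp1 would still surface only at import time.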

Run code

Set the input and output paths in run.sh, then run this command:

sh run.sh

Run demo

Run backend

flask run

Run frontend

Double-click frontend.html to open the frontend in a browser.

Log

Logs are written to the log folder. Each log file contains the log messages for one day.
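One common way to get one log file per day is Python's `TimedRotatingFileHandler`. The sketch below is an assumption about how such logging could be set up, not the repository's actual code: the function name `build_logger`, the logger name, the file name `app.log`, and the retention of 30 files are all hypothetical.

```python
import logging
from logging.handlers import TimedRotatingFileHandler
from pathlib import Path


def build_logger(log_dir: str = "log", name: str = "str_temple") -> logging.Logger:
    """Create a logger that writes to log_dir and rolls over to a new file at midnight."""
    Path(log_dir).mkdir(parents=True, exist_ok=True)
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid attaching duplicate handlers on repeated calls
        handler = TimedRotatingFileHandler(
            Path(log_dir) / "app.log",
            when="midnight",   # rotate once per day
            backupCount=30,    # keep the last 30 rotated files (assumed value)
            encoding="utf-8",
        )
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

Calling `build_logger().info("inference started")` would append a timestamped line to `log/app.log`. Rotated files get a date suffix added by the handler.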

Fullflow diagram

Postprocessing

Preprocessing

OCR

Text Detection

Text Recognition

Dataset

Synthesis

Reality

Label

Language Model

Translate from accent Vietnamese to modern Vietnamese

Documentation

References