Train the model

Question

Train the model

mrunal2401 opened this issue a year ago · 4 comments

Hey Hussein, you have made a good project but i want to ask one thing, how to train the model for arabic image/handwritten image or how you trained the model?
Answers are appreciated :)

Thank you.

Answer 1 · 2023-04-14T12:32:09.000Z

Hello, This project is designed only for machine-typed text not handwritten. Otherwise, the character segmentation module won't work properly.

…

On Fri, Apr 14, 2023, 12:26 PM Mrunal Shah ***@***.***> wrote: Hey Hussein, you have made a good project but i want to ask one thing, how to train the model for arabic image/handwritten image or how you trained the model? Answers are appreciated :) Thank you. — Reply to this email directly, view it on GitHub <#24>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AFKVWFHOP5W7EUOGH6VJMULXBEQ5JANCNFSM6AAAAAAW6HSAYM> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

Answer 2 · 2023-04-17T07:15:20.000Z

Thank you for the information. But can you please share how you trained the model and provide steps for the same. Thank you again.

Answer 3 · 2023-04-17T10:07:08.000Z

We are using a simple model, a shallow NN I recall, that needs images of single characters for training.

We had a dataset of pages with Arabic text (images and text), and had to break those Arabic pages into single-character images to train the model.
So, we had to break the page to images of lines of text and then break those lines to images words of Arabic text which subsequently we break into many images of single character.

Those images of single character are fed into the model for training. We were using sci-kit learn to build and train the model for simplicity.
We also refer to multiple papers that illustrate the approaches we use for the line, word, and character segmentation.

Answer 4 · 2023-04-17T11:36:15.000Z

Thank you so much @HusseinYoussef