dhlab-epfl/dhSegment

Layout Analysis Use Case: DIVA-HisDB

doobybug opened this issue · 3 comments

Hi,

Are there any plans to release the training code and a demo for layout analysis on DIVA-HisDB?

Hello,

No, there are no plans to make a dedicated example for the DIVA-HisDB dataset. It should be quite similar to the cBAD and the Page demos. If you run into particular problems getting the training to run, feel free to ask.
With a bit of archeology you can also see what we had implemented here.

Hi, thank you so much for answering me. I will definitely look into it. I would like to experiment with your code to classify text and handwritten text in the same document and maybe get their bounding boxes. Which route do you think I should take: page segmentation or layout analysis?

Your task looks more similar to the DIVA-HisDB task than to the page segmentation task, so I would go in the direction of document image analysis.
If you need a guide to the main steps, you can also have a look at this notebook. However, in your case I would recommend annotating the text at pixel level, as is done for the DIVA-HisDB experiment (and not with bounding boxes around the text).
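
If bounding boxes are still needed afterwards, they can be derived from a pixel-level result rather than annotated directly. Below is a minimal sketch in plain Python, assuming the prediction has already been reduced to a 2D grid of integer class labels. The label values (0 = background, 1 = text, 2 = handwritten text) and the `boxes_from_mask` helper are hypothetical illustrations and not part of dhSegment.

```python
from collections import deque


def boxes_from_mask(mask, target_class):
    """Return one (x_min, y_min, x_max, y_max) box per 4-connected
    region of pixels whose label equals target_class."""
    height = len(mask)
    width = len(mask[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y0 in range(height):
        for x0 in range(width):
            if seen[y0][x0] or mask[y0][x0] != target_class:
                continue
            # Breadth-first flood fill starting from an unvisited pixel.
            seen[y0][x0] = True
            queue = deque([(y0, x0)])
            x_min = x_max = x0
            y_min = y_max = y0
            while queue:
                y, x = queue.popleft()
                x_min, x_max = min(x_min, x), max(x_max, x)
                y_min, y_max = min(y_min, y), max(y_max, y)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx]
                            and mask[ny][nx] == target_class):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((x_min, y_min, x_max, y_max))
    return boxes


# Hypothetical label values, chosen only for this example.
TEXT, HANDWRITTEN = 1, 2

toy_mask = [
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 2, 2],
    [0, 0, 0, 0, 2, 2],
    [1, 0, 0, 0, 0, 0],
]

text_boxes = boxes_from_mask(toy_mask, TEXT)
handwritten_boxes = boxes_from_mask(toy_mask, HANDWRITTEN)
```

On real page images a library routine for connected components would be much faster than this pure-Python loop, and small regions would probably need to be filtered out as noise. The sketch only shows why pixel-level labels lose nothing compared with box annotations: the boxes can be recovered from the mask, but not the other way around.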