What type of information should gt_image.txt contain?

Question

What type of information should gt_image.txt contain?

Closed this issue 5 years ago · 6 comments

Answer 1 · 2019-12-20T07:00:31.000Z

I ask because I converted the XML file directly into txt, but when I started training, it got stuck at one epoch.
Any help? What I am wondering is: should the file contain only coordinates, or something else?
Please reply quickly, I have a project to complete by the 22nd.

Answer 2 · 2019-12-20T07:02:45.000Z

I would be thankful if someone could provide me with an example.

Answer 3 · 2019-12-20T12:00:06.000Z

This is an example of what the annotation file for one image should look like.

377,117,463,117,465,130,378,130,Genaxis Theatre
493,115,519,115,519,131,493,131,[06]
374,155,409,155,409,170,374,170,###
492,151,551,151,551,170,492,170,62-03
376,198,422,198,422,212,376,212,Carpark
494,190,539,189,539,205,494,206,###
374,1,494,0,492,85,372,86,###

Each row has 8 comma-separated numbers that give the coordinates of the bounding box (an x, y pair for each of its four corners), and the last value on each row is the text in that bounding box. Illegible text is replaced by three hashes (###) and isn't considered when calculating loss during training. If you only have bounding box coordinates, you have three options: modify the annotation files to add some arbitrary text (the text itself isn't used, so it can be literally anything), modify the code that prepares data to add some arbitrary text there, or change that code so that it doesn't require text values.

Replacing line[-1] here with a random string such as "ABC" should be enough.
https://github.com/kurapan/EAST/blob/2de0cfb98898c3a8c83b1f38835a15ea8d264d43/data_processor.py#L43
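For reference, here is a minimal sketch of how a row in this format could be read, falling back to a dummy label when a row has coordinates only. It is not the repository's code: `parse_annotation_line` and its `placeholder` parameter are hypothetical names used for illustration.

```python
def parse_annotation_line(line, placeholder="ABC"):
    """Split one annotation row into 8 integer coordinates and a text label.

    Hypothetical helper, not part of the EAST repository.
    """
    # Drop surrounding whitespace and a possible UTF-8 byte order mark.
    cleaned = line.strip().lstrip("\ufeff")
    # Split at most 8 times so commas inside the text label are preserved.
    parts = cleaned.split(",", 8)
    if len(parts) < 8:
        raise ValueError(f"expected at least 8 coordinates, got: {line!r}")
    coords = [int(float(p)) for p in parts[:8]]
    # Rows without a text field (or with an empty one) get the dummy label.
    text = parts[8] if len(parts) == 9 and parts[8] else placeholder
    return coords, text
```

Rows labelled `###` are returned unchanged, so any code that skips illegible text can still recognise them.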

Answer 4 · 2019-12-21T14:54:50.000Z

One more thing: when I annotated my dataset, I only got 4 coordinates per box.
How do I get 8 coordinates? Do I have to annotate it again?
I used labelImg for annotation.

Answer 5 · 2019-12-21T14:55:23.000Z

Those 4 coordinates are xmax, xmin, ymax, and ymin.
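An axis-aligned box given by these four values can be expanded into the 8-number form without annotating again. The sketch below assumes the corners are listed clockwise starting from the top-left, which is how the example rows above appear to be ordered, and that the XML follows the Pascal VOC layout that labelImg writes (`object`, `bndbox`, `xmin`, and so on). The function names and the `placeholder` parameter are hypothetical.

```python
import xml.etree.ElementTree as ET


def box_to_quad(xmin, ymin, xmax, ymax):
    """Return the 4 corners of an axis-aligned box as 8 numbers.

    Order (an assumption): top-left, top-right, bottom-right, bottom-left.
    """
    return [xmin, ymin, xmax, ymin, xmax, ymax, xmin, ymax]


def voc_xml_to_lines(xml_text, placeholder="ABC"):
    """Convert Pascal VOC style XML into 8-coordinate annotation rows."""
    root = ET.fromstring(xml_text)
    lines = []
    for obj in root.iter("object"):
        box = obj.find("bndbox")
        xmin, ymin, xmax, ymax = (
            int(float(box.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        quad = box_to_quad(xmin, ymin, xmax, ymax)
        # The text value is a dummy label, since only the boxes are known.
        lines.append(",".join(str(v) for v in quad) + "," + placeholder)
    return lines
```

For example, a box with xmin=10, ymin=20, xmax=110, ymax=60 becomes the row `10,20,110,20,110,60,10,60,ABC`.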

Answer 6 · 2019-12-22T05:52:46.000Z

Okay, got the solution. Thanks.