mvoelk/ssd_detectors

[Question] Can you please upload the pickled dataset that you used for training SegLink?

Closed this issue · 5 comments

Hello,

Can you please upload the pickled dataset that you used for training SegLink?

It would be great if we could just run the code first and then try to understand the pipeline. I am asking because I am finding it difficult to understand how the data is prepared for training SegLink. I have trained object detectors before, but I think I am missing a step when it comes to training text detectors, so inspecting your data and playing with it would definitely help me understand the pipeline better.

Thanks in advance.

You do not need the pickled data. Simply replace

import pickle

with open('gt_util_synthtext_seglink.pkl', 'rb') as f:
    gt_util = pickle.load(f)

with

gt_util = GTUtility('data/SynthText/', polygon=True)

The gt_util_synthtext_seglink.pkl file only exists to speed up parsing of the dataset. It is serialized in datasets.ipynb and does not contain the image data itself, only filenames, bounding boxes and so on. See also #1 ...
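The general caching pattern the .pkl file implements can be sketched as below. Note that `parse_annotations` here is a hypothetical stand-in for `GTUtility('data/SynthText/', polygon=True)`, since the real parser needs the SynthText dataset on disk; only the load-or-parse-and-cache logic is what the repo actually does.

```python
import os
import pickle

def parse_annotations():
    # Hypothetical stand-in for GTUtility, which walks the dataset and
    # collects filenames and bounding boxes (no image data).
    return {'image_names': ['img1.jpg'], 'boxes': [[[0, 0, 10, 10]]]}

def load_gt(cache_path='gt_util_synthtext_seglink.pkl'):
    # Reuse the cached annotations if present; otherwise parse the
    # dataset once and serialize the result for subsequent runs.
    if os.path.exists(cache_path):
        with open(cache_path, 'rb') as f:
            return pickle.load(f)
    gt_util = parse_annotations()
    with open(cache_path, 'wb') as f:
        pickle.dump(gt_util, f)
    return gt_util
```

Because the cache holds only annotations, it is small and fast to rebuild, which is why downloading the .pkl is unnecessary once you have the dataset itself.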

Thanks for the quick reply. Did you get the SynthText dataset from here http://www.robots.ox.ac.uk/~vgg/data/scenetext/ ?

Yes!

Alright thanks!

Could someone who has generated the .pkl file PR it? I'm on a limited bandwidth network and would like to run the end2end notebook.