Download questions

Question

Download questions

Closed this issue 5 years ago · 9 comments

Great job，thank you for sharing such large-scale document data. However，the speed which i download these datasets is very slow. And, it often disconnected downloads, is there any other way to get these datasets?

Answer 1 · 2019-08-26T22:58:37.000Z

@phexic thanks for your interest. We will look into this issue and solve it asap.

Answer 2 · 2019-08-27T03:49:03.000Z

@zhxgj Great!

Answer 3 · 2019-08-27T04:44:05.000Z

@phexic we tested a few geographic regions and got decent downloading speed from Box. Can you please let us know which geographic region are you downloading the data from?

Answer 4 · 2019-08-27T06:06:04.000Z

@zhxgj Maybe the reason I'm in China

Answer 5 · 2019-08-27T23:16:03.000Z

@phexic Em, maybe Box does not well in China. Let me try to work out a solution for you.

Answer 6 · 2019-08-28T02:50:28.000Z

@zhxgj Oh, Wow! thanks a million.
Will you public the pre-training models about document layout?

Answer 7 · 2019-08-30T05:07:35.000Z

@zhxgj Oh, Wow! thanks a million.
Will you public the pre-training models about document layout?

@phexic This is a great suggestion. I will follow up with our legal team regarding releasing the pre-trained model and maybe the training config file.

Answer 8 · 2019-09-16T06:28:17.000Z

@zhxgj Oh, Wow! thanks a million.
Will you public the pre-training models about document layout?

@phexic This is a great suggestion. I will follow up with our legal team regarding releasing the pre-trained model and maybe the training config file.

@zhxgj Hi, any news from your legal team whether you can release the pre-trained models?

Answer 9 · 2019-10-31T21:41:53.000Z

Hi @phexic The data has been migrated to IBM DAX platform. I think the download should be more stable now. Please see the instructions in README