lukashermann/hulc

task_ABC_D dataset can not be unzipped

Closed this issue · 6 comments

hk-zh commented

Hi, I tried using the dataset ABC to train the model. I downloaded the task_ABC_D.zip and tried to unzip it. I tried the commands unzip and 7z but both can not unzip it successfully. The error said 'start of central directory not found zip file corrupt'. I am sure the zip file has already been fully downloaded (518 GB).

mees commented
hk-zh commented

Have you tried to verify the zip file with the checksum instructions? El vie, 30 dic 2022 a las 13:00, Hongkuan Zhou @.>) escribió:

Hi, I tried using the dataset ABC to train the model. I downloaded the task_ABC_D.zip and tried to unzip it. I tried the commands unzip and 7z but both can not unzip it successfully. The error said 'start of central directory not found zip file corrupt'. I am sure the zip file has already been fully downloaded (518 GB). — Reply to this email directly, view it on GitHub <#9>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAGH2ZZ6DTQEI6YZYBLLST3WP3FE7ANCNFSM6AAAAAATM2XVBM . You are receiving this because you are subscribed to this thread.Message ID: @.
>

I haven't tried.
If I use the command unzip task_ABC_D.zip, it returns the following message
Archive: task_ABC_D.zip error [task_ABC_D.zip]: start of central directory not found; zipfile corrupt. (please check that you have transferred or created the zip file in the appropriate BINARY mode and that you have compiled UnZip properly) .
If I use the command 7z x task_ABC_D.zip, it could unzip part of the files (around 5000 files). The left files will be recognized as the tail.
ERRORS: Headers Error Unconfirmed start of archive WARNINGS: There are data after the end of archive Physical Size = 1259053692 Tail Size = 554050759013

hk-zh commented

Actually, I also can not unzip the tasks_D_D.zip archive with the unzip command in ubuntu OS. It complains the file is too large. I can only use the 7z command.

hk-zh commented

Maybe I can try task_ABCD_D.zip. Could I ask if the episodes of ABCD datasets are separated in the task_ABCD_D.zip or mixed?

mees commented
hk-zh commented

The problem might be the firewall in my organization. It corrupted the zip file. Thank you for your information.