boite/ty-ed

Deal with sueddeutsche.de zip file

Closed this issue · 0 comments

boite commented

It seems the file cannot be directly downloaded (so the download url was removed from data/discovered.jsonl in 3cbcbd) from google drive. It's a zip file containing 14 dox: the files in it need to be unpacked, named and catalogued.