johnarevalo/gmu-mmimdb

Missing list.txt

Closed this issue · 2 comments

Can you please include list.txt or the format of the file

Each line in this file contains the path to the json files contained in the raw dataset. If you extracted it in the same path of this project, you can build the list.txt with

ls dataset/*.json > list.txt

the file would contain:

dataset/0000008.json
dataset/0000010.json
dataset/0000192.json
dataset/0000439.json
dataset/0000942.json
dataset/0002381.json
dataset/0003014.json
dataset/0003419.json
dataset/0003740.json
dataset/0003772.json
dataset/0004008.json
...

I stumbled over the same issue. You could maybe include that information in the section on how to build the dataset.