load_dataset() doesn't work without specifying dataset_revision?

Question

Opened this issue a year ago · 1 comment

Hello!

Loading any section of the dataset like this:

from datasets import load_dataset
d = load_dataset('nuprl/MultiPL-E', 'humaneval-lua', download_mode='force_redownload')

results in an ExpectedMoreDownloadedFiles error, but passing the revision hash from your completions.py works:

d = load_dataset('nuprl/MultiPL-E', 'humaneval-lua', download_mode='force_redownload', revision='bf4f3c31a1e0a164b7886c9eb04f82534edf4ce9')

Is this intended?

Thanks a lot in advance!
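
In case it helps anyone hitting the same error, here is a rough sketch of the workaround as a helper: try the default revision first, and retry with the pinned commit only if that fails. The name load_with_revision_fallback is made up for illustration and is not part of datasets or MultiPL-E. The loader is passed in as an argument so the helper can be tried without network access. It catches a broad Exception because I am not sure the exception class has the same name and import path in every datasets version.

```python
def load_with_revision_fallback(loader, path, name, pinned_revision, **kwargs):
    """Call loader(path, name, **kwargs); on failure, retry with revision=pinned_revision.

    `loader` is expected to behave like datasets.load_dataset.
    Do not pass `revision` in kwargs; the helper supplies it on the retry.
    """
    try:
        # First attempt: whatever revision the Hub serves by default.
        return loader(path, name, **kwargs)
    except Exception as err:
        # Assumption: the failure surfaces as an ordinary exception
        # (ExpectedMoreDownloadedFiles in my case); caught broadly on purpose.
        print(f"Default revision failed ({type(err).__name__}); "
              f"retrying with revision={pinned_revision}")
        return loader(path, name, revision=pinned_revision, **kwargs)


# Intended usage (needs the datasets package and network access):
#
#   from datasets import load_dataset
#   d = load_with_revision_fallback(
#       load_dataset,
#       "nuprl/MultiPL-E",
#       "humaneval-lua",
#       "bf4f3c31a1e0a164b7886c9eb04f82534edf4ce9",
#       download_mode="force_redownload",
#   )
```

This only hides the problem on the caller's side. If the second attempt also fails, its exception propagates unchanged.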

Answer 1 · 2023-08-07T17:20:04.000Z

Odd. This is not intended. We'll take a look soon.