huggingface/audio-transformers-course

Wrong output on chapter4/fine-tuning

DavidFM43 opened this issue · 1 comments

Hi👋,
I think the output should be:

DatasetDict({
    train: Dataset({
        features: ['genre', 'input_values', 'attention_mask'],
        num_rows: 899
    })
    test: Dataset({
        features: ['genre', 'input_values', 'attention_mask'],
        num_rows: 100
    })
})

Instead of

gtzan_encoded
```
**Output:**
```out
DatasetDict({
train: Dataset({
features: ['genre', 'input_values'],
num_rows: 899
})
test: Dataset({
features: ['genre', 'input_values'],
num_rows: 100
})
})
```

Since return_attention_mask=True in the feature_extractor. Is this the case?

@DavidFM43 you're right. Perhaps you could directly open a PR. In case you do not open a PR for this particular issue, I'll be opening one for this file and will make sure to include this change along with the other code changes.