Deprecate data utils
Closed this issue · 3 comments
PyTorch core has download function and torch.hub.download_url_to_file
.
Torchaudio can use it for dataset download and does not need to maintain its own torchaudio.datasets.utils.download_url
.
In addition to that, there seems to be something wrong with torchaudio.datasets.utils.download_url
.
It fails to download multiple of huge files. I think we should get rid of torchaudio.datasets.utils.download_url
and torchaudio.datasets.utils.stream_url
.
Steps
Phase 1
- Replace the usages of
torchaudio.datasets.utils.download_url
intorchaudio
withtorch.hub.download_url_to_file
andtorchaudio.datasets.utils.validate_file
- Add deprecation warning to
torchaudio.datasets.utils.download_url
andtorchaudio.datasets.utils.stream_url
, saying it will be removed inv0.12
.
Phase 2
Once the v0.11
is released and the main branch is bumped to v0.12
, we can remove these functions.
Note
For setting up the development environment, please refer to CONTRIBUTING.md and torchaudio_unittest
I am working on this issue
@mthrok can I work on this issue?
Sorry for the late reply. The phase 1 is completed and we need to wait until the branch cut of v0.11 to perform the phase 2. I will update this once the branch cut happens.