This is a subset of the LibriTTS dataset that includes British speakers.
Speakers are sorted into libritts-english
, libritts-irish
, libritts-scottish
, and libritts-welsh
subsets.
Speakers were found using two resources, the LibriVox Accents Table and Ruth Golding's Blog, which both compile a list of British LibriVox audiobook speakers.
Please be aware that this dataset is likely not complete, and I make no promises of the regional accuracy.
Files are in a .tar.gz
archive, split into 1GB chunks. This is because GitHub's LFS service imposes size limits, preventing the dataset being uploaded in a single file.
Note: GitHub LFS requires the purchase of "data packs", so I'd advise against using it.
- Install git lfs if you do not have this installed.
- Run
git lfs install
to set up lfs for your user account. git clone https://github.com/OscarVanL/LibriTTS-British-Accents
The original LibriTTS dataset was published under CC BY 4.0 licensing. This gives me permission to share and adapt this dataset as long as I give attribution. You can do the same with this dataset.
This dataset is licensed with CC BY 4.0.