LibriTTS-British-Accents

A subset of the popular LibriTTS dataset, divided by English, Scottish, Welsh, and Irish accents.

Creative Commons Attribution 4.0 International (CC-BY-4.0)

LibriTTS-British

This is a subset of the LibriTTS dataset containing only its British speakers.

Speakers are sorted into libritts-english, libritts-irish, libritts-scottish, and libritts-welsh subsets.

Speakers were identified using two resources: the LibriVox Accents Table and Ruth Golding's Blog, both of which list British LibriVox audiobook speakers.

Please be aware that this dataset is likely incomplete, and I make no promises about the accuracy of the regional labels.

Files are in a .tar.gz archive, split into 1GB chunks. GitHub's LFS service imposes file size limits, which prevent the dataset from being uploaded as a single file. The chunks must be joined back together, in order, before the archive can be extracted.
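As a rough sketch of how the chunks could be rejoined and unpacked, the Python below concatenates them in name order and extracts the result. The function name, the glob pattern, and the directory names are hypothetical; this README does not state the chunk file names, so adjust the pattern to match what you actually downloaded. It also assumes the chunks are plain byte-wise splits with fixed-width suffixes, so that sorting by name gives the correct order.

```python
import shutil
import tarfile
from pathlib import Path


def reassemble_and_extract(chunk_dir, pattern, out_dir):
    """Join split archive chunks in name order, then extract the result.

    `pattern` is a glob such as "archive.tar.gz.*" (a hypothetical name;
    use whatever the downloaded chunks are really called).
    """
    chunk_dir, out_dir = Path(chunk_dir), Path(out_dir)

    # Assumption: suffixes are fixed-width (.00, .01 or .aa, .ab, ...),
    # so lexicographic order is the same as chunk order.
    chunks = sorted(chunk_dir.glob(pattern))
    if not chunks:
        raise FileNotFoundError(f"no chunks matching {pattern!r} in {chunk_dir}")

    out_dir.mkdir(parents=True, exist_ok=True)
    joined = out_dir / "joined.tar.gz"

    # Stream each chunk into one file so 1GB chunks are never held in memory.
    with open(joined, "wb") as dst:
        for chunk in chunks:
            with open(chunk, "rb") as src:
                shutil.copyfileobj(src, dst)

    # Unpack the rejoined archive, then remove the temporary joined file.
    with tarfile.open(joined, "r:gz") as archive:
        archive.extractall(out_dir)
    joined.unlink()
    return out_dir
```

For example, `reassemble_and_extract("downloads", "archive.tar.gz.*", "data")` would unpack everything into a `data` directory, assuming the chunks sit in `downloads` under that (hypothetical) name. The rejoined archive is written to disk temporarily, so allow free space for roughly twice the dataset size.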

Download Mirrors

Kaggle dataset

Kaggle direct download

GitHub LFS

Note: GitHub LFS requires the purchase of "data packs", so I'd advise against using it.

  1. Install Git LFS if you do not already have it.
  2. Run git lfs install to set up LFS for your user account.
  3. Run git clone https://github.com/OscarVanL/LibriTTS-British-Accents to download the repository.

License

The original LibriTTS dataset was published under the CC BY 4.0 license. This gives me permission to share and adapt the dataset as long as I give attribution. You can do the same with this dataset.

This dataset is licensed under CC BY 4.0.