/nst-google

Google's Cloud Speech-to-Text transcriptions of the NST Dataset

Creative Commons Zero v1.0 UniversalCC0-1.0

NST Transcriptions by Google

Google's Cloud Speech-to-Text transcriptions of the NST Dataset

This repository contains the testing part of the NST Acoustic database for Swedish (16 kHz).

Along with the original transcriptions, it also contains the predicted transcriptions as provided by Google's Cloud Speech-to-Text service. We hope that, by publishing this data to the public domain, other researchers can use this as a benchmark to evaluate their ASR systems against commercial services such as the one provided by Google.

The data can be found in the file nst-test.csv, which has the following columns:

  • wav_filename: path to the sound file in the NST Testing set
  • transcription: the original transcription provided in the NST Testing set
  • google_transcription: the transcription predicted by Google's Speech-to-text service

The file paths in the wav_filename column assumes that the testing part of the NST dataset has been extracted into a directory named test. All paths are relative to the directory containing the test directory.