Contains scripts for creating English and Assamese speech dataset.
Use Python3.6+
pip install -r requirements.txt
python -m aeneas.tools.execute_task <abs_path_to_audio_file> <abs_path_to_text_file> "tts=espeak-ng|os_task_file_format=json|is_text_type=plain|tts_path=/usr/bin/espeak-ng|task_language=asm|allow_unlisted_languages=True" syncmap.json -r="allow_unlisted_languages=True|tts=espeak-ng|tts_path=/usr/bin/espeak-ng" --verbose
python audio_chunk.py --audio-file <absolute_path_to_audio_file> --syncmap-file <absolute_path_to_syncmap_file>
Using finetuneas interface for fine tuning sync maps output.