/aws-transcribe-captioning-tools

Convert AWS Transcribe output into multiple caption formats.

Primary LanguagePythonMIT No AttributionMIT-0

AWS VOD Captioning using AWS Transcribe

Add subtitles to video with AWS machine learning services. Inlcuding AWS Polly, AWS Transcribe, and AWS Translate.

Overview

This repository contains code for VOD subtitle creation, described in the AWS blog post “Create video subtitles with translation using machine learning”.

Prerequisites

  • Set up an AWS account. (instructions)
  • Clone this repo.
  • The other requirements are listed in this (blog post)
  • Configure AWS CLI and a local credentials file. (instructions)

Getting Started

Head on over to this blog post to see the instructions to create captions with AWS Transcribe in the SRT format, create alternate language SRT files with AWS Translate, and use AWS Polly to create alternate language video files: https://aws.amazon.com/blogs/machine-learning/create-video-subtitles-with-translation-using-machine-learning/

More AWS Transcribe Tools for Video

If you just want to create an SRT or a VTT file, the tools directory contains Python code to convert AWS Transcribe JSON to an SRT or a VTT file. These files can be imported and used on web or desktop video players.

python srt.py output_file_from_transcribe.json output.srt
name description
srt.py Takes the JSON response from AWS Transcribe and converts to a captions.srt file
vtt.py Takes the JSON response from AWS Transcribe and converts to a captions.vtt file

License Summary

This sample code is made available under a modified MIT license.See the LICENSE file.