/transcripter

Create readable transcripts from YouTube caption files

Primary LanguageJavaScriptApache License 2.0Apache-2.0

Create readable transcripts from YouTube captions

This is a Node application that processes caption files from Google, Android and Chrome Developer channels to create readable transcripts.

The code could easily be adapted to work with caption files from other playlists.

Try it out at devsearch.me/search.


Installation and usage

  1. Clone or download the code.
  2. Add your SRT caption files to the input directory.
  3. From a terminal cd to the src directory and run node index.js, optionally setting flags (see below).
  4. Progress updates and errors will be logged to the console.
  5. When conversion is complete, view the results from index.html in the output directory, the directory used for GitHub Pages. This directory includes a CSS file and a JavaScript file to style the transcripts and enable interaction.

Input and output directories

  • When you clone the repo, the input and output directories contain sample files.
  • You can customize input and output directories — see flags below.

Error handling

Check for errors in error-log.txt.

Command line options

-a, --append   Append output to existing files in output directory
-c, --index    Create index page linking to standalone transcripts
-h, --help     Show help
-i, --input    Input directory, default is [_input_](src/input)
-o, --output   Output directory, default is [_output_](src/docs)
-s, --search   Create search index

Feedback, feature requests and bug reports

  • Please file an issue including input files where relevant.
  • See the TODO file for work in progress.

Known issues

Google Translate widget

This widget is no longer supported and the language selection popup is not laid out responsively.

Probably best to remove unless the layout can be fixed (others have tried!)


Please note that this is not an official Google product.