This is a Golang CLI tool for generating transcripts of video and audio files. It wraps FFmpeg and Whisper.
go run *.go [--format] <file>
--format
: Optional flag to trigger formatting the transcript into paragraphs using basic topic analysis with TF-IDF
go run *.go --format video.mp4
This command will:
- Convert
video.mp4
to a WAV - Transcribe the audio using Whisper
- Format the transcript into paragraphs using topic analysis with TF-IDF
- Save the formatted transcript to
formatted_transcript.txt
I'm still working on the best way to distribute this. For now, you can clone the repository, ensure you have the required dependencies installed, and update the WhisperPath
and ModelPath
constants in the code to match your system.
To build the CLI:
go build -o subber
Then move the binary to somewhere on your PATH (e.g. /usr/local/bin
).