transcription
There are 1238 repositories under transcription topic.
Zackriya-Solutions/meeting-minutes
A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) https://meetily.zackriya.com/ is meetly ai
BasedHardware/omi
AI wearables. Put it on, speak, transcribe, automatically
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
spotify/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
pluja/whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
bugbakery/audapolis
an editor for spoken-word audio with automatic transcription
juanmc2005/diart
A python package to build AI-powered real-time audio applications
azuwis/pianotrans
Simple GUI for ByteDance's Piano Transcription with Pedals
hardhackerlabs/book
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加入吧!
rishikanthc/Scriberr
Self-hosted AI audio transcription
YaoFANGUK/video-subtitle-generator
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
kaixxx/noScribe
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Saik0s/Whisperboard
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
aschmelyun/subvert
Generate subtitles, summaries, and chapters from videos in seconds
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
mayeaux/generate-subtitles
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
freedmand/textra
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
exPHAT/SwiftWhisper
🎤 The easiest way to transcribe audio in Swift
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
bbc/react-transcript-editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Dicklesworthstone/bulk_transcribe_youtube_videos_from_playlist
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
dsymbol/decipher
Effortlessly add AI-generated transcription subtitles to your videos
vilassn/whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
sveinbjornt/hear
Command line interface for the built-in speech recognition and transcription capabilities in macOS.
baxtree/subaligner
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Picovoice/leopard
On-device speech-to-text engine powered by deep learning
woheller69/whisperIME
Android Input Method Editor (IME) based on Whisper
OpenNewsLabs/autoEdit_2
Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
haydenbleasel/orate
The AI toolkit for speech.
bugbakery/transcribee
open source audio and video transcription software
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).