whisper
There are 1908 repositories under whisper topic.
ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
chidiwilliams/buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
niedev/RTranslator
Open source real-time translation app for Android that runs locally
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Zackriya-Solutions/meeting-minutes
A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) https://meetily.zackriya.com/ is meetly ai
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
leetcode-mafia/cheetah
Mac app for crushing tech interviews with AI
thewh1teagle/vibe
Transcribe on your own!
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
embarklabs/embark
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
Grt1228/chatgpt-java
ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java
n3d1117/chatgpt-telegram-bot
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
alexrudall/ruby-openai
OpenAI API + Ruby! 🤖❤️ GPT-5 & Realtime WebRTC compatible!
xenova/whisper-web
ML-powered speech recognition directly in your browser
betalgo/openai
.NET library for the OpenAI service API by Betalgo Ranul
buxuku/SmartSub
「妙幕」是一款跨平台客户端工具,可以批量为视频或者音频生成字幕文件,并支持对字幕进行翻译,支持百度、火山、openai、ollama、deepseek 等多家翻译
HeyWillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
SamurAIGPT/EmbedAI
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6
pluja/whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
chenyme/Chenyme-AAVT
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
jhj0517/Whisper-WebUI
A Web UI for easy subtitle using whisper model.
umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.