whisper

There are 1908 repositories under whisper topic.

  • whisper.cpp

    ggml-org/whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    Language:C++43.2k3391.8k4.7k
  • SYSTRAN/faster-whisper

    Faster Whisper transcription with CTranslate2

    Language:Python18.1k1448711.5k
  • m-bain/whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    Language:Python17.7k1578691.9k
  • chidiwilliams/buzz

    Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

    Language:Python15.3k925401.1k
  • modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Language:Python12.6k931.5k1.3k
  • PaddlePaddle/PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Language:Python12.2k1882k1.9k
  • niedev/RTranslator

    Open source real-time translation app for Android that runs locally

    Language:C++9.2k67119821
  • xorbitsai/inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

    Language:Python8.5k582.5k741
  • meeting-minutes

    Zackriya-Solutions/meeting-minutes

    A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) https://meetily.zackriya.com/ is meetly ai

    Language:C++7.5k2324575
  • argmaxinc/WhisperKit

    On-device Speech Recognition for Apple Silicon

    Language:Swift5k48195452
  • MahmoudAshraf97/whisper-diarization

    Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

    Language:Jupyter Notebook5k47254462
  • voice-pro

    abus-aikorea/voice-pro

    Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

    Language:Python4.8k3545412
  • wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    Language:Python4.8k931.1k1.2k
  • sanchit-gandhi/whisper-jax

    JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

    Language:Jupyter Notebook4.6k44185406
  • leetcode-mafia/cheetah

    Mac app for crushing tech interviews with AI

    Language:Swift4.2k3937303
  • thewh1teagle/vibe

    Transcribe on your own!

    Language:TypeScript4k40584243
  • huggingface/distil-whisper

    Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

    Language:Python3.9k67110341
  • embark

    embarklabs/embark

    Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

    Language:JavaScript3.8k132445489
  • chatgpt-java

    Grt1228/chatgpt-java

    ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

    Language:Java3.5k38234825
  • n3d1117/chatgpt-telegram-bot

    🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python

    Language:Python3.4k563691.6k
  • collabora/WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

    Language:Python3.4k41241463
  • ruby-openai

    alexrudall/ruby-openai

    OpenAI API + Ruby! 🤖❤️ GPT-5 & Realtime WebRTC compatible!

    Language:Ruby3.2k40156371
  • xenova/whisper-web

    ML-powered speech recognition directly in your browser

    Language:TypeScript3.1k2542383
  • openai

    betalgo/openai

    .NET library for the OpenAI service API by Betalgo Ranul

    Language:C#3k68208541
  • buxuku/SmartSub

    「妙幕」是一款跨平台客户端工具,可以批量为视频或者音频生成字幕文件,并支持对字幕进行翻译,支持百度、火山、openai、ollama、deepseek 等多家翻译

    Language:TypeScript2.9k17161207
  • HeyWillow/willow

    Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

    Language:C2.9k42163112
  • SamurAIGPT/EmbedAI

    An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks

    Language:JavaScript2.8k3569299
  • CheshireCC/faster-whisper-GUI

    faster_whisper GUI with PySide6

    Language:Python2.7k20274135
  • pluja/whishper

    Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

    Language:Svelte2.7k42136152
  • chenyme/Chenyme-AAVT

    这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。

    Language:Python2.6k1688217
  • linto-ai/whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Language:Python2.6k34161197
  • Purfview/whisper-standalone-win

    Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

  • speaches
  • jhj0517/Whisper-WebUI

    A Web UI for easy subtitle using whisper model.

    Language:Python2.4k21317339
  • LLPlayer

    umlx5h/LLPlayer

    The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!

    Language:C#2.1k1250108
  • m1guelpf/auto-subtitle

    Automatically generate and overlay subtitles for any video.

    Language:Python2k2385332