whisper
There are 1212 repositories under whisper topic.
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
chidiwilliams/buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
niedev/RTranslator
Open source real-time translation app for Android that runs locally
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
leetcode-mafia/cheetah
Mac app for crushing remote tech interviews with AI
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
embarklabs/embark
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Grt1228/chatgpt-java
ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java
n3d1117/chatgpt-telegram-bot
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
betalgo/openai
.NET library for the OpenAI service API by Betalgo Ranul
alexrudall/ruby-openai
OpenAI API + Ruby! 🤖❤️
SamurAIGPT/EmbedAI
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
toverainc/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
xenova/whisper-web
ML-powered speech recognition directly in your browser
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Chenyme/Chenyme-AAVT
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6
pluja/whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.
floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
aallam/openai-kotlin
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
jhj0517/Whisper-WebUI
A Web UI for easy subtitle using whisper model.
m1guelpf/yt-whisper
Using OpenAI's Whisper to automatically generate YouTube subtitles
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
abdeladim-s/subsai
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️