whisper

There are 1908 repositories under the whisper topic.

  • floneum

    Instant, controllable, local pre-trained AI models in Rust

    Language: Rust · Stars: 2k
  • OnnxStream

    Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK.

    Language: C++ · Stars: 2k
  • AudioNotes

    Quickly extract content from audio and video and organize it into a structured Markdown note.

    Language: Python · Stars: 1.9k
  • whisper-turbo

    Cross-Platform, GPU Accelerated Whisper 🏎️

    Language: TypeScript · Stars: 1.8k
  • openai-kotlin

    OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

    Language: Kotlin · Stars: 1.8k
  • subsai

    🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

    Language: Python · Stars: 1.6k
  • auto-subs

    Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.

    Language: TypeScript · Stars: 1.5k
  • yt-whisper

    Using OpenAI's Whisper to automatically generate YouTube subtitles

    Language: Python · Stars: 1.4k
  • Speech-AI-Forge

    🍦 Speech-AI-Forge is a project built around TTS generation models, implementing an API server and a Gradio-based WebUI.

    Language: Python · Stars: 1.3k
  • modelfusion

    The TypeScript library for building AI applications.

    Language: TypeScript · Stars: 1.3k
  • gp.nvim

    Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

    Language: Lua · Stars: 1.3k
  • whisper

    Whisper is a file-based time-series database format for Graphite.

    Language: Python · Stars: 1.3k
  • aura-voice

    Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

    Language: TypeScript · Stars: 1.2k
  • ai-dev-gallery

    An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

    Language: C# · Stars: 1.2k
  • Whisper-Finetune

    Fine-tune the Whisper speech recognition model, with support for training without timestamp data, training with timestamp data, and training without speech data. Accelerates inference and supports Web, Windows desktop, and Android deployment.

    Language: C · Stars: 1.1k
  • whisper-ctranslate2

    Whisper command line client compatible with original OpenAI client based on CTranslate2.

    Language: Python · Stars: 1.1k
  • truss

    The simplest way to serve AI/ML models in production

    Language: Python · Stars: 1.1k
  • kubeai

    AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

    Language: Go · Stars: 1.1k
  • video-subtitle-generator

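    Generate subtitles from video and audio as SRT files. No third-party API is required; audio-to-text conversion runs locally. A Transformer-based video subtitle generation framework. A GUI tool for generating subtitles from videos and producing SRT files.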

    Language: Python · Stars: 1.1k
  • AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

    Language: Python · Stars: 978
  • whisper-writer

    💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

    Language: Python · Stars: 924
  • Whisperboard

    The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

    Language: Swift · Stars: 912
  • whisper.api

    This project provides an API with user-level access support to transcribe speech to text using a fine-tuned and processed Whisper ASR model.

    Language: Python · Stars: 897
  • obs-localvocal

    OBS plugin for local speech recognition and captioning using AI

    Language: C++ · Stars: 894
  • transcriptionstream

    Turnkey, self-hosted, offline transcription and diarization service with LLM summaries

    Language: Python · Stars: 889
  • TwitchLib

    C# Twitch Chat, Whisper, API and PubSub Library. Allows for chatting, whispering, stream event subscription and channel/account modification. Supports everything that supports .NETStandard 2.0

  • subvert

    Generate subtitles, summaries, and chapters from videos in seconds

    Language: PHP · Stars: 844
  • OpenAI-Unity

    An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.

    Language: C# · Stars: 827
  • whisper-playground

    Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

    Language: Python · Stars: 817
  • go-carbon

    Golang implementation of Graphite/Carbon server with classic architecture: Agent -> Cache -> Persister

    Language: Go · Stars: 817
  • CrisperWhisper

    Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

    Language: Python · Stars: 814
  • VideoSubtitleGenerator

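    Batch-generate subtitle files for local videos, with optional translation of the subtitles into other languages. Cross-platform: supports Windows and macOS.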

    Language: JavaScript · Stars: 804
  • generate-subtitles

    Generate transcripts for audio and video content through a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads via yt-dlp integration

    Language: JavaScript · Stars: 801
  • june

    Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

    Language: Python · Stars: 785
  • use-whisper

    React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in

    Language: TypeScript · Stars: 778
  • whisper_mic

    A project that lets you use a microphone with OpenAI Whisper.

    Language: Python · Stars: 777