whisper-api
There are 61 repositories under whisper-api topic.
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
mouredev/tggenerator
Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)
themanyone/whisper_dictation
Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB of VRAM.
supershaneski/openai-whisper-talk
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.
ionic-bond/stream-translator-gpt
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
supershaneski/openai-whisper-api
A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic speech recognition (ASR) system, built using Next 13, the React framework
carloscdias/whisper-cpp-python
whisper.cpp bindings for python
arian0zen/QueryWhisperer
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Helltar/artific_intellig_bot
AI Telegram Bot, ChatGPT, Dalle2, Whisper, GPT-4 Vision, Stability AI
mochi-neko/Whisper-API-unity
A client library of OpenAI Whisper transcription and translation API for Unity.
FlyingFathead/TelegramBot-OpenAI-API
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
gabrielcdb/RealJarvis
A working Speech to Speech AI assistant that can interact with you, manage your system, and more!
didmar/whisper-api-server
Drop-in replacement for the OpenAI's Whisper API using the same API but running locally
shamspias/twilio-studio-gpt3-assistant
This repository provides a Flask app that processes voice messages recorded through Twilio or Twilio Studio, transcribes them using OpenAI's Whisper ASR, generates responses with GPT-3.5, and sends the replies as SMS using Twilio.
natehouk/flow-ai-hackathon-2023
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
allseeteam/whisperx-fastapi
WhisperX FastAPI integration
crazydevlegend/twspace-discord-stt
Discord bot that downloads and transcribes twitter space audio file
lliWcWill/liveTranslation_openai-whisper
Live translation tool utilizing OpenAI's Whisper model for real-time audio transcription/translation with BYOK OpenAI API key for your choice of language.
goktugcy/noteai
An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.
GURPREETKAURJETHRA/Youtube-Video-Transcribe-Summarizer-LLM-App
YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smoothly runs on CPU as Llama 2 model is in GGUF format loaded through Llama.cpp.
kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
ayushsoni1010/textify
🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.
interactive-applications/speech-to-clipboard
A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.
jacksparrow124/HM-GPT
Home Manager GPT is a text to speech chat gpt that can be used to control your entire house. ask verbal questions and get verbal answers from google speech recognition.
redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
AznIronMan/pyscribe
PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.
DamianB-BitFlipper/async-whisper
Asynchronously transcribe audio files split into chunks in parallel and intelligently join results, yielding nearly identical transcriptions to full audio transcriptions but in a fraction of the time.
VolkanSah/OpenAI-Text-to-Speech-Interface
Simple User Interface: Enter text and generate speech with a single click.
marcelpetrick/speech4excellence
Voice transcription prototype with openAI's Whisper and PyQt-UI and Excel output
MO7YW4NG/CYCU-iLearning-Video-Transcription
中原大學 iLearning 影片教材轉錄逐字稿
niqifan007/Openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能
sameemul-haque/TranscribeTool
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
vwkyc/ASSR
sentiment analysis on transcribed speech or text with multilingual capability