whisper-api

There are 61 repositories under whisper-api topic.

adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Language:Python5k 34 77418
mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
Language:Python702 20 54158
Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
Language:Jupyter Notebook253 6 532
mouredev/tggenerator
Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)
Language:Kotlin192 3 014
themanyone/whisper_dictation
Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB of VRAM.
Language:Python160 11 520
supershaneski/openai-whisper-talk
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.
Language:JavaScript139 7 635
ionic-bond/stream-translator-gpt
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
Language:Python115 1 519
supershaneski/openai-whisper-api
A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic speech recognition (ASR) system, built using Next 13, the React framework
Language:JavaScript74 3 623
carloscdias/whisper-cpp-python
whisper.cpp bindings for python
Language:Python70 6 1117
arian0zen/QueryWhisperer
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Language:JavaScript32 2 38
Helltar/artific_intellig_bot
AI Telegram Bot, ChatGPT, Dalle2, Whisper, GPT-4 Vision, Stability AI
Language:Kotlin22 4 34
mochi-neko/Whisper-API-unity
A client library of OpenAI Whisper transcription and translation API for Unity.
Language:C#19 1 01
FlyingFathead/TelegramBot-OpenAI-API
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
Language:Python15 2 13
gabrielcdb/RealJarvis
A working Speech to Speech AI assistant that can interact with you, manage your system, and more!
Language:Python12 3 10
didmar/whisper-api-server
Drop-in replacement for the OpenAI's Whisper API using the same API but running locally
Language:Python11 1 11
shamspias/twilio-studio-gpt3-assistant
This repository provides a Flask app that processes voice messages recorded through Twilio or Twilio Studio, transcribes them using OpenAI's Whisper ASR, generates responses with GPT-3.5, and sends the replies as SMS using Twilio.
Language:Python10 3 01
natehouk/flow-ai-hackathon-2023
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
Language:Python8 3 10
allseeteam/whisperx-fastapi
WhisperX FastAPI integration
Language:Python6 0 01
crazydevlegend/twspace-discord-stt
Discord bot that downloads and transcribes twitter space audio file
Language:Python6 1 10
lliWcWill/liveTranslation_openai-whisper
Live translation tool utilizing OpenAI's Whisper model for real-time audio transcription/translation with BYOK OpenAI API key for your choice of language.
Language:Python6 2 02
goktugcy/noteai
An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.
Language:TypeScript4 1 00
GURPREETKAURJETHRA/Youtube-Video-Transcribe-Summarizer-LLM-App
YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smoothly runs on CPU as Llama 2 model is in GGUF format loaded through Llama.cpp.
Language:Python4 1 03
kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
Language:Python4 2 00
ayushsoni1010/textify
🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.
Language:TypeScript3 2 01
interactive-applications/speech-to-clipboard
A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.
Language:Python3 1 00
jacksparrow124/HM-GPT
Home Manager GPT is a text to speech chat gpt that can be used to control your entire house. ask verbal questions and get verbal answers from google speech recognition.
Language:Python3 3 220
redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
Language:TypeScript3 1 20
AznIronMan/pyscribe
PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.
Language:Python2 1 00
DamianB-BitFlipper/async-whisper
Asynchronously transcribe audio files split into chunks in parallel and intelligently join results, yielding nearly identical transcriptions to full audio transcriptions but in a fraction of the time.
Language:Python2 2 00
Lord-Haji/ChatAudio
Language:Python2 0 01
VolkanSah/OpenAI-Text-to-Speech-Interface
Simple User Interface: Enter text and generate speech with a single click.
Language:HTML2 2 0
marcelpetrick/speech4excellence
Voice transcription prototype with openAI's Whisper and PyQt-UI and Excel output
Language:Python1 3 0
MO7YW4NG/CYCU-iLearning-Video-Transcription
中原大學 iLearning 影片教材轉錄逐字稿
Language:Jupyter Notebook1 1 01
niqifan007/Openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts（文字转语音）和stt（语音转文字）接口的gui界面，带有历史记录功能
Language:Python1
sameemul-haque/TranscribeTool
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
Language:Python1 1 00
vwkyc/ASSR
sentiment analysis on transcribed speech or text with multilingual capability
Language:JavaScript1 1 00

whisper-api

adithya-s-k/omniparse

mallorbc/whisper_mic

Carleslc/AudioToText

mouredev/tggenerator

themanyone/whisper_dictation

supershaneski/openai-whisper-talk

ionic-bond/stream-translator-gpt

supershaneski/openai-whisper-api

carloscdias/whisper-cpp-python

arian0zen/QueryWhisperer

Helltar/artific_intellig_bot

mochi-neko/Whisper-API-unity

FlyingFathead/TelegramBot-OpenAI-API

gabrielcdb/RealJarvis

didmar/whisper-api-server

shamspias/twilio-studio-gpt3-assistant

natehouk/flow-ai-hackathon-2023

allseeteam/whisperx-fastapi

crazydevlegend/twspace-discord-stt

lliWcWill/liveTranslation_openai-whisper

goktugcy/noteai

GURPREETKAURJETHRA/Youtube-Video-Transcribe-Summarizer-LLM-App

kristofferv98/VoiceProcessingToolkit

ayushsoni1010/textify

interactive-applications/speech-to-clipboard

jacksparrow124/HM-GPT

redocrepus/arkode

AznIronMan/pyscribe

DamianB-BitFlipper/async-whisper

Lord-Haji/ChatAudio

VolkanSah/OpenAI-Text-to-Speech-Interface

marcelpetrick/speech4excellence

MO7YW4NG/CYCU-iLearning-Video-Transcription

niqifan007/Openai-tts-stt-streamlit

sameemul-haque/TranscribeTool

vwkyc/ASSR