whisper-api

There are 61 repositories under the whisper-api topic.

  • adithya-s-k/omniparse

    Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

    Language: Python
  • mallorbc/whisper_mic

    Project that allows one to use a microphone with OpenAI Whisper.

    Language: Python
  • Carleslc/AudioToText

    Transcribe and translate audio to text using Whisper and DeepL.

    Language: Jupyter Notebook
  • mouredev/tggenerator

    AI-powered eSports logo generator (for academic purposes, during the Tenerife GG event)

    Language: Kotlin
  • themanyone/whisper_dictation

    Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB of VRAM.

    Language: Python
  • supershaneski/openai-whisper-talk

    openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a JavaScript framework based on Vue.js.

    Language: JavaScript
  • ionic-bond/stream-translator-gpt

    A stream-translator fork with VAD-based audio slicing and GPT / Gemini translation.

    Language: Python
  • supershaneski/openai-whisper-api

    A sample speech transcription app implementing the OpenAI Speech to Text API based on Whisper, an automatic speech recognition (ASR) system, built using Next 13, the React framework.

    Language: JavaScript
  • carloscdias/whisper-cpp-python

    whisper.cpp bindings for Python

    Language: Python
  • arian0zen/QueryWhisperer

    Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.

    Language: JavaScript
  • Helltar/artific_intellig_bot

    AI Telegram Bot, ChatGPT, Dalle2, Whisper, GPT-4 Vision, Stability AI

    Language: Kotlin
  • mochi-neko/Whisper-API-unity

    A client library of OpenAI Whisper transcription and translation API for Unity.

    Language: C#
  • FlyingFathead/TelegramBot-OpenAI-API

    A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API

    Language: Python
  • gabrielcdb/RealJarvis

    A working Speech to Speech AI assistant that can interact with you, manage your system, and more!

    Language: Python
  • didmar/whisper-api-server

    Drop-in replacement for OpenAI's Whisper API that exposes the same API but runs locally

    Language: Python
  • shamspias/twilio-studio-gpt3-assistant

    This repository provides a Flask app that processes voice messages recorded through Twilio or Twilio Studio, transcribes them using OpenAI's Whisper ASR, generates responses with GPT-3.5, and sends the replies as SMS using Twilio.

    Language: Python
  • natehouk/flow-ai-hackathon-2023

    YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023

    Language: Python
  • allseeteam/whisperx-fastapi

    WhisperX FastAPI integration

    Language: Python
  • crazydevlegend/twspace-discord-stt

    Discord bot that downloads and transcribes Twitter Space audio files

    Language: Python
  • lliWcWill/liveTranslation_openai-whisper

    Live translation tool that uses OpenAI's Whisper model for real-time audio transcription and translation into the language of your choice, with a bring-your-own-key (BYOK) OpenAI API key.

    Language: Python
  • goktugcy/noteai

    An AI-supported Node.js application that converts an audio file to text with Whisper and displays the result as a PDF.

    Language: TypeScript
  • GURPREETKAURJETHRA/Youtube-Video-Transcribe-Summarizer-LLM-App

    YouTube video summarization app built using open-source LLMs and frameworks such as Llama 2, Haystack, Whisper, and Streamlit. The app runs smoothly on CPU because the Llama 2 model is in GGUF format, loaded through Llama.cpp.

    Language: Python
  • kristofferv98/VoiceProcessingToolkit

    The VoiceProcessingToolkit is a suite for voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications.

    Language: Python
  • ayushsoni1010/textify

    🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.

    Language: TypeScript
  • interactive-applications/speech-to-clipboard

    A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.

    Language: Python
  • jacksparrow124/HM-GPT

    Home Manager GPT is a text-to-speech ChatGPT assistant that can be used to control your entire house. Ask verbal questions and get verbal answers using Google speech recognition.

    Language: Python
  • redocrepus/arkode

    Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT

    Language: TypeScript
  • AznIronMan/pyscribe

    PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.

    Language: Python
  • DamianB-BitFlipper/async-whisper

    Asynchronously transcribes audio files by splitting them into chunks, transcribing the chunks in parallel, and intelligently joining the results, yielding nearly identical transcriptions to full-audio transcription in a fraction of the time.

    Language: Python
  • Lord-Haji/ChatAudio

    Language: Python
  • VolkanSah/OpenAI-Text-to-Speech-Interface

    Simple User Interface: Enter text and generate speech with a single click.

    Language: HTML
  • marcelpetrick/speech4excellence

    Voice transcription prototype with OpenAI's Whisper, a PyQt UI, and Excel output

    Language: Python
  • MO7YW4NG/CYCU-iLearning-Video-Transcription

    Transcribes Chung Yuan Christian University (CYCU) iLearning video course materials into verbatim transcripts

    Language: Jupyter Notebook
  • niqifan007/Openai-tts-stt-streamlit

    A GUI for the OpenAI API's TTS (text-to-speech) and STT (speech-to-text) endpoints, built with Streamlit, with a history feature

    Language: Python
  • sameemul-haque/TranscribeTool

    📼 A Streamlit web interface designed to extract spoken words from video/audio files as text • Python, FFmpeg, Whisper, YT-DLP

    Language: Python
  • vwkyc/ASSR

    Sentiment analysis on transcribed speech or text, with multilingual capability

    Language: JavaScript