audio-ai

There are 14 repositories under the audio-ai topic.

EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
692 26 343
narcotic-sh/senko
Very fast, accurate speaker diarization
Language:Python9310
zebbern/no-cost-ai
A collection of no-cost AI websites offering models such as Claude 4 Sonnet/Opus, Grok 4, o3 Pro, and Gemini 2.5 Pro for free, and much more.
91 1 010
kyegomez/AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"
Language:Python40 5 31
serp-ai/ai-text-to-audio-latent-diffusion
text-to-audio-latent-diffusion
Language:Python37 6 18
ksasso1028/audio-reverb-removal
Code to train a custom time-domain autoencoder to dereverb audio
Language:Python16 1 12
aaivu/KuralNet
A deep learning-based Speech Emotion Recognition (SER) model trained primarily on Indian languages. Designed for applications in call centers, sentiment analysis, and accessibility tools.
Language:Python7
domenicostefani/elk-audio-AI-tutorial
Guide to deploying neural networks in VST plugins, with a specific focus on embedded devices using the Elk Audio OS
Language:Jupyter Notebook6 1 01
SoheilGtex/Voice-Cloning-SV2TTS-
Safe, production-ready starter for voice cloning via SV2TTS (RTVC wrapper). CLI, tests, Docker, CI, pre-commit. No model weights included.
Language:Python5
saoud30/Audio-AI
🗣️ Audio AI: Your Audio & Video Transcription Powerhouse!
Language:Python31
open-v2ai/podcast-ai
Whether it’s text or a link, it can be turned into a podcast!
Language:TypeScript1 1 00
engasd999/senko
⚡ Accelerate speaker diarization with Senko, processing 1 hour of audio in just 5 seconds on powerful hardware—boost your audio analysis efficiency.
Language:Python
hari7261/AgentPodcast-AI
PodcastAgent uses advanced text-to-speech technology to create natural-sounding multi-speaker podcasts from any written content.
Language:Python
SzymiczeQ/zanshin
🎧 Navigate audio content effortlessly with Zanshin, a media player that enhances your listening experience by speaker, supporting both YouTube and local files.
Language:Svelte
