alvesfelipe's Stars
ijl/orjson
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
SonyCSLParis/music-inpainting-ts
A collection of web interfaces for AI-assisted interactive music creation
SonyCSLParis/pesto
Self-supervised learning for fast pitch estimation
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
tiangolo/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
tiangolo/full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
myshell-ai/OpenVoice
Instant voice cloning by MyShell.
stanfordnlp/dspy
DSPy: The framework for programming, not prompting, foundation models
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Vaibhavs10/insanely-fast-whisper
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
152334H/DL-Art-School
TorToiSe fine-tuning with DLAS
rebotnix/Tortoise-TTS-Training
Community framework for training tortoise
wazenmai/MIDI-BERT
This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.
yxlllc/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Speaker-Identification/You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
justinjohn0306/so-vits-svc-4.0-v2
SoftVC VITS Singing Voice Conversion