ranne's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
lllyasviel/Fooocus
Focus on prompting and generating
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
mckaywrigley/chatbot-ui
Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
visenger/awesome-mlops
A curated list of references for MLOps
espnet/espnet
End-to-End Speech Processing Toolkit
rtqichen/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
kohya-ss/sd-scripts
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
NVIDIA/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
juncongmoo/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
Link-AGI/AutoAgents
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
msurtsukov/neural-ode
Jupyter notebook with Pytorch implementation of Neural Ordinary Differential Equations
rochars/wavefile
Create, read and write wav files according to the specs. :star: :notes: :heart:
intflow/YOLOX_AUDIO
Audio event detection model based on YOLOX
pyannote/pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
regmi-saugat/66Days_MachineLearning
I am sharing my journey of 66DaysOfData in Machine Learning
AiTeRLab-GIST/GIST_ASD_DETECTION
Deep learning based autism spectral disorder detection from children voice
dipjyoti92/speaker_embeddings_GE2E
PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification
SMART-TTS/SMART-Multi-Speaker-Style-TTS
Multi-speaker & Multi-style TTS
wrtnio/openai-function-schema
OpenAI Function Call Schema Composer and Executor from OpenAPI (Swagger) Document.
juanmc2005/SimilarityLearning
Similarity Learning applied to Speaker Verification and Semantic Textual Similarity
AiTeRLab-GIST/GC4_track2_violence_detection_GIST