Pinned Repositories
llm-app-stack
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
GPT-SoVITS
One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
ai-agent-roadmap
Explore the latest AI agent frameworks!
ai-audio-datasets
AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
ai-game-devtools
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
audio-ai-agent
Here we will track the latest audio AI agents, covering speech, music, sound effects, etc.
audio-development-tools
A list of sound, audio, and music development tools covering machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis, and more.
ComfyUI-Tools-Roadmap
Here we will track the latest development tools for ComfyUI, including Image, Mesh, Texture, Animation, Video, Audio, 3D Model, and more! 🔥
SouPyX
SouPyX: An Audio Exploration Space. 🪐
Yuan-ManX's Repositories
Yuan-ManX/ai-audio-datasets
AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
Yuan-ManX/ai-game-devtools
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
Yuan-ManX/ai-agent-roadmap
Explore the latest AI agent frameworks!
Yuan-ManX/ComfyUI-Tools-Roadmap
Here we will track the latest development tools for ComfyUI, including Image, Mesh, Texture, Animation, Video, Audio, 3D Model, and more! 🔥
Yuan-ManX/Yuan-ManX
Yuan-ManX/01
The open-source language model computer
Yuan-ManX/ai-multimodal-timeline
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥
Yuan-ManX/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Yuan-ManX/dataform
DataForm: Data Transform 🐼
Yuan-ManX/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Yuan-ManX/ImageBind
ImageBind: One Embedding Space to Bind Them All
Yuan-ManX/llm-app-stack
Yuan-ManX/Omost
Your image is almost there!
Yuan-ManX/AI-RPi-detection
AI Raspberry Pi cat detection and notification: get a text when your cat does something it's not supposed to do, and have AI narrate what it sees. Generalizes to use cases beyond cats.
Yuan-ManX/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Yuan-ManX/anime.gf
Local & Open Source Alternative to CharacterAI
Yuan-ManX/ComfyUI-Manager
Yuan-ManX/ComfyUI-SoundHub
Yuan-ManX/ComfyUI_examples
Examples of ComfyUI workflows
Yuan-ManX/everything-ai
Introducing everything-ai, your multi-task, AI-powered and local assistant! 🤖
Yuan-ManX/friendly-stable-audio-tools
Refactored and updated version of `stable-audio-tools`, the open-source code for audio/music generative models originally by Stability AI.
Yuan-ManX/gigax
LLM-powered NPCs running on your hardware
Yuan-ManX/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Yuan-ManX/multi-modal-starter-kit
Multi-modal starter kit for AI video understanding and narration. Works with Ollama (Llava, bakllava), GPT-4v
Yuan-ManX/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Yuan-ManX/phidata
Add memory, knowledge and tools to LLMs
Yuan-ManX/ragoon
Improve large language model (LLM) retrieval using dynamic web search, based on blazingly fast query generation from Groq chips ⚡
Yuan-ManX/Scrapegraph-ai
Python scraper based on AI
Yuan-ManX/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Yuan-ManX/XTTSv2
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production