TheMindExpansionNetwork

TheMindExpansionNetwork's Stars

Vchitect/FasterCache
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
Language:Python1868
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language: Python · Stars: 2.4k · Forks: 181
tylerprogramming/ai
This repository will have different projects using AutoGen and Tutorials
Language:Python467152
fastapi/full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
Language: TypeScript · Stars: 28.7k · Forks: 5.2k
haoyuhsu/autovfx
Official code for "AutoVFX: Physically Realistic Video Editing from Natural Language Instructions."
Language:Jupyter Notebook24819
xdit-project/mochi-xdit
Faster parallel inference of the mochi-1 video generation model
Language:Python986
strnad/CrewAI-Studio
A user-friendly, multi-platform GUI for managing and running CrewAI agents and tasks. Supports Conda and virtual environments, no coding needed.
Language:Python461109
matatonic/openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
Language:Python55780
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
Language: JavaScript · Stars: 9.6k · Forks: 1.8k
t41372/Open-LLM-VTuber
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
Language: Python · Stars: 1.7k · Forks: 161
moonlight-stream/moonlight-docs
Moonlight Documentation
Stars: 1.3k · Forks: 78
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Language:Python95465
AlekseyKorshuk/role-play-synthetic
Synthetic Role-Play Conversation Dataset Generation
Language:Python405
ydrive/EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
Language:C++18230
edwko/OuteTTS
Interface for OuteTTS models.
Language:Python78661
attashe/ComfyUI-FluxRegionAttention
Implement Region Attention for Flux model
Language:Python923
1038lab/ComfyUI-OmniGen
ComfyUI-OmniGen - A ComfyUI custom node implementation of OmniGen, a powerful text-to-image generation and editing model.
Language:Python15716
EnVision-Research/LucidFusion
Official implementation of “LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images”
Language:Python603
bytedance/X-Portrait
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
Language:Python43035
fffiloni/X-Portrait
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
1
HelloVision/HelloMeme
The official HelloMeme GitHub site
Language:Python53337
HelloVision/ComfyUI_HelloMeme
Official ComfyUI repository of HelloMeme
Language:Python31521
codions/docker-stream-server
Docker image for video streaming server that supports RTMP, HLS, and DASH streams.
Language:XSLT286
joshpocock/kestra
Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
Language:Java2811
GameGen-X/GameGen-X
2193
Tencent/Hunyuan3D-1
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Language: Python · Stars: 2.5k · Forks: 187
ShiromiyaG/RVC-AI-Cover-Maker-UI
Performs the entire AI cover generation process with UI
Language:Python102
Ayushanbhore/Unreal-Quest3-SceneSample
A sample to create your own Unreal Scene Project for Meta Quest 3 and Meta Quest Pro. Works with Meta Quest 2, but that is not recommended. Use it to learn Unreal Engine.
163
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Language:Python867106
OS-Copilot/OS-Atlas
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
2137