TheMindExpansionNetwork's Stars
Vchitect/FasterCache
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
tylerprogramming/ai
This repository will have different projects using AutoGen and Tutorials
fastapi/full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
haoyuhsu/autovfx
Offical codes for "AutoVFX: Physically Realistic Video Editing from Natural Language Instructions."
xdit-project/mochi-xdit
faster parallel inference of mochi-1 video generation model
strnad/CrewAI-Studio
A user-friendly, multi-platform GUI for managing and running CrewAI agents and tasks. Supports Conda and virtual environments, no coding needed.
matatonic/openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
t41372/Open-LLM-VTuber
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
moonlight-stream/moonlight-docs
Moonlight Documentation
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
AlekseyKorshuk/role-play-synthetic
Synthetic Role-Play Conversation Dataset Generation
ydrive/EasySynth
Unreal Engine plugin for easy creation of synthetic image datasets
edwko/OuteTTS
Interface for OuteTTS models.
attashe/ComfyUI-FluxRegionAttention
Implement Region Attention for Flux model
1038lab/ComfyUI-OmniGen
ComfyUI-OmniGen - A ComfyUI custom node implementation of OmniGen, a powerful text-to-image generation and editing model.
EnVision-Research/LucidFusion
Official implementation of “LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images”
bytedance/X-Portrait
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
fffiloni/X-Portrait
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
HelloVision/HelloMeme
The official HelloMeme GitHub site
HelloVision/ComfyUI_HelloMeme
Official comfyui repository of Hellomeme
codions/docker-stream-server
Docker image for video streaming server that supports RTMP, HLS, and DASH streams.
joshpocock/kestra
:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
GameGen-X/GameGen-X
Tencent/Hunyuan3D-1
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
ShiromiyaG/RVC-AI-Cover-Maker-UI
Performs the entire AI cover generation process with UI
Ayushanbhore/Unreal-Quest3-SceneSample
A sample to create your own Unreal Scene Project for Meta Quest 3, Meta Quest Pro. Works with Meta Quest 2 but it is not recommended. Use it to learn Unreal Engine
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
OS-Copilot/OS-Atlas
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents