fungkingfai's Stars
godotengine/godot
Godot Engine – Multi-platform 2D and 3D game engine
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
invoke-ai/InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
joaomdmoura/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Stability-AI/StableCascade
Official Code for Stable Cascade
threestudio-project/threestudio
A unified framework for 3D content generation.
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
xxlong0/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
collabora/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
NUS-HPC-AI-Lab/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
wenquanlu/HandRefiner
Human3DAIGC/Make-A-Character
Official repo for Make-A-Character: High Quality Text-to-3D Character Generation within Minutes
ArweaveTeam/SmartWeave
Simple, scalable smart contracts on the Arweave protocol.
magicvideov2/magicvideov2.github.io