lliang2003's Stars
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
danielmiessler/fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
dataease/dataease
🔥 人人可用的开源 BI 工具,Tableau、帆软的开源替代。
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Moonvy/OpenPromptStudio
🥣 AIGC 提示词可视化编辑器 | OPS | Open Prompt Studio
andrewyng/translation-agent
GuijiAI/duix.ai
PetoiCamp/OpenCat
An open source quadruped robot pet framework for developing Boston Dynamics-style four-legged robots that are perfect for STEM, coding & robotics education, IoT robotics applications, AI-enhanced robotics application services, research, and DIY robotics kit development.
PeterH0323/Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端🗝️、Docker-compose 打包部署🐋
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
landing-ai/vision-agent
Vision agent
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
alipay/agentUniverse
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
megvii-research/megactor
mezbaul-h/june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
MarkFzp/humanplus
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
yossTheDev/removerized
🖼️ Effortlessly Remove Image Backgrounds with AI - 🆓 Free & Limitless with 🛩️ Offline Support
6drf21e/ChatTTS_Speaker
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
THUDM/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Bklieger/groqnotes
GroqNotes: Generate organized notes from audio using Groq, Whisper, and Llama3
SamKhoze/ComfyUI-DeepFuze
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, and voice cloning.
VisActor/VMind
Not only automatic, but also intelligent. An Intelligent data Visualization System, based on LLM.
trigaten/Prompt_Systematic_Review
BigWhiteFox/EssayAssistant
it-ebooks-0/aigc-books
:books: 暂存AIGC相关书籍
agicto/chatcto-next-web
AGICTO && DevAGI next chat 版本