lliang2003

lliang2003's Stars

microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language:Jupyter Notebook64.5k 552 12832.9k
danielmiessler/fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
Language:Go24.2k 322 4992.6k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.1k 186 4902.1k
dataease/dataease
🔥 An open-source BI tool anyone can use; an open-source alternative to Tableau and FanRuan.
Language:Java17.8k 164 5.2k3.2k
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python9.4k 651 1451.3k
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Language:Python6.5k 57 151588
Moonvy/OpenPromptStudio
🥣 Visual editor for AIGC prompts | OPS | Open Prompt Studio
Language:Vue6k 46 90706
andrewyng/translation-agent
Language:Python4.7k 52 15540
GuijiAI/duix.ai
Language:C++4.5k 214 41648
PetoiCamp/OpenCat
An open source quadruped robot pet framework for developing Boston Dynamics-style four-legged robots that are perfect for STEM, coding & robotics education, IoT robotics applications, AI-enhanced robotics application services, research, and DIY robotics kit development.
Language:C++3.6k 87 37434
PeterH0323/Streamer-Sales
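Streamer-Sales (销冠, "Sales Champion"): a sales-streamer LLM 🛒🎁 that presents products based on their given features, with the aim of sparking viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital-human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a Vue-based front end 🍍, a FastAPI back end 🗝️, and Docker-compose packaging and deployment 🐋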
Language:Python2.5k 39 28372
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python1.8k 26 46110
landing-ai/vision-agent
Vision agent
Language:Python1.3k 19 12131
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Language:Python1.3k 32 3644
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Language:Python1.1k 14 2378
alipay/agentUniverse
agentUniverse is an LLM multi-agent framework that allows developers to easily build multi-agent applications.
Language:Python846 13 26107
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language:Python822 10 9656
megvii-research/megactor
Language:Python749 38 26102
mezbaul-h/june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
Language:Python707 6 843
MarkFzp/humanplus
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
Language:Python554 13 092
yossTheDev/removerized
🖼️ Effortlessly Remove Image Backgrounds with AI - 🆓 Free & Limitless with 🛩️ Offline Support
Language:TypeScript525 5 464
6drf21e/ChatTTS_Speaker
ChatTTS 2K Speaker Stability Score 🥇 & Categorized by Gender and Age 👧 & Audio Preview 🔈
Language:Python512 6 1029
THUDM/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Language:Python372 21 2718
Bklieger/groqnotes
GroqNotes: Generate organized notes from audio using Groq, Whisper, and Llama3
Language:Python367 3 776
SamKhoze/ComfyUI-DeepFuze
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, and voice cloning.
Language:Python303 2 5034
VisActor/VMind
Not only automatic, but also intelligent. An Intelligent data Visualization System, based on LLM.
Language:TypeScript179 12 6416
trigaten/Prompt_Systematic_Review
Language:HTML167 5 6115
BigWhiteFox/EssayAssistant
Language:Python50 3 18
it-ebooks-0/aigc-books
:books: Temporary storage for AIGC-related books
21 3 06
agicto/chatcto-next-web
AGICTO && DevAGI next chat version
Language:TypeScript4 2 00