xiaobaiha's Stars
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Vision / TTS / Plugins / Artifacts). One-click FREE deployment of your private ChatGPT / Claude application.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
chatanywhere/GPT_API_free
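Free ChatGPT API key. A free ChatGPT API with GPT-4 API support (also free): a free forwarding API for ChatGPT that is usable within mainland China over a direct connection, with no proxy required. It works with software and plugins such as ChatBox, greatly reducing API usage costs, and allows unrestricted chatting from within China.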
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
facefusion/facefusion
Industry leading face manipulation platform
bleedline/aimoneyhunter
The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for side projects and earn extra income. Check out the English version for more insights.
netease-youdao/QAnything
Question and Answer based on Anything.
guoyww/AnimateDiff
Official implementation of AnimateDiff.
InstantID/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
microsoft/UFO
A UI-Focused Agent for Windows OS Interaction.
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
microsoft/TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Hillobar/Rope
GUI-focused roop
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets.
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
AIGCDesignGroup/ReplaceAnything
ddupont808/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
Weixin-Liang/LLM-scientific-feedback
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
Lightricks/LongAnimateDiff
OpenDFM/MULTI-Benchmark
MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images