MichaelYONGHENG's Stars
microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
corbt/agent.exe
songquanpeng/one-api
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Open-Source-O1/Open-O1
THUDM/CogView3
text to image to generation: CogView3-Plus and CogView3(ECCV 2024)
NginxProxyManager/nginx-proxy-manager
Docker container for managing Nginx proxy hosts with a simple, powerful interface
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
apple/ml-mgie
taichengguo/LLM_MultiAgents_Survey_Papers
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
OthersideAI/self-operating-computer
A framework to enable multimodal models to operate a computer.
MichaelYONGHENG/michael_first_repo
witcherofresearch/Forgedit
ddupont808/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
LLaVA-VL/LLaVA-Interactive-Demo
LLaVA-Interactive-Demo
zhile-io/pandora
潘多拉,一个让你呼吸顺畅的ChatGPT。Pandora, a ChatGPT client that lets you breathe freely.
awesome-selfhosted/awesome-selfhosted
A list of Free Software network services and web applications which can be hosted on your own servers
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
meta-llama/codellama
Inference code for CodeLlama models
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
wgwang/awesome-LLMs-In-China
**大模型