Weiyun1025's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
lllyasviel/ControlNet
Let us control diffusion models!
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
mlfoundations/open_clip
An open source implementation of CLIP.
huggingface/trl
Train transformer language models with reinforcement learning.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
microsoft/AI-System
System for AI Education Resource.
tlkh/asitop
Perf monitoring CLI tool for Apple Silicon
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
noamgat/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
huggingface/llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
OpenGVLab/all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"
Luodian/RelateAnything
Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
Jingkang50/OpenPSG
Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22
xlang-ai/xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
LLaMafia/llamafia.github
Zeqiang-Lai/Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
baaivision/CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
X2FD/LVIS-INSTRUCT4V
xlang-ai/text2reward
[ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"
Maxlinn/CHAIR-metric-standalone
CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation.
xk-huang/Promptable-GRiT
Promptable GRiT: support inference with both automatic proposal generation and custom point/box prompts.