yuanrr
Ph.D student, focusing on image and video understanding, i.e., visual question answering, video question answering, etc.
yuanrr's Stars
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
ztjhz/BetterChatGPT
An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
rustformers/llm
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
mymusise/ChatGLM-Tuning
基于ChatGLM-6B + LoRA的Fintune方案
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
LC1332/Luotuo-Chinese-LLM
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
cvlab-columbia/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
WangRongsheng/ChatGenTitle
🌟 ChatGenTitle:使用百万arXiv论文信息在LLaMA模型上进行微调的论文题目生成模型
facebookresearch/eai-vc
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
jiawen-zhu/ViPT
[CVPR23] Visual Prompt Multi-Modal Tracking
ylsung/VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
anosorae/IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
luogen1996/RepAdapter
Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".
sachit-menon/classify_by_description_release
Yushi-Hu/PromptCap
natual language guided image captioning
geekyutao/TaskRes
Task Residual for Tuning Vision-Language Models (CVPR 2023)
LAION-AI/General-GPT
gist-rs/book
Rust, Wasm, TLDR;
Zhiquan-Wen/D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
szzexpoi/POEM
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning"
ppj567/WSVOG_Causal_Intervention