KeiChiTse's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BradyFU/Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
LLaVA-VL/LLaVA-NeXT
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Pointcept/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
mini-sora/minisora
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
Jack-bo1220/Awesome-Remote-Sensing-Foundation-Models
agi-brain/xuance
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
zcablii/SARDet_100K
[NeurIPS 2024 spotlight] Official implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection
AlonzoLeeeooo/awesome-video-generation
A collection of awesome video generation studies.
ChenDelong1999/RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
bcmi/Awesome-Aesthetic-Evaluation-and-Cropping
Q-Future/Q-Instruct
②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.
AlonzoLeeeooo/awesome-image-inpainting-studies
A collection of awesome image inpainting studies.
Q-Future/Q-Bench-Video
A benchmark for video quality understanding of LMMs
XPixelGroup/DepictQA
DepictQA: Depicted Image Quality Assessment with Vision Language Models
lixinustc/KVQ-Challenge-CVPR-NTIRE2024
The first challenge on short-form video quality assessment
yuleiqin/fantastic-data-engineering
Fantastic Data Engineering for Large Language Models
qyp2000/XPSR
keithAND2020/awesome-Occupancy-research
Papers on occupancy, including monocular and multi-view methods in autonomous driving scenarios
ZhouKanglei/Awesome-AQA
Awesome Action Quality Assessment (AQA)
KeiChiTse/QPT-V2
[ACM MM 2024] QPT V2: An MIM-based pretraining framework for IQA, VQA, and IAA.