kunli-cs's Stars
wgwang/awesome-LLMs-In-China
Large language models in China
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
xusenlinzy/api-for-open-llm
OpenAI-style API for open large language models; use LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large models.
ThuCCSLab/Awesome-LM-SSP
A reading list for large model safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
OpenGVLab/VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
synbol/Awesome-Parameter-Efficient-Transfer-Learning
Collection of awesome parameter-efficient fine-tuning resources.
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
sming256/OpenTAD
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
whwu95/Text4Vis
[AAAI 2023 & IJCV] Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
swordlidev/Evaluation-Multimodal-LLMs-Survey
A Survey on Benchmarks of Multimodal Large Language Models
TimeMarker-LLM/TimeMarker
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
KAIST-VICLab/SkateFormer
[ECCV 2024] Official repository of SkateFormer
richard-peng-xia/CARES
[NeurIPS'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
lzw108/EmoLLMs
logan-zou/Tutorial_for_developing_LLM_application
A beginner-friendly course on developing LLM applications
francescotonini/object-aware-gaze-target-detection
Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)
materight/RepNet-pytorch
A PyTorch port with pre-trained weights of RepNet, from "Counting Out Time: Class Agnostic Video Repetition Counting in the Wild".
Stevetich/EventHallusion
EventHallusion: Diagnosing Event Hallucinations in Video LLMs
QiWang233/DailyDVS-200
[ECCV-2024] DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition
EnVision-Research/GPT4Affectivity
GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing
JackYFL/EmoLA
The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"
PhysiologicAILab/FactorizePhys
FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing [NeurIPS 2024]
franciscoliu/MLLMU-Bench
jasongief/LEAP
[2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
qklee-lz/ACMMM2024-MAC
Accepted at ACM MM 2024; also the Top 1 solution for Track 1 of the ACM MM 2024 Grand Challenge on Micro-Action Analysis.
tailofcat/DIPO
A Density-driven Iterative Prototype Optimization for Transductive Few-shot Learning
guoyang9/UNK-VQA
A VQA dataset that includes unanswerable questions [TPAMI 2024].
it-hao/SFAN
Spatial-Frequency Adaptive Remote Sensing Image Dehazing with Mixture of Experts
Rockwangyu/Plant-SAM
Plant Segmentation Anything Model: Towards Superior Leaf Segmentation in Agriculture Plants