Pinned Repositories
EVA
EVA Series: Visual Representation Fantasies from BAAI
ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
AC-EVAL
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)
atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).
HQGA
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
roland
VGT
Video Graph Transformer for Video Question Answering (ECCV'22)
Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
AC-EVAL
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)
PolarisHsu's Repositories
PolarisHsu/AC-EVAL
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)
PolarisHsu/atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).
PolarisHsu/HQGA
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
PolarisHsu/LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
PolarisHsu/roland
PolarisHsu/VGT
Video Graph Transformer for Video Question Answering (ECCV'22)