PolarisHsu's Stars
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Fafa-DL/Lhy_Machine_Learning
李宏毅2021/2022/2023春季机器学习课程课件及作业
amusi/AI-Job-Notes
AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)
wdndev/llm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
wshzd/Awesome-AIGC
AIGC资料汇总学习,持续更新......
haonan-li/CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
MILVLG/bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
TXH-mercury/VAST
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
52CV/ICCV-2023-Papers
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
thaolmk54/hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
jayleicn/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
microsoft/FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
bckim92/language-evaluation
:clipboard: Collection of evaluation code for natural language generation.
doc-doc/NExT-GQA
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
StanfordVL/atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).
wdrink/STTS
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
Spico197/random-luck
Automatically select the best random seed based on ancient Chinese I Ching. Good luck and best wishes !
marlin-codes/HTGN
PyTorch Implementation for "Discrete-time Temporal Network Embedding via Implicit Hierarchical Learning in Hyperbolic Space (KDD2021)"
iva-mzsun/MOSO
showlab/mist
DongqiFu/SDG
SDG: A Simplified and Dynamic Graph Neural Network, SIGIR 2021
afcedf/SOONet
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
sukrutrao/Model-Guidance
Code for the paper: Studying How to Efficiently and Effectively Guide Models with Explanations. ICCV 2023.
doc-doc/CoVGT
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
BaoBaoGitHub/Hungyi_Lee_Machine_Learning_2021
李宏毅机器学习2021笔记
yl3800/TranSTR
yuting-wei/AC-EVAL
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)