PolarisHsu

PolarisHsu's Stars

yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Language: Python 30.4k 625 1798.1k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10k 97 674976
Fafa-DL/Lhy_Machine_Learning
Slides and assignments for Hung-yi Lee's Spring 2021/2022/2023 Machine Learning courses
Language: Jupyter Notebook 6.3k 51 131.6k
amusi/AI-Job-Notes
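Job-hunting guide for AI algorithm positions (covering preparation strategies, a coding-problem practice guide, referrals, a list of AI companies, and other materials)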
5.3k 131 10641
wdndev/llm_interview_note
Notes mainly covering knowledge and interview questions for large language model (LLM) algorithm (application) engineers
Language: HTML 4.1k 18 6472
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language: Python 2.3k 30 163167
wshzd/Awesome-AIGC
A curated collection of AIGC learning resources, continuously updated...
759 8 090
haonan-li/CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
Language: Python 703 11 3757
MILVLG/bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
Language: Jupyter Notebook 294 2 9576
TXH-mercury/VAST
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Language: Jupyter Notebook 246 18 2717
52CV/ICCV-2023-Papers
243 6 412
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Language: Python 178 3 2722
thaolmk54/hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
Language: Python 131 7 1926
jayleicn/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
Language: Python 130 2 3014
microsoft/FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Language: Python 127 9 1611
bckim92/language-evaluation
Collection of evaluation code for natural language generation.
Language: Perl 119 6 317
doc-doc/NExT-GQA
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
Language: Python 60 1 81
StanfordVL/atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).
Language: Python 49 16 02
wdrink/STTS
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
Language: Python 45 4 23
Spico197/random-luck
Automatically select the best random seed based on the ancient Chinese I Ching. Good luck and best wishes!
Language: Python 44 4 16
marlin-codes/HTGN
PyTorch Implementation for "Discrete-time Temporal Network Embedding via Implicit Hierarchical Learning in Hyperbolic Space (KDD2021)"
Language: Python 43 2 24
iva-mzsun/MOSO
Language: Python 34 1 52
showlab/mist
Language: Jupyter Notebook 33 3 182
DongqiFu/SDG
SDG: A Simplified and Dynamic Graph Neural Network, SIGIR 2021
Language: Python 22 1 14
afcedf/SOONet
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
Language: Python 20 3 42
sukrutrao/Model-Guidance
Code for the paper: Studying How to Efficiently and Effectively Guide Models with Explanations. ICCV 2023.
Language: Python 18 2 22
doc-doc/CoVGT
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
Language: Python 17 2 101
BaoBaoGitHub/Hungyi_Lee_Machine_Learning_2021
Notes on Hung-yi Lee's Machine Learning 2021 course
13 1 00
yl3800/TranSTR
Language: Python 11 2 90
yuting-wei/AC-EVAL
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)
Language: Python 81