Ziyang412

Ph.D. student at UNC Chapel Hill

UNC Chapel Hill

Pinned Repositories

EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Language:Python708 9 4748
fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter Notebook0 0 00
FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Language:Python0 0 00
LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
Language:Python1 0 00
SeViLA
Self-Chained Image-Language Model for Video Localization and Question Answering
Language:Python1 0 01
ts2_net
[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Language:Python0 0 00
UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
Language:Python63 3 101
VideoTree
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
Language:Python103 1 113
X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
Language:Python0 0 00
ziyangw412.github.io
Language:Jupyter Notebook0 1 00

Ziyang412/VideoTree
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
Language:Python103 1 113
Ziyang412/UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
Language:Python63 3 101
Ziyang412/LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
Language:Python1 0 00
Ziyang412/SeViLA
Self-Chained Image-Language Model for Video Localization and Question Answering
Language:Python1 0 01
Ziyang412/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter Notebook0 0 00
Ziyang412/FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Language:Python0 0 00
Ziyang412/ts2_net
[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Language:Python0 0 00
Ziyang412/X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
Language:Python0 0 00
Ziyang412/ziyangw412.github.io
Language:Jupyter Notebook0 1 00