ycsun1972
PhD Student at Renmin University of China, vision+language
Renmin University of ChinaBeijing, China
Pinned Repositories
activitynet-qa
An VideoQA dataset based on the videos from ActivityNet
awesome-embodied-vision
Reading list for research topics in embodied vision
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
BriVL
Bridging Vision and Language Model
BriVL-BUA-applications
Bling's Object detection tool
CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
CollaborationDynamics
collaborative-experts
Video embeddings for retrieval with natural language queries
ScientificEvolution
ycsun1972's Repositories
ycsun1972/ScientificEvolution
ycsun1972/activitynet-qa
An VideoQA dataset based on the videos from ActivityNet
ycsun1972/awesome-embodied-vision
Reading list for research topics in embodied vision
ycsun1972/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
ycsun1972/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
ycsun1972/BriVL
Bridging Vision and Language Model
ycsun1972/BriVL-BUA-applications
Bling's Object detection tool
ycsun1972/CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
ycsun1972/CollaborationDynamics
ycsun1972/collaborative-experts
Video embeddings for retrieval with natural language queries
ycsun1972/DeepSpeedExamples
Example models using DeepSpeed
ycsun1972/detr
End-to-End Object Detection with Transformers
ycsun1972/FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
ycsun1972/MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
ycsun1972/merlot
MERLOT: Multimodal Neural Script Knowledge Models
ycsun1972/trl
Train transformer language models with reinforcement learning.
ycsun1972/VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
ycsun1972/vokenization
PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"
ycsun1972/ycsun1972
Config files for my GitHub profile.
ycsun1972/ycsun1972.github.io