jayleicn

Research Scientist @ Meta AI, vision+language.

Meta AISeattle

Pinned Repositories

animeGAN
A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing.
Language:Jupyter Notebook1.3k 44 9198
ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Language:Python692 9 5885
moment_detr
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
Language:Python242 10 5442
recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Language:Jupyter Notebook167 10 1126
scipy-lecture-notes-zh-CN
中文版scipy-lecture-notes. 网站下线, 以离线HTML的形式继续更新, 见release.
Language:Python410 43 5176
singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
Language:Python126 2 3013
TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
Language:Python84 6 211
TVQA
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
Language:Python160 9 1732
TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
Language:Python121 10 2324
TVRetrieval
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Language:Python149 8 1223

jayleicn's Repositories

jayleicn/animeGAN
A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing.
Language:Jupyter Notebook1.3k 44 9198
jayleicn/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Language:Python692 9 5885
jayleicn/moment_detr
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
Language:Python242 10 5442
jayleicn/recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Language:Jupyter Notebook167 10 1126
jayleicn/TVQA
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
Language:Python160 9 1732
jayleicn/TVRetrieval
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Language:Python149 8 1223
jayleicn/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
Language:Python126 2 3013
jayleicn/TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
Language:Python121 10 2324
jayleicn/TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
Language:Python84 6 211
jayleicn/VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Language:Python47 2 104
jayleicn/mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
Language:Python26 4 2
jayleicn/pytorch-pretrained-BERT
A copy from https://github.com/huggingface/pytorch-pretrained-BERT
Language:Jupyter Notebook1 3 03
jayleicn/video_feature_extractor
Easy to use video deep features extractor
Language:Python1 1 01
jayleicn/2D-TAN
AAAI‘20 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
Language:Python1 0
jayleicn/accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python1 0
jayleicn/ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Language:Python1 0
jayleicn/CLIP
Contrastive Language-Image Pretraining
Language:Jupyter Notebook1 0
jayleicn/coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Language:Python1 0
jayleicn/detr
End-to-End Object Detection with Transformers
Language:Python1 0
jayleicn/easyturk
Make quick mechanical turk HTML/Javascript interfaces and launch them with Python functions
Language:HTML1 0
jayleicn/HERO-1
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Language:Python1 0
jayleicn/info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
Language:Python1 0
jayleicn/just-ask
[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Language:Jupyter Notebook0 0
jayleicn/mmaction2-1
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Language:Python1 0
jayleicn/mmf-1
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python1 0
jayleicn/Oscar
Oscar and VinVL
jayleicn/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
1 0
jayleicn/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language:Python1 0
jayleicn/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
Language:Python1 0
jayleicn/YouCook2-Leaderboard
A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.