linjieli222's Stars
blobfile/blobfile
Read Google Cloud Storage, Azure Blobs, and local paths with the same interface
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Computer-Vision-in-the-Wild/Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
microsoft/FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
houqb/VisionPermutator
MLP-Like Vision Permutator for Visual Recognition (PyTorch)
VALUE-Leaderboard/EvaluationTools
Evaluation code and codalab submission examples for the VALUE benchmark.
rowanz/merlot
MERLOT: Multimodal Neural Script Knowledge Models
VALUE-Leaderboard/DataRelease
Data Release for VALUE Benchmark
jayleicn/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
linjieli222/HERO_Video_Feature_Extractor
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
LuoweiZhou/YouCook2-Leaderboard
A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.
jayleicn/VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
zhegan27/VILLA
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER adversarial training part
zhegan27/LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT adversarial training part
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
espnet/espnet
End-to-End Speech Processing Toolkit
airsplay/py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
linjieli222/HERO
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
jason718/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
lichengunc/detectron2
Detectron2 is FAIR's next-generation research platform for object detection and segmentation.
intersun/PKD-for-BERT-Model-Compression
pytorch implementation for Patient Knowledge Distillation for BERT Model Compression
linjieli222/VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"