video-representation-learning

There are 19 repositories under the video-representation-learning topic.

MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python 1.2k 16 118124
cvlab-columbia/hyperfuture
Code for the paper Learning the Predictability of the Future (CVPR 2021)
Language: Python 158 14 625
xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Language: Python 151 7 1018
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
96 7 02
ruiwang2021/mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Language: Python 92 6 99
GV1028/videogan
Implementation of "Generating Videos with Scene Dynamics" in TensorFlow
Language: Python 76 3 820
boschresearch/rince
Code accompanying the AAAI 2022 paper "Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives" (https://arxiv.org/abs/2201.11736). The method lets you use additional ranking information for representation learning.
Language: Python 24 7 13
xiaojieli0903/MaskAgain
Official repository of “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)
Language: Python 200
sunilhoho/VideoMS
Official PyTorch implementation of Efficient Video Representation Learning via Masked Video Modeling with Motion-centric Token Selection.
Language: Python 19 3 11
xiaojieli0903/FGKVMemPred_video
Official repository of "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
Language: Python 17 2 00
mondalanindya/MSQNet
Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
Language: Python 16 1 00
Video-MAC/VideoMAC
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
Language: Python 8 2 11
furushchev/chainervr
Chainer implementation of Networks for Learning Video Representations
Language: Python 7 1 01
UARK-AICV/Video_Representation
[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding
5 1 01
gimpong/AAAI24-GMMFormer
The code for the paper "GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval" (AAAI'24)
Language: Python 20
Mallory24/cae_dataset
The official repository for creating the causal action–effect (CAE) dataset for the IJCNLP-AACL 2023 paper: Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain
Language: Python 10
Mallory24/cae_modeling
The official repository for the IJCNLP-AACL 2023 paper: Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain
Language: Python 1 1 00
XFeiF/ComputerVision_PaperNotes
📚 Paper Notes (Computer vision)
1 2 440
mdnuruzzamanKALLOL/VideoMAE_Tensorflow
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python
