wangjk666
I'm a third-year Ph.D. student in the school of computer science at Fudan University, supervised by Prof. Zuxuan Wu and Prof. Yu-Gang Jiang.
Fudan UniversityShanghai
Pinned Repositories
Ask-Anything
ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
audioset-classification
Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection
Objectformer
OmniVid
PyDeepFakeDet
PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.
pytorchvideo
A deep learning library for video understanding research.
STTS
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
wangjk666's Repositories
wangjk666/PyDeepFakeDet
PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.
wangjk666/M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection
wangjk666/STTS
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
wangjk666/OmniVid
wangjk666/Objectformer
wangjk666/Ask-Anything
ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
wangjk666/audioset-classification
Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning
wangjk666/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
wangjk666/pytorchvideo
A deep learning library for video understanding research.
wangjk666/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".