wangjk666

I'm a third-year Ph.D. student in the school of computer science at Fudan University, supervised by Prof. Zuxuan Wu and Prof. Yu-Gang Jiang.

Fudan UniversityShanghai

Pinned Repositories

Ask-Anything
ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python00
audioset-classification
Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning
Language:Python00
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Python00
M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection
Language:Python94 4 1710
Objectformer
Language:Jupyter Notebook13 1 53
OmniVid
Language:Python28 4 22
PyDeepFakeDet
PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.
Language:Python96 7 1015
pytorchvideo
A deep learning library for video understanding research.
Language:Python00
STTS
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
Language:Python42 3 23
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
Language:Python0 1 00

wangjk666/PyDeepFakeDet
PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.
Language:Python96 7 1015
wangjk666/M2TR-Multi-modal-Multi-scale-Transformers-for-Deepfake-Detection
Language:Python94 4 1710
wangjk666/STTS
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
Language:Python42 3 23
wangjk666/OmniVid
Language:Python28 4 22
wangjk666/Objectformer
Language:Jupyter Notebook13 1 53
wangjk666/Ask-Anything
ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python00
wangjk666/audioset-classification
Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning
Language:Python00
wangjk666/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Python00
wangjk666/pytorchvideo
A deep learning library for video understanding research.
Language:Python00
wangjk666/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
Language:Python0 1 00