Pinned Repositories
face-alignment
:fire: 2D and 3D face alignment library built using PyTorch
video2dataset
Easily create large video datasets from video URLs
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
ACMMM2018CSAN.github.io
This web page is used for releasing the dataset.
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
RADAR-MM2022
ACM Multimedia 2022 - Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation
RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
Temporal-Language-Grounding-in-videos
Temporal Moment (Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
TSGVs-MM2023
ACM Multimedia 2023 - Temporal Sentence Grounding in Streaming Videos
video-ReTaKe
Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding
SCZwangxiao's Repositories
SCZwangxiao/Temporal-Language-Grounding-in-videos
Temporal Moment (Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
SCZwangxiao/RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
SCZwangxiao/TSGVs-MM2023
ACM Multimedia 2023 - Temporal Sentence Grounding in Streaming Videos
SCZwangxiao/RADAR-MM2022
ACM Multimedia 2022 - Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation
SCZwangxiao/ACMMM2018CSAN.github.io
This web page is used for releasing the dataset.
SCZwangxiao/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
SCZwangxiao/cayman
Cayman is a Jekyll theme for GitHub Pages
SCZwangxiao/CBLN
Code for CVPR 2021 paper: Context-aware Biaffine Localizing Network for Temporal Sentence Grounding
SCZwangxiao/CSMGAN
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
SCZwangxiao/GPUServerMonitor
SCZwangxiao/DEPICT
A multi-modal video caption dataset with richer annotations
SCZwangxiao/face-alignment
:fire: 2D and 3D face alignment library built using PyTorch
SCZwangxiao/insightface
State-of-the-art 2D and 3D Face Analysis Project
SCZwangxiao/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
SCZwangxiao/lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
SCZwangxiao/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
SCZwangxiao/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
SCZwangxiao/NGCF-pytorch
A toy implementation of Neural Graph Collaborative Filtering
SCZwangxiao/Numerical-methods
Implementations of what I learned in Numerical Analysis (in MATLAB)
SCZwangxiao/paddle_youtube
Video classification with the NeXtVLAD model, implemented using Baidu's Paddle framework.
SCZwangxiao/PIRender
The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"
SCZwangxiao/queue
SCZwangxiao/Rational-Design-of-NOT-gate-in-Tri-node-Enzyme-Regulatory-Networks
Rational Design of NOT-gate in Tri-node Enzyme Regulatory Networks
SCZwangxiao/SCZwangxiao.github.io
SCZwangxiao/sdubigdatacourse.github.io
SCZwangxiao/SDUDeepLearningCourse.github.io
SCZwangxiao/singularity
Official PyTorch code for Singularity model in the paper "Revealing Single Frame Bias for Video-and-Language Learning"
SCZwangxiao/TALL-1
TALL: Temporal Activity Localization via Language Query
SCZwangxiao/video-captioning-models-in-Pytorch
A PyTorch implementation of state-of-the-art video captioning models from 2015-2019 on the MSVD and MSRVTT datasets.
SCZwangxiao/video2dataset
Easily create large video datasets from video URLs