Pinned Repositories
ACAR-Net
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
ctr_prediction
conversion rate prediction of an online article
DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Ego-Exo4D-docs
Ego4d_TalkNet_ASD
epic-kitchens-100-annotations
Annotations for the public release of the EPIC-KITCHENS-100 dataset
FPGA-verilog-AES
a group programme
LocoNet
Adult Image Classification by a local-context aware network
LoCoNet_ASD
code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
SJTUwxz's Repositories
SJTUwxz/LoCoNet_ASD
code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
SJTUwxz/LocoNet
Adult Image Classification by a local-context aware network
SJTUwxz/ctr_prediction
conversion rate prediction of an online article
SJTUwxz/FPGA-verilog-AES
a group programme
SJTUwxz/ACAR-Net
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
SJTUwxz/active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
SJTUwxz/DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
SJTUwxz/Ego4d_TalkNet_ASD
SJTUwxz/epic-kitchens-100-annotations
Annotations for the public release of the EPIC-KITCHENS-100 dataset
SJTUwxz/hands-and-objects
SJTUwxz/models
Models and examples built with TensorFlow
SJTUwxz/LaViLa
Code release for "Learning Video Representations from Large Language Models"
SJTUwxz/ObjectStateChange
SJTUwxz/pic_classify
SJTUwxz/pic_classify2
picture classification
SJTUwxz/pytorch_face_landmark
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.
SJTUwxz/senet.pytorch
PyTorch implementation of SENet
SJTUwxz/SJTUwxz.github.io
SJTUwxz/splatter-image
Official implementation of "Splatter Image: Ultra-Fast Single-View 3D Reconstruction"
SJTUwxz/TalkNet_ASD
TalkNet: Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
SJTUwxz/vedatad
A single stage temporal action detection toolbox based on PyTorch
SJTUwxz/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
SJTUwxz/VTimeLLM_long
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".