SJTUwxz

@MicrosoftShanghai

Pinned Repositories

ACAR-Net
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
Language:Python0 1 00
active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
Language:Python0 1 00
ctr_prediction
conversion rate prediction of an online article
Language:Python1 2 00
DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Language:Python0 0 00
Ego-Exo4D-docs
Language:JavaScript00
Ego4d_TalkNet_ASD
Language:Python0 0 01
epic-kitchens-100-annotations
:plate_with_cutlery: Annotations for the public release of the EPIC-KITCHENS-100 dataset
Language:Python0 1 00
FPGA-verilog-AES
a group programme
1 2 01
LocoNet
Adult Image Classification by a local-context aware network
Language:Jupyter Notebook19 3 46
LoCoNet_ASD
code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
Language:Python23 1 44

SJTUwxz's Repositories

SJTUwxz/LoCoNet_ASD
code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
Language:Python23 1 44
SJTUwxz/LocoNet
Adult Image Classification by a local-context aware network
Language:Jupyter Notebook19 3 46
SJTUwxz/ctr_prediction
conversion rate prediction of an online article
Language:Python1 2 00
SJTUwxz/FPGA-verilog-AES
a group programme
1 2 01
SJTUwxz/ACAR-Net
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
Language:Python0 1 00
SJTUwxz/active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
Language:Python0 1 00
SJTUwxz/DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Language:Python0 0 00
SJTUwxz/Ego4d_TalkNet_ASD
Language:Python0 0 01
SJTUwxz/epic-kitchens-100-annotations
:plate_with_cutlery: Annotations for the public release of the EPIC-KITCHENS-100 dataset
Language:Python0 1 00
SJTUwxz/hands-and-objects
Language:C++0 0 00
SJTUwxz/models
Models and examples built with TensorFlow
Language:Python0 2 00
SJTUwxz/LaViLa
Code release for "Learning Video Representations from Large Language Models"
Language:Python0 0
SJTUwxz/ObjectStateChange
Language:Python0 0
SJTUwxz/pic_classify
2 0
SJTUwxz/pic_classify2
picture classification
2 0
SJTUwxz/pytorch_face_landmark
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.
Language:Python0 0
SJTUwxz/senet.pytorch
PyTorch implementation of SENet
Language:Python1 0
SJTUwxz/SJTUwxz.github.io
Language:CSS2 0
SJTUwxz/splatter-image
Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction'
Language:Python0 0
SJTUwxz/TalkNet_ASD
TalkNet: Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Language:Python1 0
SJTUwxz/vedatad
A single stage temporal action detection toolbox based on PyTorch
Language:Python1 0
SJTUwxz/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python0 0
SJTUwxz/VTimeLLM_long
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
Language:Python0 0

SJTUwxz

Pinned Repositories

ACAR-Net

active-speakers-context

ctr_prediction

DDM

Ego-Exo4D-docs

Ego4d_TalkNet_ASD

epic-kitchens-100-annotations

FPGA-verilog-AES

LocoNet

LoCoNet_ASD

SJTUwxz's Repositories

SJTUwxz/LoCoNet_ASD

SJTUwxz/LocoNet

SJTUwxz/ctr_prediction

SJTUwxz/FPGA-verilog-AES

SJTUwxz/ACAR-Net

SJTUwxz/active-speakers-context

SJTUwxz/DDM

SJTUwxz/Ego4d_TalkNet_ASD

SJTUwxz/epic-kitchens-100-annotations

SJTUwxz/hands-and-objects

SJTUwxz/models

SJTUwxz/LaViLa

SJTUwxz/ObjectStateChange

SJTUwxz/pic_classify

SJTUwxz/pic_classify2

SJTUwxz/pytorch_face_landmark

SJTUwxz/senet.pytorch

SJTUwxz/SJTUwxz.github.io

SJTUwxz/splatter-image

SJTUwxz/TalkNet_ASD

SJTUwxz/vedatad

SJTUwxz/VILA

SJTUwxz/VTimeLLM_long