Byron1201's Stars
ShusenTang/Dive-into-DL-PyTorch
This project reimplements the original MXNet code from the book "Dive into Deep Learning" in PyTorch.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
facebookresearch/moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Hello-SimpleAI/chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
fudan-zvg/SETR
[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
HarborYuan/ovsam
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
MCG-NJU/MixFormer
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
RetroCirce/HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
MengyangPu/EDTER
EDTER: Edge Detection with Transformer, in CVPR 2022
daishengdong/Games
Games developed in Python (Gomoku, Snake, Minesweeper, Tetris, Tank Battle, Flappy Bird)
ys-zong/awesome-self-supervised-multimodal-learning
[T-PAMI] A curated list of self-supervised multimodal learning resources.
facebookresearch/AVT
Code release for ICCV 2021 paper "Anticipative Video Transformer"
rowanz/merlot_reserve
Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"
zinengtang/TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
OpenGVLab/EgoVideo
[CVPR 2024 Champions] Solutions for EgoVis Challenges in CVPR 2024
ChinaYi/ASFormer
Official repo for BMVC2021 paper ASFormer: Transformer for action segmentation
YapengTian/AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
OpenGVLab/EgoExoLearn
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
Echo0125/MAT-Memory-and-Anticipation-Transformer
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
GenjiB/ECLIPSE
anpwu/ZJU-CS-ClassNotes
Chiaraplizz/ARGO1M-What-can-a-cook
WeiyanCai/EPnP_Python
bhwqy/pnp
My implementation of the PnP problem, including Gauss-Newton, DLT, and EPnP.