Pinned Repositories
autogluon
Fast and Accurate ML in 3 Lines of Code
deepOF
TensorFlow implementation for "Guided Optical Flow Learning"
GuidedNet
Caffe implementation for "Guided Optical Flow Learning"
Hidden-Two-Stream
Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"
paper-reading
深度学习经典、新论文逐段精读
two-stream-pytorch
PyTorch implementation of two-stream networks for video action recognition
Video-Tutorial-CVPR2020
A Comprehensive Tutorial on Video Modeling
gluon-cv
Gluon CV Toolkit
paper-reading
深度学习经典、新论文逐段精读
semantic-segmentation
Nvidia Semantic Segmentation monorepo
bryanyzhu's Repositories
bryanyzhu/two-stream-pytorch
PyTorch implementation of two-stream networks for video action recognition
bryanyzhu/Hidden-Two-Stream
Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"
bryanyzhu/Video-Tutorial-CVPR2020
A Comprehensive Tutorial on Video Modeling
bryanyzhu/GuidedNet
Caffe implementation for "Guided Optical Flow Learning"
bryanyzhu/deepOF
TensorFlow implementation for "Guided Optical Flow Learning"
bryanyzhu/paper-reading
深度学习经典、新论文逐段精读
bryanyzhu/autogluon
AutoGluon: AutoML for Image, Text, and Tabular Data
bryanyzhu/semantic-segmentation
Improving Semantic Segmentation via Video Propagation and Label Relaxation
bryanyzhu/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
bryanyzhu/tiny-ucf101
bryanyzhu/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
bryanyzhu/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
bryanyzhu/bark
🔊 Text-Prompted Generative Audio Model
bryanyzhu/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
bryanyzhu/blog
MXNet Blog in Chinese
bryanyzhu/CorrFlow
Self-supervised Learning for Video Correspondence Flow (BMVC 2019)
bryanyzhu/deit
Official DeiT repository
bryanyzhu/detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
bryanyzhu/Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
bryanyzhu/digital_video_introduction
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding).
bryanyzhu/gluon-cv
Gluon CV Toolkit
bryanyzhu/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
bryanyzhu/ResNeSt
ResNeSt: Split-Attention Network
bryanyzhu/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
bryanyzhu/web-data
The repo to host all the web data including images for documents in dmlc projects.