Pinned Repositories
3D-ResNets-PyTorch
3D ResNets for Action Recognition
AirFloat
Implementation of AirPlay audio (AirTunes) for iOS.
boston_housing
CapsNet
A PyTorch implementation of CapsNet based on Geoffrey Hinton's paper "Dynamic Routing Between Capsules"
CloudScheduler
CS161-Design-and-Analysis-of-Algorithms
Stanford CS161 course Fall 2017
cs231n-2017
My own solutions for Stanford CS231n (2017) assignments
Group4_workstation
code for Discriminative and Consistent Similarities in Instance-Level Multiple Instance Learning
improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
lintel
A Python module to decode video frames directly, using the FFmpeg C API.
vateye's Repositories
vateye/AirFloat
Implementation of AirPlay audio (AirTunes) for iOS.
vateye/CS161-Design-and-Analysis-of-Algorithms
Stanford CS161 course Fall 2017
vateye/3D-ResNets-PyTorch
3D ResNets for Action Recognition
vateye/boston_housing
vateye/CapsNet
A PyTorch implementation of CapsNet based on Geoffrey Hinton's paper "Dynamic Routing Between Capsules"
vateye/CloudScheduler
vateye/cs231n-2017
My own solutions for Stanford CS231n (2017) assignments
vateye/Group4_workstation
code for Discriminative and Consistent Similarities in Instance-Level Multiple Instance Learning
vateye/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
vateye/lintel
A Python module to decode video frames directly, using the FFmpeg C API.
vateye/Megatron-LM
Ongoing research training transformer models at scale
vateye/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
vateye/mPLUG-Owl
mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
vateye/PyTorch-Encoding
A CV toolkit for my papers.
vateye/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
vateye/titanic_survival_exploration
vateye/video-nonlocal-net
Non-local Neural Networks for Video Classification