tomztyang

The Chinese University of Hong Kong

tomztyang's Stars

huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 32.3k 312 9304.8k
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language: Python 14.8k 119 1k1.9k
HobbitLong/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
Language: Python 3.1k 20 135536
tensorflow/lingvo
Lingvo
Language: Python 2.8k 118 254446
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Language: Python 1.9k 11 156131
sjtuytc/UnboundedNeRFPytorch
State-of-the-art, simple, fast unbounded / large-scale NeRFs.
Language: Python 1.3k 45 122119
zju3dv/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
Language: Python 878 71 7150
dvlab-research/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Language: Python 733 8 6464
qqlu/Entity
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Language: Jupyter Notebook 700 23 4458
OpenDriveLab/Vista
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Language: Python 570 16 4342
dvlab-research/3DSSD
3DSSD: Point-based 3D Single Stage Object Detector (CVPR 2020)
Language: Python 380 14 5068
dvlab-research/FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
Language: Python 371 3 3535
ID-Animator/ID-Animator
Language: Python 356 24 1726
dvlab-research/DSGN
DSGN: Deep Stereo Geometry Network for 3D Object Detection (CVPR 2020)
Language: Python 326 23 2650
OpenDriveLab/TopoNet
Topology Reasoning for Scene Perception in Autonomous Driving
Language: Python 285 23 2112
OpenDriveLab/ViDAR
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
Language: Python 280 9 4218
open-mmlab/StyleShot
StyleShot: A SnapShot on Any Style. A model that can transfer any style to any content, generating high-quality personalized stylized images without per-image fine-tuning!
Language: Python 267 4 2516
tarashakhurana/4d-occ-forecasting
CVPR 2023: Official code for "Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting"
Language: Python 214 7 1723
dvlab-research/DeepVision3D
DeepVision3D is an open source toolbox for point-cloud understanding.
Language: Python 121 5 98
open-mmlab/AnyControl
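[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. An image generation model that accepts freely chosen user control signals and produces natural, harmonious results from multiple controls!
Language: Python 111 2 33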
tau-yihouxiang/WS_DAN
The official TensorFlow implementation of WS-DAN.
Language: Python 111 6 2228
OpenDriveLab/MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
Language: Python 90 3 31
CVMI-Lab/SPS-Conv
(NeurIPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection
Language: Python 62 4 56