tomztyang's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Includes train, eval, inference, and export scripts, plus pretrained weights: ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
HobbitLong/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
tensorflow/lingvo
Lingvo
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
sjtuytc/UnboundedNeRFPytorch
State-of-the-art, simple, fast unbounded / large-scale NeRFs.
zju3dv/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
dvlab-research/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
qqlu/Entity
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
OpenDriveLab/Vista
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
dvlab-research/3DSSD
3DSSD: Point-based 3D Single Stage Object Detector (CVPR 2020)
dvlab-research/FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
ID-Animator/ID-Animator
dvlab-research/DSGN
DSGN: Deep Stereo Geometry Network for 3D Object Detection (CVPR 2020)
OpenDriveLab/TopoNet
Topology Reasoning for Scene Perception in Autonomous Driving
OpenDriveLab/ViDAR
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
open-mmlab/StyleShot
StyleShot: A SnapShot on Any Style. A model that can transfer any style to any content, generating high-quality, personalized stylized images without per-image fine-tuning.
tarashakhurana/4d-occ-forecasting
CVPR 2023: Official code for "Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting"
dvlab-research/DeepVision3D
DeepVision3D is an open source toolbox for point-cloud understanding.
open-mmlab/AnyControl
[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals and generates natural, harmonious results from multiple controls.
tau-yihouxiang/WS_DAN
The official TensorFlow implementation of WS-DAN.
OpenDriveLab/MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
CVMI-Lab/SPS-Conv
(NeurIPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection