sherylwang

ICTpeking

sherylwang's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47.9k 308 6755.7k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python35.7k 345 2.9k4.1k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.4k 286 422.3k
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook9.3k 95 409834
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
7.4k 122 91380
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python6.9k 42 305700
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Language:Python3.6k 38 190407
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Language:Python2.2k 19 58138
traveller59/spconv
Spatial Sparse Convolution Library
Language:Python1.9k 24 698367
V2AI/Det3D
World's first general purpose 3D object detection codebse.
Language:Python1.5k 39 147299
NVIDIA-AI-IOT/Lidar_AI_Solution
A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD,).
Language:Python1.4k 19 289239
chaytonmin/Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
994 52 7108
megvii-research/PETR
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
Language:Python879 13 162132
Pointcept/PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
Language:Python841 14 11747
exiawsh/StreamPETR
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Language:Python596 12 24164
NVIDIA/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Language:Python594 15 10344
JeffWang987/OpenOccupancy
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
Language:Python593 13 5750
OpenDriveLab/OccNet
[ICCV 2023] OccNet: Scene as Occupancy
Language:Python573 16 4851
OpenGVLab/DCNv4
[CVPR 2024] Deformable Convolution v4
Language:Python529 8 8927
DerryHub/BEVFormer_tensorrt
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
Language:Python433 5 8171
PJLab-ADG/OpenPCSeg
OpenPCSeg: Open Source Point Cloud Segmentation Toolbox and Benchmark
Language:Python371 12 2636
MCG-NJU/SparseBEV
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
Language:Python363 8 9027
zhangyp15/OccFormer
[ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction
Language:Python331 9 2423
itsprakhar/Downstream-Dinov2
Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular depth estimation.
Language:Jupyter Notebook198 3 1113
zya3d/Awesome-3D-Occupancy-Prediction
Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
170 5 06
robot-learning-freiburg/PanopticBEV
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images. http://panoptic-bev.cs.uni-freiburg.de
Language:Python125 8 2322
ModelTC/Dipoorlet
Offline Quantization Tools for Deploy.
Language:Python116 14 1116
Fudan-ProjectTitan/OpenAnnotate3D
OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal Data
Language:Jupyter Notebook77 13 42
NVlabs/EfficientDL
Language:HTML30 5 12
swiss-ai-center/djl-image-sam-example
Djl interface adapter to SAM
Language:Python8 8 12

sherylwang

sherylwang's Stars

facebookresearch/segment-anything

microsoft/DeepSpeed

google-research/tuning_playbook

facebookresearch/dinov2

openlm-research/open_llama

IDEA-Research/GroundingDINO

OpenDriveLab/UniAD

fudan-zvg/Semantic-Segment-Anything

traveller59/spconv

V2AI/Det3D

NVIDIA-AI-IOT/Lidar_AI_Solution

chaytonmin/Awesome-BEV-Perception-Multi-Cameras

megvii-research/PETR

Pointcept/PointTransformerV3

exiawsh/StreamPETR

NVIDIA/TensorRT-Model-Optimizer

JeffWang987/OpenOccupancy

OpenDriveLab/OccNet

OpenGVLab/DCNv4

DerryHub/BEVFormer_tensorrt

PJLab-ADG/OpenPCSeg

MCG-NJU/SparseBEV

zhangyp15/OccFormer

itsprakhar/Downstream-Dinov2

zya3d/Awesome-3D-Occupancy-Prediction

robot-learning-freiburg/PanopticBEV

ModelTC/Dipoorlet

Fudan-ProjectTitan/OpenAnnotate3D

NVlabs/EfficientDL

swiss-ai-center/djl-image-sam-example