SheffieldCao

Computer Vision, Autonomous Driving

Tongji UnivShanghai, China

Pinned Repositories

mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
Language:Jupyter Notebook7.1k 96 7091.1k
mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language:Python30.7k 371 8.5k9.6k
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Language:Python8.8k 52 2.4k2.7k
BEVerse
The official repository for BEVerse
Language:Python0 0 00
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python0 0 00
BoYuanRoadLaneDetection
Language:Python00
CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
Language:Python0 0 00
clip-interrogator
Image to prompt with BLIP and CLIP
Language:Python0 0 00
Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Language:Python0 0 00

SheffieldCao's Repositories

SheffieldCao/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python0 0 00
SheffieldCao/CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
Language:Python0 0 00
SheffieldCao/clip-interrogator
Image to prompt with BLIP and CLIP
Language:Python0 0 00
SheffieldCao/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Language:Python0 0 00
SheffieldCao/DiffIR
This project is the official implementation of 'Diffir: Efficient diffusion model for image restoration', ICCV2023
Language:Jupyter Notebook0 0 00
SheffieldCao/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter Notebook0 0 00
SheffieldCao/Lite-Mono
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Language:Python0 0 00
SheffieldCao/mmdet-learning
Language:Python0 1 00
SheffieldCao/Far3D
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Language:Jupyter Notebook0 0
SheffieldCao/mmagic
OpenMMLab Image and Video Restoration, Editing and Generation Toolbox
Language:Jupyter Notebook0 0
SheffieldCao/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
Language:Python0 0
SheffieldCao/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Language:Python0 0
SheffieldCao/Multimodal-GPT
Multimodal-GPT
Language:Python0 0
SheffieldCao/Occ3D
Language:Python0 0
SheffieldCao/Occ3DBaseline
CVPR2023-Occupancy-Prediction-Challenge
Language:Python0 0
SheffieldCao/ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
Language:Python0 0
SheffieldCao/ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
Language:Jupyter Notebook0 0
SheffieldCao/OVO-Open-Vocabulary-Occupancy
Language:Python0 0
SheffieldCao/PolarFormer
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
Language:Python0 0
SheffieldCao/SAN
Open-vocabulary Semantic Segmentation
Language:Python0 0
SheffieldCao/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Language:Python0 0
SheffieldCao/sheffield.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript0 0
SheffieldCao/SheffieldCao
Config files for my GitHub profile.
1 0
SheffieldCao/stable-dreamfusion
A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.
Language:Python0 0
SheffieldCao/SurroundOcc
Multi-camera 3D Occupancy Prediction for Autonomous Driving
Language:Python0 0
SheffieldCao/UniAD
[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving
Language:Python0 0
SheffieldCao/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Language:Python0 0
SheffieldCao/VLDet
[ICLR 2023] PyTorch implementation of VLDet （https://arxiv.org/abs/2211.14843）
Language:Python0 0
SheffieldCao/VoxFormer
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
Language:Python0 0
SheffieldCao/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python0 0