Pinned Repositories
mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, and diffusion models for text-to-image generation, image/video restoration/enhancement, etc.
mmdetection
OpenMMLab Detection Toolbox and Benchmark
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
BEVerse
The official repository for BEVerse
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
BoYuanRoadLaneDetection
CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
clip-interrogator
Image to prompt with BLIP and CLIP
Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
SheffieldCao's Repositories
SheffieldCao/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
SheffieldCao/CAT-Seg
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
SheffieldCao/clip-interrogator
Image to prompt with BLIP and CLIP
SheffieldCao/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
SheffieldCao/DiffIR
This project is the official implementation of "DiffIR: Efficient Diffusion Model for Image Restoration" (ICCV 2023).
SheffieldCao/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
SheffieldCao/Lite-Mono
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
SheffieldCao/mmdet-learning
SheffieldCao/Far3D
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection
SheffieldCao/mmagic
OpenMMLab Image and Video Restoration, Editing and Generation Toolbox
SheffieldCao/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
SheffieldCao/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
SheffieldCao/Multimodal-GPT
Multimodal-GPT
SheffieldCao/Occ3D
SheffieldCao/Occ3DBaseline
CVPR2023-Occupancy-Prediction-Challenge
SheffieldCao/ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
SheffieldCao/ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
SheffieldCao/OVO-Open-Vocabulary-Occupancy
SheffieldCao/PolarFormer
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
SheffieldCao/SAN
Open-vocabulary Semantic Segmentation
SheffieldCao/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
SheffieldCao/sheffield.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
SheffieldCao/SheffieldCao
Config files for my GitHub profile.
SheffieldCao/stable-dreamfusion
A PyTorch implementation of text-to-3D DreamFusion, powered by Stable Diffusion.
SheffieldCao/SurroundOcc
Multi-camera 3D Occupancy Prediction for Autonomous Driving
SheffieldCao/UniAD
[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving
SheffieldCao/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
SheffieldCao/VLDet
[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
SheffieldCao/VoxFormer
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
SheffieldCao/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.