YTEP-ZHI
Ph.D. Student at MMLab, CUHK | Generative Models | Autonomous Driving | Robotics.
The Chinese University of Hong KongHong Kong SAR
Pinned Repositories
DriveAGI
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Vista
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
centerformer
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
HAL
Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.
YTEP-ZHI's Repositories
YTEP-ZHI/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
YTEP-ZHI/centerformer
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
YTEP-ZHI/DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
YTEP-ZHI/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
YTEP-ZHI/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
YTEP-ZHI/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
YTEP-ZHI/HAL
Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.
YTEP-ZHI/LLaMA-Adapter
Fine-tuning LLaMA to follow instructions within 1 Hour and 1.2M Parameters
YTEP-ZHI/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
YTEP-ZHI/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
YTEP-ZHI/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
YTEP-ZHI/mmdetection
OpenMMLab Detection Toolbox and Benchmark
YTEP-ZHI/nerfvis
NeRF visualization library under construction
YTEP-ZHI/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
YTEP-ZHI/OpenSelfSup
Self-Supervised Learning Toolbox and Benchmark
YTEP-ZHI/PolyLoss
Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.
YTEP-ZHI/Proxy-Anchor-CVPR2020
Official PyTorch Implementation of Proxy Anchor Loss for Deep Metric Learning, CVPR 2020
YTEP-ZHI/pyllama
LLaMA: Open and Efficient Foundation Language Models
YTEP-ZHI/ResNeSt
ResNeSt: Split-Attention Networks
YTEP-ZHI/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
YTEP-ZHI/setup
Setup a new machine without sudo!
YTEP-ZHI/ST-P3
[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.
YTEP-ZHI/Stable-Pix2Seq
A full-fledged version of Pix2Seq
YTEP-ZHI/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
YTEP-ZHI/TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
YTEP-ZHI/transfuser
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
YTEP-ZHI/UniAD
Goal-oriented Autonomous Driving
YTEP-ZHI/unified-io-inference
YTEP-ZHI/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
YTEP-ZHI/YTEP-ZHI