YTEP-ZHI

Ph.D. Student at MMLab, CUHK | Generative Models | Autonomous Driving | Robotics.

The Chinese University of Hong KongHong Kong SAR

Pinned Repositories

DriveAGI
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
Language:Python697 32 1332
UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Language:Python3.9k 38 199434
Vista
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Language:Python678 18 5448
BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python1 0 00
centerformer
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
Language:Python1 0 00
DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language:Python1 0 00
dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Language:Python1 0 00
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python1 0 00
GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python1 0 00
HAL
Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.
Language:Python1 0 00

YTEP-ZHI's Repositories

YTEP-ZHI/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Language:Python1 0 00
YTEP-ZHI/centerformer
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
Language:Python1 0 00
YTEP-ZHI/DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language:Python1 0 00
YTEP-ZHI/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Language:Python1 0 00
YTEP-ZHI/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python1 0 00
YTEP-ZHI/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python1 0 00
YTEP-ZHI/HAL
Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs" at AAAI 2020.
Language:Python1 0 00
YTEP-ZHI/LLaMA-Adapter
Fine-tuning LLaMA to follow instructions within 1 Hour and 1.2M Parameters
Language:Python1 0 00
YTEP-ZHI/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook1 0 00
YTEP-ZHI/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Language:Python1 0 00
YTEP-ZHI/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Language:Python0 0 00
YTEP-ZHI/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language:Python0 0 00
YTEP-ZHI/nerfvis
NeRF visualization library under construction
Language:Python0 0 00
YTEP-ZHI/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Language:Python0 0
YTEP-ZHI/OpenSelfSup
Self-Supervised Learning Toolbox and Benchmark
Language:Python0 0
YTEP-ZHI/PolyLoss
Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.
Language:Python0 0
YTEP-ZHI/Proxy-Anchor-CVPR2020
Official PyTorch Implementation of Proxy Anchor Loss for Deep Metric Learning, CVPR 2020
Language:Python0 0
YTEP-ZHI/pyllama
LLaMA: Open and Efficient Foundation Language Models
Language:Python0 0
YTEP-ZHI/ResNeSt
ResNeSt: Split-Attention Networks
Language:Python0 0
YTEP-ZHI/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Language:Python0 0
YTEP-ZHI/setup
Setup a new machine without sudo!
Language:Shell0 0
YTEP-ZHI/ST-P3
[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.
Language:Python0 0
YTEP-ZHI/Stable-Pix2Seq
A full-fledged version of Pix2Seq
Language:Python0 0
YTEP-ZHI/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python0 0
YTEP-ZHI/TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
Language:Python0 0
YTEP-ZHI/transfuser
[PAMI'22] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving, [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Language:Python0 0
YTEP-ZHI/UniAD
Goal-oriented Autonomous Driving
Language:JavaScript0 0
YTEP-ZHI/unified-io-inference
Language:Jupyter Notebook0 0
YTEP-ZHI/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Language:Python0 0
YTEP-ZHI/YTEP-ZHI