Pinned Repositories
2022AICITY_Code_From_Top_Teams
codes from top teams of AI City Challenge 2022
3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
3d-vehicle-tracking
3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
ACNet
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
acoustic-scene-analysis-with-multihead-self-attention
This repo contains implementation of the paper "Acoustic Scene Analysis With Multihead Self Attention" by Weimin Wang, Weiran Wang, Ming Sun, Chao Wang from Amazon Alexa team
ADNet_pytorch
ADNet implementation on pytorch
AICIty-reID-2020
:red_car: The 1st Place Submission to AICity Challenge 2020 re-id track (Baidu-UTS submission)
AICity2020-VOC-ReID
The 2nd place submission for AICity Challenge 2020 ReID track, VOC-ReID: Vehicle Re-identification based on Vehicle-Orientation-Camera
cvpr2019
cvpr2019 papers,极市团队整理
zhuMingXu's Repositories
zhuMingXu/Awesome-Autonomous-Driving
awesome-autonomous-driving
zhuMingXu/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
zhuMingXu/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
zhuMingXu/distill-bev
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
zhuMingXu/DQTrack
Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023]
zhuMingXu/EA-LSS
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection
zhuMingXu/FB-BEV
FB-BEV and FB-OCC are vision-centric autonomous driving perception algorithm based on forward-backward view transformation strtegies.
zhuMingXu/gaussian_splatting_notes
A detailed formulae explanation on gaussian splatting
zhuMingXu/HashNeRF-pytorch
Pure PyTorch Implementation of NVIDIA paper on Instant Training of Neural Graphics primitives: https://nvlabs.github.io/instant-ngp/
zhuMingXu/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
zhuMingXu/MapTR
[ICLR'23 Spotlight] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
zhuMingXu/mmLaneDet
zhuMingXu/MonoScene
[CVPR 2022] "MonoScene: Monocular 3D Semantic Scene Completion": 3D Semantic Occupancy Prediction from a single image
zhuMingXu/MOTR
[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
zhuMingXu/nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
zhuMingXu/Occ3D
zhuMingXu/OccNet
[ICCV 2023] OccNet: Scene as Occupancy
zhuMingXu/OpenOccupancy
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
zhuMingXu/Point-Clouds-3D-Perception-with-Open3D
Using the KITTI dataset, we employed Open3D to visualize, downsample, segment with RANSAC, cluster via DBSCAN, create 3D bounding boxes, and perform surface reconstruction on point clouds.
zhuMingXu/ScalableMap
The official repository for ScalableMap (CoRL 2023)
zhuMingXu/SegAnyGAussians
The official implementation of SAGA (Segment Any 3D GAussians)
zhuMingXu/segment-lidar
Python package for segmenting LiDAR data using Segment-Anything Model (SAM) from Meta AI.
zhuMingXu/SegmentAnything3D
[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes
zhuMingXu/semantic-kitti-api
SemanticKITTI API for visualizing dataset, processing data, and evaluating results.
zhuMingXu/ST-P3
[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.
zhuMingXu/StreamPETR
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
zhuMingXu/super-easy-super-resolution
super easy super resolution
zhuMingXu/SurroundOcc
[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
zhuMingXu/VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
zhuMingXu/VectorMapNet_code
This is the official code base of VectorMapNet (ICML 2023)