cxy1996's Stars
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Pointcept/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
autonomousvision/occupancy_networks
This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"
megvii-research/ML-GCN
PyTorch implementation of Multi-Label Image Recognition with Graph Convolutional Networks, CVPR 2019.
buaacyw/GaussianEditor
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
luo3300612/Visualizer
assistant tools for attention visualization in deep learning
Kedreamix/Awesome-Talking-Head-Synthesis
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
Pointcept/PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
4DVLab/Vision-Centric-BEV-Perception
Vision-Centric BEV Perception: A Survey
zju3dv/LoG
Level of Gaussians
NVlabs/EmerNeRF
PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision
OpenDriveLab/OccNet
[ICCV 2023] OccNet: Scene as Occupancy
xverse-engine/XV3DGS-UEPlugin
A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model.
zhulf0804/PointPillars
A Simple PointPillars PyTorch Implementation for 3D LiDAR(KITTI) Detection.
OpenDriveLab/LaneSegNet
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
ucla-mobility/V2V4Real
[CVPR2023 Highlight] The official codebase for paper "V2V4Real: A large-scale real-world dataset for Vehicle-to-Vehicle Cooperative Perception"
er-muyue/BeMapNet
Tsinghua-MARS-Lab/neural_map_prior
The official implementation of the CVPR2023 paper titled “Neural Map Prior for Autonomous Driving”.
jike5/P-MapNet
Received by RAL
wenjie710/PivotNet
Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)
LLVM-AD/MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
xiaolul2/MGMap
[CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"
fudan-zvg/RoadNet
[ICCV2023 Oral] RoadNetworkTRansformer & [AAAI 2024] LaneGraph2Seq