Pinned Repositories
2D_detection
TensorFlow implementation of SqueezeDet, trained on the KITTI dataset.
2DPASS
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds (ECCV 2022) :fire:
3D-BoundingBox
PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry
3D-Deepbox
3D Bounding Box Estimation Using Deep Learning and Geometry (MultiBin)
3D-PointCloud
Papers and Datasets about Point Cloud.
aadc-2017
ADAPT
This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.
AdaptSegNet
Learning to Adapt Structured Output Space for Semantic Segmentation, CVPR 2018 (spotlight)
AdvSemiSeg
Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018
ai_for_robotics
Visualizations of algorithms covered in Sebastian Thrun's excellent Artificial Intelligence for Robotics course on Udacity.
weisili2016's Repositories
weisili2016/Bench2Drive
Closed-loop multi-ability evaluation of end-to-end autonomous driving algorithms
weisili2016/Bench2DriveZoo
BEVFormer, UniAD, VAD in Closed-Loop CARLA Evaluation with World Model RL Expert Think2Drive
weisili2016/DeepLearing-Interview-Awesome-2024
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, also covering new ideas, new problems, new resources, and new projects from work and research
weisili2016/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
weisili2016/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
weisili2016/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
weisili2016/ELM
[ECCV 2024] Embodied Understanding of Driving Scenarios
weisili2016/generative-models
Generative Models by Stability AI
weisili2016/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
weisili2016/insightface
State-of-the-art 2D and 3D Face Analysis Project
weisili2016/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model approaching GPT-4V performance
weisili2016/KnowledgeEditingPapers
[Knowledge Editing] Must-read Papers on Knowledge Editing for Large Language Models.
weisili2016/LaneSegNet
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
weisili2016/llama.cpp
LLM inference in C/C++
weisili2016/machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links
weisili2016/minimind
[Large Models] Train a small 26M-parameter GPT entirely from scratch in 3 hours; inference and training need as little as a 2 GB GPU!
weisili2016/RoadNet
[ICCV2023 Oral] RoadNetworkTRansformer & [AAAI 2024] LaneGraph2Seq
weisili2016/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
weisili2016/scaling-diffusion-perception
Scaling Properties of Diffusion Models For Perceptual Tasks
weisili2016/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
weisili2016/Senna
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
weisili2016/shikra
weisili2016/sophon-demo
weisili2016/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
weisili2016/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
weisili2016/TPVFormer
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
weisili2016/VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
weisili2016/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
weisili2016/VisionLLM
VisionLLM Series
weisili2016/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection