Pinned Repositories
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
OpenPCDet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
NAS-quantization
The code for Joint Neural Architecture Search and Quantization
Paper-Notes-2017
A notebook for some good papers I have read, including their key points and English writing.
RENAS
Code of ImageNet training and evaluation for the paper: RENAS: Reinforced Evolutionary Neural Architecture Search
Stitcher
yukang2017's Repositories
yukang2017/Stitcher
yukang2017/RENAS
Code of ImageNet training and evaluation for the paper: RENAS: Reinforced Evolutionary Neural Architecture Search
yukang2017/NAS-quantization
The code for Joint Neural Architecture Search and Quantization
yukang2017/Pose-Mobile
A real-time posing app
yukang2017/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
yukang2017/yukang2017.github.io
yukang2017/LongLoRA
yukang2017/OpenPCDet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
yukang2017/AutoGPT
An experimental open-source attempt to make GPT-4 fully autonomous.
yukang2017/Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer
yukang2017/Composition-Stable-Diffusion
Image Composition via Stable Diffusion
yukang2017/DetNAS
yukang2017/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
yukang2017/EAT-NAS
EAT-NAS: Elastic Architecture Transfer for Accelerating Large-scale Neural Architecture Search
yukang2017/FCOS
FCOS: Fully Convolutional One-Stage Object Detection (ICCV'19)
yukang2017/Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs
yukang2017/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
yukang2017/ierg5350-assignment
yukang2017/IST-Net
yukang2017/LinK
[CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception
yukang2017/LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
yukang2017/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
yukang2017/Segment-Everything-Everywhere-All-At-Once
yukang2017/SparseKD
(NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation
yukang2017/spconv
Spatial Sparse Convolution Library
yukang2017/SPS-Conv
(NeurlPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection
yukang2017/spvnas
[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
yukang2017/SST
Codes for “Fully Sparse 3D Object Detection” & “Embracing Single Stride 3D Object Detector with Sparse Transformer”
yukang2017/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
yukang2017/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)