yukang2017

Research Scientist in NVIDIA. Research on LLM, Efficient DL, and Computer Vision.

NVIDIAHong Kong

Pinned Repositories

3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
Language:Jupyter Notebook560 12 1725
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Language:Python2.4k 12 199179
LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language:Python2.7k 13 174290
VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Language:Python826 9 7171
VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Language:Python3.5k 39 184295
OpenPCDet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Language:Python5.2k 64 1.5k1.4k
NAS-quantization
The code for Joint Neural Architecture Search and Quantization
Language:Python13 1 10
Paper-Notes-2017
A notebook for some good papers I have read, including their key points and English writing.
5 1 00
RENAS
Code of ImageNet training and evaluation for the paper: RENAS: Reinforced Evolutionary Neural Architecture Search
Language:Python22 1 05
Stitcher
Language:Python93 2 611

yukang2017's Repositories

yukang2017/Stitcher
Language:Python93 2 611
yukang2017/RENAS
Code of ImageNet training and evaluation for the paper: RENAS: Reinforced Evolutionary Neural Architecture Search
Language:Python22 1 05
yukang2017/NAS-quantization
The code for Joint Neural Architecture Search and Quantization
Language:Python13 1 10
yukang2017/Paper-Notes-2017
A notebook for some good papers I have read, including their key points and English writing.
5 1 00
yukang2017/Pose-Mobile
A real-time posing app
Language:C++3 3 00
yukang2017/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Language:Python2 0 0
yukang2017/yukang2017.github.io
Language:HTML2 1 0
yukang2017/LongLoRA
Language:Python1 0 0
yukang2017/OpenPCDet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Language:Python1 1 0
yukang2017/AutoGPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language:JavaScript0 0
yukang2017/Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer
1 0
yukang2017/Composition-Stable-Diffusion
Image Composition via Stable Diffusion
Language:Python0 0
yukang2017/DetNAS
Language:Python2 01
yukang2017/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python0 0
yukang2017/EAT-NAS
EAT-NAS: Elastic Architecture Transfer for Accelerating Large-scale Neural Architecture Search
Language:Python2 0
yukang2017/FCOS
FCOS: Fully Convolutional One-Stage Object Detection (ICCV'19)
Language:Python0 0
yukang2017/Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs
Language:Jupyter Notebook0 0
yukang2017/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python0 0
yukang2017/ierg5350-assignment
Language:Jupyter Notebook1 0
yukang2017/IST-Net
Language:Python0 0
yukang2017/LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Language:Python0 0
yukang2017/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
Language:Python0 0
yukang2017/Segment-Everything-Everywhere-All-At-Once
0 0
yukang2017/SparseKD
(NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation
Language:Python0 0
yukang2017/spconv
Spatial Sparse Convolution Library
Language:Python0 0
yukang2017/SPS-Conv
(NeurlPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection
Language:Python0 0
yukang2017/spvnas
[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
Language:Python0 0
yukang2017/SST
Codes for “Fully Sparse 3D Object Detection” & “Embracing Single Stride 3D Object Detector with Sparse Transformer”
Language:Python0 0
yukang2017/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python0 0
yukang2017/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python0 0