DV Lab

Deep Vision Lab

Pinned Repositories

3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
Language:Jupyter Notebook509 11 1623
DeepUPE
Underexposed Photo Enhancement Using Deep Illumination Estimation
Language:Python561 24 80100
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Language:Python1.5k 10 121101
LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Language:Python596 11 9438
LLMGA
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'
Language:Python259 4 517
LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language:Python2.5k 13 163251
MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language:Python3k 25 111274
PanopticFCN
Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)
Language:Python388 8 5053
Video-P2P
Video-P2P: Video Editing with Cross-attention Control
Language:Python333 9 1422
VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Language:Python653 8 5953

DV Lab's Repositories

dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language:Python3k 25 111274
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language:Python2.5k 13 163251
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Language:Python1.5k 10 121101
dvlab-research/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Language:Python653 8 5953
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Language:Python596 11 9438
dvlab-research/DeepUPE
Underexposed Photo Enhancement Using Deep Illumination Estimation
Language:Python561 24 80100
dvlab-research/3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
Language:Jupyter Notebook509 11 1623
dvlab-research/PointGroup
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
Language:Python366 13 6279
dvlab-research/FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
Language:Python359 3 3534
dvlab-research/Video-P2P
Video-P2P: Video Editing with Cross-attention Control
Language:Python333 9 1422
dvlab-research/PFENet
PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).
Language:Python297 9 8154
dvlab-research/SphereFormer
The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).
Language:Python281 5 7132
dvlab-research/LLMGA
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'
Language:Python259 4 517
dvlab-research/Parametric-Contrastive-Learning
Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
Language:Python224 7 2329
dvlab-research/LargeKernel3D
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)
Language:Python183 6 167
dvlab-research/Context-Aware-Consistency
Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)
Language:Python154 5 3019
dvlab-research/SparseTransformer
A fast and memory-efficient libarary for sparse transformer with varying token numbers (e.g., 3D point cloud).
Language:Python143 6 79
dvlab-research/MOOD
Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.
Language:Python131 3 114
dvlab-research/RIVAL
[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain
Language:Python129 17 89
dvlab-research/Ref-NPR
[CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields
Language:Python119 6 139
dvlab-research/Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Language:Python102 2 22
dvlab-research/Imbalanced-Learning
Imbalanced learning tool for imbalanced recognition and segmentation
Language:Python78 4 38
dvlab-research/Mask-Attention-Free-Transformer
Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"
Language:Python58 3 114
dvlab-research/MoTCoder
This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.
Language:Python55 0 21
dvlab-research/ProposeReduce
Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)
Language:Python41 4 44
dvlab-research/TriVol
The official code of TriVol in CVPR-2023
Language:Python38 7 51
dvlab-research/GroupContrast
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
351
dvlab-research/MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
Language:Python35 2 2
dvlab-research/LBGAT
Learnable Boundary Guided Adversarial Training (ICCV2021)
Language:Python33 3 42
dvlab-research/APD
Language:Python4 2 0