Pinned Repositories
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
DeepUPE
Underexposed Photo Enhancement Using Deep Illumination Estimation
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
LLMGA
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'
LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
PanopticFCN
Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)
Video-P2P
Video-P2P: Video Editing with Cross-attention Control
VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
DV Lab's Repositories
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
dvlab-research/VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
dvlab-research/DeepUPE
Underexposed Photo Enhancement Using Deep Illumination Estimation
dvlab-research/3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
dvlab-research/PointGroup
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
dvlab-research/FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
dvlab-research/Video-P2P
Video-P2P: Video Editing with Cross-attention Control
dvlab-research/PFENet
PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).
dvlab-research/SphereFormer
The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).
dvlab-research/LLMGA
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'
dvlab-research/Parametric-Contrastive-Learning
Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
dvlab-research/LargeKernel3D
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)
dvlab-research/Context-Aware-Consistency
Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)
dvlab-research/SparseTransformer
A fast and memory-efficient libarary for sparse transformer with varying token numbers (e.g., 3D point cloud).
dvlab-research/MOOD
Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.
dvlab-research/RIVAL
[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain
dvlab-research/Ref-NPR
[CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields
dvlab-research/Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
dvlab-research/Imbalanced-Learning
Imbalanced learning tool for imbalanced recognition and segmentation
dvlab-research/Mask-Attention-Free-Transformer
Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"
dvlab-research/MoTCoder
This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.
dvlab-research/ProposeReduce
Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)
dvlab-research/TriVol
The official code of TriVol in CVPR-2023
dvlab-research/GroupContrast
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
dvlab-research/MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
dvlab-research/LBGAT
Learnable Boundary Guided Adversarial Training (ICCV2021)
dvlab-research/APD