Leon1207

Coding bird.

Xiamen University

Pinned Repositories

Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
362 5 2611
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.1k 159 1.5k2.1k
3DRefTR
This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"
Language:Python18 1 20
PointMetaBase
This is a PyTorch implementation of PointMetaBase proposed by our paper "Meta Architecure for Point Cloud Analysis"
Language:Python1 0 00
Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Language:Python0 0 00
LLaVA-NeXT
Language:Python2.3k 32 190155
MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Language:Python2.4k 45 543356
nccl
Optimized primitives for collective multi-GPU communication
Language:C++3.1k 151 1.2k787
spconv
Spatial Sparse Convolution Library
Language:Python1.8k 24 689362
video-mme.github.io
Language:JavaScript1 3 02

Leon1207/3DRefTR
This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"
Language:Python18 1 20
Leon1207/PointMetaBase
This is a PyTorch implementation of PointMetaBase proposed by our paper "Meta Architecure for Point Cloud Analysis"
Language:Python1 0 00
Leon1207/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Language:Python0 0 00