gaomingqi

PhD student in Computer Vision and Deep Learning

University of Warwick | SUSTech, Shenzhen, China

gaomingqi's Stars

facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook | 13.6k 80 4161.3k
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Language: HTML | 6.6k 225 60395
VAST-AI-Research/TripoSR
Language: Python | 4.8k 53 103555
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Language: Python | 2.4k 23 97121
dvlab-research/ControlNeXt
Controllable video and image generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language: Python | 1.5k 26 7773
Stability-AI/stable-fast-3d
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Language: Python | 1.3k 20 57152
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language: Python | 1k 10 12266
HCPLab-SYSU/Embodied_AI_Paper_List
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
999 14 672
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Language: Python | 813 31 8140
buaacyw/MeshAnythingV2
From anything to mesh like human artists. Official impl. of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"
Language: Python | 710 20 1740
FoundationVision/Groma
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Language: Python | 594 36 3661
pytorch-labs/attention-gym
Helpful tools and examples for working with flex-attention
Language: Python | 574 7 7730
Jyxarthur/flowsam
[ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman
Language: Python | 288 4 1521
Traffic-X/ViT-CoMer
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.
Language: Python | 256 3 2718
weijielyu/Gaga
Gaga: Group Any Gaussians via 3D-aware Memory Bank
Language: Python | 246 8 57
zamling/PSALM
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
Language: Python | 204 7 2210
cilinyan/VISA
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
Language: Python | 146 6 124
zhang-tao-whu/DVIS_Plus
Language: Python | 101 3 248
shirowalker/UCAD
[AAAI-2024] Official code for "Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt".
Language: Python | 86 8 202
heshuting555/DsHmp
[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
Language: Python | 82 5 170
XuHu0529/SAGS
The official implementation of SAGS (Segment Anything in 3D Gaussians)
Language: Jupyter Notebook | 66 2 24
htqin/BiMatting
[NeurIPS 2023] This project is the official implementation of our accepted NeurIPS 2023 paper BiMatting: Efficient Video Matting via Binarization.
Language: Python | 30 1 34
Tapall-AI/MeViS_Track_Solution_2024
[CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Language: Python | 28 2 31
jinlab-imvr/Surgical-SAM-2
Language: Jupyter Notebook | 27 3 23
ttgeng233/UniAV
Unified Audio-Visual Perception for Multi-Task Video Localization
Language: Python | 24 3 31
Kki2Eve/RISNet
Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes, CVPR 2024
Language: Python | 22 2 62
zjr2000/REVERIE
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
Language: Python | 15 2 00
yoqim/waveface
Repository for "WaveFace: Authentic Face Restoration with Efficient Frequency Recovery" (CVPR24)
10 2 30
yunlong10/MMComposition
Repo for MMComposition Benchmark
4 1 00
yoqim/waveface_page
project page for waveface
10
