yanglixiaoshen
PhD student, EE, BUAA; Member of MC2 Lab; Working on CV, MM and touching fish.
Beihang University, Beijing, China
yanglixiaoshen's Stars
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Wu-ZJ/DSGNN
dragonlee258079/Saliency-Ranking
Code release for the TPAMI 2021 paper "Instance-Level Relative Saliency Ranking with Graph Reasoning" by Nian Liu, Long Li, Wangbo Zhao, Junwei Han, and Ling Shao.
MinglangQiao/awesome-salient-object-ranking
A curated list of awesome resources for salient object ranking (SOR)
Luo-Z13/SkySenseGPT
A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
hukenovs/easyportrait
EasyPortrait - Face Parsing and Portrait Segmentation Dataset
suoych/KEDs
Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
thuyngch/Human-Segmentation-PyTorch
Human segmentation models, training/inference code, and trained weights, implemented in PyTorch
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
guanhuankang/ECCV24PoseSOR
ECCV24-PoseSOR: Human Pose Can Guide Our Attention
EricDengbowen/QAGNet
Official repository for CVPR 2024 paper "Advancing Saliency Ranking with Human Fixations: Dataset, Models and Benchmarks".
guanhuankang/SeqRank
Code for the paper "SeqRank: Sequential Ranking of Salient Objects", accepted at AAAI-24.
franciszzj/VLPrompt
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation
ChocoWu/Awesome-Scene-Graph-Generation
This is a repository for listing papers on scene graph generation and application.
Q-Future/Q-Align
[ICML2024] [IQA, IAA, VQA] All-in-one foundation model for visual scoring. Can be efficiently fine-tuned on downstream datasets.
NeeluMadan/ViFM_Survey
Foundation Models for Video Understanding: A Survey
NExT-ChatV/NExT-Chat
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
SkyworkAI/Vitron
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
OpenGVLab/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Rubics-Xuan/MRES
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.
feiyanhu/tinyHD
yuhangzang/ContextDET
Contextual Object Detection with Multimodal Large Language Models
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
codec2021/video_codec_learn
Learning materials on video encoding and decoding