WingkeungM
Remote Sensing, Computer Vision Postdoctoral Fellow @ Tsinghua University
WHU -> UCAS -> THU
Beijing, China
WingkeungM's Stars
NanmiCoder/MediaCrawler
Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
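As a rough illustration of the inference workflow this description refers to, here is a minimal image-segmentation sketch in the spirit of the repository's README; the checkpoint path, config name, image file, and point prompt below are illustrative placeholders, not guaranteed file names.

```python
import numpy as np
import torch
from PIL import Image
from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

# Placeholder paths: substitute whichever checkpoint/config you downloaded.
checkpoint = "./checkpoints/sam2_hiera_large.pt"
model_cfg = "sam2_hiera_l.yaml"
predictor = SAM2ImagePredictor(build_sam2(model_cfg, checkpoint))

image = np.array(Image.open("example.jpg").convert("RGB"))  # H x W x 3, uint8

with torch.inference_mode():
    predictor.set_image(image)  # embed the image once; prompts can then vary cheaply
    # Prompt with a single foreground click at (x, y); label 1 marks foreground.
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[500, 375]]),
        point_labels=np.array([1]),
    )
```

As in the original SAM, the heavy image encoder runs once in `set_image`; different prompts then reuse the cached embedding.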
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
lllyasviel/Omost
Your image is almost there!
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
MzeroMiko/VMamba
VMamba: Visual State Space Models; the code is based on Mamba
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
mini-sora/minisora
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
Jack-bo1220/Awesome-Remote-Sensing-Foundation-Models
NAOSI-DLUT/Campus2024
Summary of internet-industry campus recruitment information for the class of 2024
Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
youquanl/Segment-Any-Point-Cloud
[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
magic-research/Dataset_Quantization
[ICCV2023] Dataset Quantization
rsdler/Remote-Sensing-in-CVPR2024
Papers related to remote sensing in CVPR 2024
aim-uofa/SegPrompt
Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning
Ucas-HaoranWei/Vary-tiny-600k
Vary-tiny codebase built upon LAVIS (for training from scratch) and a PDF image-text pair dataset (about 600k pairs, in English and Chinese)
T1aNS1R/Evil-Geniuses
Ucas-HaoranWei/Vary-family
fullcyxuc/B-Seg
Code for our SIGGRAPH 2023 paper: "UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation"
canoe-Z/PETDet
[TGRS 2023] Official implementation of PETDet.
MAVREC/mavrec-code
This code is provided for reproducibility of results in the paper: Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?
chunbolang/RARE
Official PyTorch Implementation of Retain and Recover: Delving into Information Loss for Few-Shot Segmentation (TIP'23).
StarBurstStream0/SIRS
Official code for "SIRS: Multi-task Joint Learning for Remote Sensing Foreground-entity Image-text Retrieval" (TGRS 2024)
Na-Z/LIOND
[AAAI 2024] Robust Visual Recognition with Class-Imbalanced Open-World Noisy Data