maynardsd's Stars
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Ruixxxx/Awesome-Vision-Mamba-Models
[Official Repo] A Survey on Vision Mamba: Models, Applications and Challenges
luca-medeiros/lang-segment-anything
SAM with text prompt
hotfinda/VideoMambaPro
Improving Mamaba performance on Video Understanding task
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
WHU-Sigma/HyperSIGMA
The official repo for the paper "HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model"
siyuanliii/masa
Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything
OliverRensu/ARM
This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Event-AHU/Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
Event-AHU/Mamba_FETrack
[PRCV-2024] State Space Model based Frame-Event Tracking
tigasgon1999/Agricultural-Pattern-Recognition
This project focuses on using the Semantic Segmentation Deep Learning architecture DeepLAbV3+ on the Agriculture-Vision dataset. We focus on improving the architecture's performance by solving the class imbalance problem present in the data.
wufanyou/WRL-Agriculture-Vision
jackyjsy/CVPR21Chal-Agrivision
This repo contains the code to reproduce our results in CVPR21 Challenge on Agriculture-Vision.
edornd/agrivision-2022
Agriculture Vision Workshop 2022
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
MzeroMiko/VMamba
VMamba: Visual State Space Models,code is based on mamba
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
GXNU-ZhongLab/AQATrack
CVPR24
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
ChenHsing/Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
Vibashan/PosSAM
Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything
YingqingHe/LVDM
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
xingpingdong/Occlusion-Tracking
Occlusion-aware real-time object tracking, IEEE TMM 2017
MinghanLi/UniVS
Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024)
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
scutan90/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
TorchEnsemble-Community/Ensemble-Pytorch
A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.