luckybird1994's Stars
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
state-spaces/mamba
Mamba SSM architecture
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
mohuangrui/ucasthesis
LaTeX Thesis Template for the University of Chinese Academy of Sciences
facebookresearch/pytorchvideo
A deep learning library for video understanding research.
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
facebookresearch/NeuralCompression
A collection of tools for neural compression enthusiasts.
aim-uofa/Matcher
[ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
OpenGVLab/Vision-RWKV
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
zsef123/PointRend-PyTorch
A PyTorch implementation of PointRend: Image Segmentation as Rendering
prismformore/Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding" and ECCV2022 paper "Inverted Pyramid Multi-task Transformer for Dense Scene Understanding"
okhat/blog
yulunzhang/awesome-diffusion-low-level-vision
Awesome Diffusion Models in Low-Level Vision
luckybird1994/ASAM
MKFMIKU/vidm
[AAAI23 Oral] Official implementations of Video Implicit Diffusion Models
fxia22/gn.pytorch
OliverHxh/SkeletonGCL
The repository is the implementation of ICLR 2023 paper "Graph Contrastive Learning for Skeleton-based Action Recognition".
YannickStruempler/inr_based_compression
Contains the implementation of the paper "Implicit Neural Representation for Image Compression" at ECCV 2022
luckybird1994/SAMCOD
chaoliu18/RPLVC
Project page for the paper: "Liu C, Sun H, Katto J, et al.Learned Video Compression with Residual Prediction and Loop Filter"
luckybird1994/HQSOD
makinyilmaz/LHBDC
hkxiao/zs-cosod
Zero-Shot Co-salient Object Detection Framework
luckybird1994/classnet