Pinned Repositories
awesome
😎 Awesome lists about all kinds of interesting topics
Awesome-Computer-Vision
Awesome Resources for Advanced Computer Vision Topics
awesome-contrastive-self-supervised-learning
A comprehensive list of awesome contrastive self-supervised learning papers.
awesome-fast-attention
List of efficient attention modules
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
Awesome-Mamba-Papers
Awesome Papers related to Mamba.
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
External-Attention-pytorch
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers in more depth.⭐⭐⭐
myvqa
The implementation of CLVIN, CAAN and MPCCT
RainyMoo's Repositories
RainyMoo/myvqa
The implementation of CLVIN, CAAN and MPCCT
RainyMoo/External-Attention-pytorch
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers in more depth.⭐⭐⭐
RainyMoo/awesome
😎 Awesome lists about all kinds of interesting topics
RainyMoo/Awesome-Computer-Vision
Awesome Resources for Advanced Computer Vision Topics
RainyMoo/awesome-contrastive-self-supervised-learning
A comprehensive list of awesome contrastive self-supervised learning papers.
RainyMoo/awesome-fast-attention
List of efficient attention modules
RainyMoo/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
RainyMoo/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
RainyMoo/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
RainyMoo/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
RainyMoo/Awesome-Prompting-on-Vision-Language-Model
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.
RainyMoo/Awesome-Transformer-Attention
A comprehensive paper list of Vision Transformer/Attention, including papers, code, and related websites
RainyMoo/awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
RainyMoo/awesome-visual-question-answering
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas.
RainyMoo/D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
RainyMoo/awesome-vision-and-language
A curated list of awesome vision and language resources (still under construction... stay tuned!)
RainyMoo/grid-feats-vqa
Grid features pre-training code for visual question answering
RainyMoo/ICCV2021-Papers-with-Code
A collection of ICCV 2021 papers and open-source projects
RainyMoo/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
RainyMoo/linear-attention-transformer
Transformer based on a variant of attention that has linear complexity with respect to sequence length
RainyMoo/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
RainyMoo/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
RainyMoo/multimodal-deep-learning
This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.
RainyMoo/openvqa
A lightweight, scalable, and general framework for visual question answering research
RainyMoo/QA-Survey
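A survey of question answering systems by Zhang Richong's team at Beihang University's Advanced Innovation Center for Big Data. It covers knowledge-graph-based QA (KBQA), text-based QA (TextQA), table-based QA (TabletQA), and vision-based QA (VisualQA), summarizing work from both academia and industry for each category.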
RainyMoo/SSL4MIS
Semi Supervised Learning for Medical Image Segmentation, a collection of literature reviews and code implementations.
RainyMoo/Transformer-MM-Explainability
[ICCV 2021 - Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Includes examples for DETR and VQA.
RainyMoo/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
RainyMoo/TRAR-VQA
This is the official PyTorch implementation of our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" for the VQA task
RainyMoo/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics.