RainyMoo's Stars
naver-ai/pcme
Official PyTorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
caodoanh2001/QuIIL_thompson_solution
JoonseoKang/mcan-cap
pkhdipraja/tapir
Code for "TAPIR: Learning Adaptive Revision for Incremental Natural Language Understanding with a Two-Pass Model", Findings of ACL 2023
uvavision/SimVQA
[CVPR 2022] SimVQA: Exploring Simulated Environments for Visual Question Answering
kaiyuanmifen/GFNDropout
SRI-CSL/TrinityMultimodalTrojAI
multimodal/multimodal
A collection of multimodal datasets and visual features for VQA and captioning in PyTorch. Just run "pip install multimodal".
ItemZheng/KDDAug
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
yuleiniu/cfvqa
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
983632847/Awesome-Multimodal-Object-Tracking
A personal investigative project to track the latest progress in the field of multi-modal object tracking.
hankcs/HanLP
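Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing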
aioz-ai/CFR_VQA
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
open-mmlab/mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
HobbitLong/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
ryanzhumich/Contrastive-Learning-NLP-Papers
Paper List for Contrastive Learning for Natural Language Processing
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
houjunlin/Awesome-Medical-Vision-Language-Learning
Papers and Public Datasets for Medical Vision-Language Learning
lzw-lzw/awesome-remote-sensing-vision-language-models
Awesome-Remote-Sensing-Vision-Language-Models
geoaigroup/awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
Cadene/murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
microsoft/Oscar
Oscar and VinVL
Tongjilibo/bert4torch
An elegant PyTorch implementation of transformers