Pinned Repositories
AM-GCN
AM-GCN: Adaptive Multi-channel Graph Convolutional Networks
awesome-python-cn
Python资源大全中文版,包括:Web框架、网络爬虫、模板引擎、数据库、数据可视化、图片处理等,由伯乐在线持续更新。
Awesome-PyTorch-Chinese
【干货】史上最全的PyTorch学习资源汇总
bert
TensorFlow code and pre-trained models for BERT
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
clevr-iep
Inferring and Executing Programs for Visual Reasoning
CRN_tvqa
HGA
Reasoning with Heterogeneous Graph Alignment for Video Question Answering
ICCV2021-Paper-Code-Interpretation
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
knowit-rock
ROCK model for Knowledge-Based VQA in Videos
B-matchlsr's Repositories
B-matchlsr/AM-GCN
AM-GCN: Adaptive Multi-channel Graph Convolutional Networks
B-matchlsr/awesome-python-cn
Python资源大全中文版,包括:Web框架、网络爬虫、模板引擎、数据库、数据可视化、图片处理等,由伯乐在线持续更新。
B-matchlsr/Awesome-PyTorch-Chinese
【干货】史上最全的PyTorch学习资源汇总
B-matchlsr/bert
TensorFlow code and pre-trained models for BERT
B-matchlsr/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
B-matchlsr/clevr-iep
Inferring and Executing Programs for Visual Reasoning
B-matchlsr/CRN_tvqa
B-matchlsr/HGA
Reasoning with Heterogeneous Graph Alignment for Video Question Answering
B-matchlsr/ICCV2021-Paper-Code-Interpretation
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
B-matchlsr/knowit-rock
ROCK model for Knowledge-Based VQA in Videos
B-matchlsr/layoutlm
B-matchlsr/leeml-notes
李宏毅《机器学习》笔记,在线阅读地址:https://datawhalechina.github.io/leeml-notes
B-matchlsr/LeetCode
:monkey:LeetCode、剑指Offer刷题笔记(C/C++、Python3实现)
B-matchlsr/lihang-code
《统计学习方法》的代码实现
B-matchlsr/LOGNet-VQA
Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)
B-matchlsr/mac-network
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
B-matchlsr/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
B-matchlsr/ML-notes
notes about machine learning
B-matchlsr/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
B-matchlsr/murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
B-matchlsr/openvqa
A lightweight, scalable, and general framework for visual question answering research
B-matchlsr/parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
B-matchlsr/pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
B-matchlsr/pytorch-transformers
👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)
B-matchlsr/sam-textvqa
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
B-matchlsr/sgg
Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization.
B-matchlsr/show-me-the-code
Python 练习册,每天一个小程序
B-matchlsr/ssbaseline
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]
B-matchlsr/TRAR-VQA
This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task
B-matchlsr/vqa-mfb