Pinned Repositories
Action-Localization
Action-Localization, Atomic Visual Actions (AVA) Dataset
Adaptive
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Adaptive-master
CapDec
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
echo360
Commandline tool for automated downloads of echo360 videos hosted by university
Patient-Instructions
Code for "Retrieve, Reason, and Refine: Generating Accurate and Faithful Discharge/Patient Instructions" (NeurIPS 2022)
Self-Supervised-Embedding-Fusion-Transformer
The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.
spinningup
An educational resource to help anyone learn deep reinforcement learning.
VATN
dissertation
VL-BERT
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
PatrickStar-lanza's Repositories
PatrickStar-lanza/Self-Supervised-Embedding-Fusion-Transformer
The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.
PatrickStar-lanza/Action-Localization
Action-Localization, Atomic Visual Actions (AVA) Dataset
PatrickStar-lanza/Adaptive
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
PatrickStar-lanza/Adaptive-master
PatrickStar-lanza/CapDec
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
PatrickStar-lanza/echo360
Commandline tool for automated downloads of echo360 videos hosted by university
PatrickStar-lanza/Patient-Instructions
Code for "Retrieve, Reason, and Refine: Generating Accurate and Faithful Discharge/Patient Instructions" (NeurIPS 2022)
PatrickStar-lanza/spinningup
An educational resource to help anyone learn deep reinforcement learning.
PatrickStar-lanza/VATN
dissertation
PatrickStar-lanza/VL-BERT
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
PatrickStar-lanza/XProNet
[ECCV2022] The official implementation of Cross-modal Prototype Driven Network for Radiology Report Generation