zipengxuc
Ph.D. student at MHUG, University of Trento. Research Interests: Generative Models, Vision-Language.
University of TrentoItaly
zipengxuc's Stars
labuladong/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
orpatashnik/StyleCLIP
Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)
jonbarron/website
facebookresearch/swav
PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882
Eurus-Holmes/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
asheeshcric/awesome-contrastive-self-supervised-learning
A comprehensive list of awesome contrastive self-supervised learning papers.
Miller-Xie/Code
面试高频算法题总结,个人博客
ruotianluo/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
danqi/acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
forence/Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
jiasenlu/AdaptiveAttention
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
batra-mlp-lab/visdial-challenge-starter-pytorch
Starter code in PyTorch for the Visual Dialog challenge
epfml/collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
facebookresearch/simmc
With the aim of building next generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduce two new datasets (both in the virtual shopping domain), the annotation schema, the core technical tasks, and the baseline models. The code for the baselines and the datasets will be opensourced.
jiasenlu/visDial.pytorch
visual dialog model in pytorch
yufengm/Adaptive
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
fawazsammani/knowing-when-to-look-adaptive-attention
PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinal for Image Captioning
chengxuanying/KDD-Multimodalities-Recall
This is our solution for KDD Cup 2020. We implemented a very neat and simple neural ranking model based on siamese BERT which ranked first among the solo teams and ranked 12th among all teams on the final leaderboard.
gicheonkang/dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
satwikkottur/clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
shubhamagarwal92/visdial_conv
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
vmurahari3/visdial-diversity
Pytorch implementation of https://arxiv.org/pdf/1909.10470.pdf
phellonchen/DMRM
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
zipengxuc/ADVSE-GuessWhat
Code for ACMMM'20 ✨"Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue"
JXZe/DAM
danielamassiceti/geneval_visdial
A Revised Generative Evaluation of Visual Dialogue