mkava98
M.Sc. student in Artificial Intelligence | Multimodal learning, NLP, CV, vision-language models
Pinned Repositories
python_tutorial
Faster-rcnn
FasterRCNN
GA
Computational Intelligence: Genetic algorithm
MMBERT
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
nutrients
VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
gpt2-vision
Convert GPT-2 into a multimodal model using CLIP. Under 1000 lines of pure PyTorch
mkava98's Repositories
mkava98/GA
Computational Intelligence: Genetic algorithm
mkava98/python_tutorial
mkava98/MMBERT
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
mkava98/Faster-rcnn
FasterRCNN
mkava98/nutrients
mkava98/VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"