mkava98

M.Sc. student in Artificial Intelligence Multi-modal learning, NLP, CV, Vision-Language model

Pinned Repositories

python_tutorial
Language:Jupyter Notebook10
Faster-rcnn
FasterRCNN
Language:Jupyter Notebook00
GA
Computational Intelligence _Genetic algorithm
Language:Jupyter Notebook00
MMBERT
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Language:Python00
nutrients
Language:Python00
VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
Language:Python00
gpt2-vision
Convert GPT-2 into a multimodal model using CLIP. Under 1000 lines of pure PyTorch
Language:Python50

mkava98's Repositories

mkava98/GA
Computational Intelligence _Genetic algorithm
Language:Jupyter Notebook
mkava98/python_tutorial
Language:Jupyter Notebook1
mkava98/MMBERT
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Language:Python
mkava98/Faster-rcnn
FasterRCNN
Language:Jupyter Notebook
mkava98/nutrients
Language:Python
mkava98/VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"