mesnico

Researcher at ISTI-CNR, Pisa, Italy. I'm interested in the secrets of intelligence and I'm passionate about Computer Vision and Deep Learning.

Pisa, Italy

Pinned Repositories

ALADIN
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
Language: Python
attentive_video_crowd_counting
Code for reproducing the results for the paper "A Spatio-Temporal Attentive Network for Video-Based Crowd Counting"
Language: Python
learning-relationship-aware-visual-features
Relational Content-Based Image Retrieval (R-CBIR) - Retrieving images with given relationships among objects
Language: Python
MemePersuasionDetection
Deep learning methods for detecting persuasion techniques in memes
Language: Python
RelationNetworks-CLEVR
A PyTorch implementation of "A simple neural network module for relational reasoning", working on the CLEVR dataset
Language: Python
TERAN
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
Language: Python
TERN
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
Language: Python
text-to-motion-retrieval
Official code for reproducing results obtained in the short paper "Text-to-Motion Retrieval: Towards Joint Understanding of Human Motion Data and Natural Language", accepted at SIGIR 2023.
Language: Python
WifiIndoorLocation
Android application that localizes people in indoor environments using Wi-Fi fingerprinting methods
Language: Java
Wiki-Image-Caption-Matching
Language: Python

mesnico's Repositories

mesnico/RelationNetworks-CLEVR
A PyTorch implementation of "A simple neural network module for relational reasoning", working on the CLEVR dataset
Language: Python
mesnico/TERAN
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
Language: Python
mesnico/TERN
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
Language: Python
mesnico/text-to-motion-retrieval
Official code for reproducing results obtained in the short paper "Text-to-Motion Retrieval: Towards Joint Understanding of Human Motion Data and Natural Language", accepted at SIGIR 2023.
Language: Python
mesnico/ALADIN
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
Language: Python
mesnico/learning-relationship-aware-visual-features
Relational Content-Based Image Retrieval (R-CBIR) - Retrieving images with given relationships among objects
Language: Python
mesnico/WifiIndoorLocation
Android application that localizes people in indoor environments using Wi-Fi fingerprinting methods
Language: Java
mesnico/Wiki-Image-Caption-Matching
Language: Python
mesnico/MemePersuasionDetection
Deep learning methods for detecting persuasion techniques in memes
Language: Python
mesnico/attentive_video_crowd_counting
Code for reproducing the results for the paper "A Spatio-Temporal Attentive Network for Video-Based Crowd Counting"
Language: Python
mesnico/VBS22-KIS-Analysis
Language: Jupyter Notebook
mesnico/ai4eu-text-to-visual-search
Code for building the Text to Visual search component for use in Acumos. It is compliant with the AI4EU specifications.
Language: Python
mesnico/Akka-File-Sharing
Distributed file-sharing application developed with the Akka framework
Language: Java
mesnico/DTfH-Laboratory
Colab notebooks for experimenting with the techniques taught in the course "Deep Learning tools for image classification and retrieval" at the Digital Tools for Humanists summer school
Language: Jupyter Notebook
mesnico/faster-rcnn.pytorch
A faster PyTorch implementation of Faster R-CNN
Language: Python
mesnico/food-recognition-MIM
Similarity search index and k-NN classifier for food search and recognition. Implemented with transfer learning using deep features.
Language: Java
mesnico/graph-rcnn.pytorch
PyTorch code for our ECCV 2018 paper "Graph R-CNN for Scene Graph Generation" and other papers
Language: Python
mesnico/monet
An implementation of the MONet model for unsupervised scene decomposition in PyTorch
Language: Python
mesnico/networkx
Official NetworkX source code repository.
Language: Python
mesnico/pyskl
A toolbox for skeleton-based action recognition.
mesnico/pytorch-retinanet
PyTorch implementation of RetinaNet object detection.
Language: Python
mesnico/recurrent_vision_transformer_visual_reasoning
Code for reproducing the results from our paper "Recurrent Vision Transformer for Solving Visual Reasoning Problems", accepted at ICIAP 2021
Language: Python
mesnico/scene_graph_benchmark
Image scene graph generation benchmark
Language: Python
mesnico/str-encoders
Surrogate Text Representation Encoders for Real Vectors
Language: Python
mesnico/tag_my_outfit_pipeline
mesnico/vsepp
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
Language: Python
mesnico/clip-vip_video_search
Shows how to use CLIP-ViP for video search
mesnico/ray
A high-performance distributed execution engine
Language: Python