billpsomas
Deep Learning | Computer Vision | Research
National Technical University of AthensAthens, Greece
billpsomas's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
voxel51/fiftyone
Refine high-quality datasets and visual AI models
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
ultralytics/JSON2YOLO
Convert JSON annotations into YOLO format.
Qinying-Liu/Awesome-Open-Vocabulary-Semantic-Segmentation
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
inbarhub/DDPM_inversion
Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
MKLab-ITI/visil
Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
VamosC/CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
MKLab-ITI/ndvr-dml
Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
YonghaoXu/SEANet
[AAAI 2019] Self-Ensembling Attention Networks: Addressing Domain Shift for Semantic Segmentation
vishaal27/SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
billpsomas/rscir
Official PyTorch implementation and benchmark dataset for IGARSS 2024 ORAL paper: "Composed Image Retrieval for Remote Sensing"
MKLab-ITI/intermediate-cnn-features
Feature extraction from videos based on intermediate layers of a Convolutional Neural Network.
afiaka87/retrieval-augmented-diffusion
Retrieval augmented diffusion from CompVis.
gkordo/s2vs
Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]
mever-team/ausil
Authors official Tensorflow implementation of the "Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning" [ICPR 2020]
aimagelab/freeda
FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)
mkoshkina/jersey-number-pipeline
A General Framework for Jersey Number Recognition in Sports Video
MKLab-ITI/multimedia-geotagging
Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It includes the participation in the MediaEval Placing Task 2014.
mever-team/visloc-estimation
Authors official PyTorch implementation of the "Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation" [ICMR 2021] and "Leveraging Selective Prediction for Reliable Image Geolocation" [MMM 2022]
ttt-matching-based-vos/ttt_matching_vos
Authors official PyTorch implementation of the "Test-time Training for Matching-based Video Object Segmentation" [NeurIPS 2023]
billpsomas/mars_crater_detection
Official PyTorch implementation for IGARSS 2024 paper: "Evaluation of Resource-Efficient Crater Detectors on Embedded Systems"
Selefth/fair_neighborhood
yetigurbuz/generalized-sum-pooling
Official Tensorflow and PyTorch Implementation of "Generalized Sum Pooling for Metric Learning"
YonghaoXu/UT-KD