billpsomas
Deep Learning | Computer Vision | Research
National Technical University of AthensAthens, Greece
billpsomas's Stars
meta-llama/llama
Inference code for Llama models
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
facebookresearch/ConvNeXt
Code release for ConvNeXt model
Guang000/Awesome-Dataset-Distillation
A curated list of awesome papers on dataset distillation and related applications.
omerbt/MultiDiffusion
Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
kennethleungty/Neural-Network-Architecture-Diagrams
Diagrams for visualizing neural network architecture (Created with diagrams.net)
google/break-a-scene
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
layumi/University1652-Baseline
ACM Multimedia2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization :helicopter: annotates 1652 buildings in 72 universities around the world.
ChenDelong1999/RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
OpenGVLab/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
georgeretsi/smirk
Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)
MKLab-ITI/visil
Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
miccunifi/SEARLE
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
samar-khanna/DiffusionSat
Official code repository for ICLR 2024 paper "DiffusionSat: A Generative Foundation Model for Satellite Imagery"
MKLab-ITI/ndvr-dml
Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
billpsomas/simpool
This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"
vimar-gu/MinimaxDiffusion
[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion
navervision/CompoDiff
Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)
shashankvkt/DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video""
pals-ttic/adapting-CLIP
MKLab-ITI/intermediate-cnn-features
Feature extraction from videos based on intermediate layers of a Convolutional Neural Network.
ExplainableML/Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
gkakogeorgiou/spot
[CVPR 2024 Highlight] :dog: SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
nikosips/met
A large-scale dataset for instance-level recognition for artworks is introduced.
wysoczanska/clip-diy
Official implementation of the WACV 2024 paper CLIP-DIY
mever-team/ausil
Authors official Tensorflow implementation of the "Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning" [ICPR 2020]
gkakogeorgiou/mados
[ISPRS Journal of Photogrammetry and Remote Sensing] Detecting Marine Pollutants and Sea Surface Features with Deep Learning in Sentinel-2 Imagery
psaltaath/tadn-mot
Transformer based Decision Networks for MOT
MKLab-ITI/multimedia-geotagging
Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It includes the participation in the MediaEval Placing Task 2014.
conghui1002/DG-UCDIR