billpsomas

Deep Learning | Computer Vision | Research

National Technical University of AthensAthens, Greece

billpsomas's Stars

meta-llama/llama
Inference code for Llama models
Language:Python57.6k 530 1.1k9.7k
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language:Python9.1k 55 5541.5k
facebookresearch/ConvNeXt
Code release for ConvNeXt model
Language:Python5.9k 32 130704
Guang000/Awesome-Dataset-Distillation
A curated list of awesome papers on dataset distillation and related applications.
Language:HTML1.5k 33 13141
omerbt/MultiDiffusion
Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
Language:Jupyter Notebook1k 34 2660
kennethleungty/Neural-Network-Architecture-Diagrams
Diagrams for visualizing neural network architecture (Created with diagrams.net)
816 4 1478
google/break-a-scene
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
Language:Python512 9 2624
layumi/University1652-Baseline
ACM Multimedia2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization :helicopter: annotates 1652 buildings in 72 universities around the world.
Language:Python505 12 3875
ChenDelong1999/RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
Language:Jupyter Notebook345 5 3922
OpenGVLab/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Language:Python315 12 4817
georgeretsi/smirk
Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)
Language:Python229 9 3825
MKLab-ITI/visil
Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
Language:Python212 9 2240
miccunifi/SEARLE
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
Language:Python166 12 129
samar-khanna/DiffusionSat
Official code repository for ICLR 2024 paper "DiffusionSat: A Generative Foundation Model for Satellite Imagery"
Language:Python144 14 179
MKLab-ITI/ndvr-dml
Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
Language:Python119 6 1618
billpsomas/simpool
This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"
Language:Python98 2 12
vimar-gu/MinimaxDiffusion
[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion
Language:Python90 1 119
navervision/CompoDiff
Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)
Language:Python83 9 73
shashankvkt/DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video""
Language:Python79 2 49
pals-ttic/adapting-CLIP
Language:Python64 7 510
MKLab-ITI/intermediate-cnn-features
Feature extraction from videos based on intermediate layers of a Convolutional Neural Network.
Language:Python63 6 314
ExplainableML/Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
Language:Python57 5 95
gkakogeorgiou/spot
[CVPR 2024 Highlight] :dog: SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
Language:Python57 5 35
nikosips/met
A large-scale dataset for instance-level recognition for artworks is introduced.
Language:Python48 3 12
wysoczanska/clip-diy
Official implementation of the WACV 2024 paper CLIP-DIY
Language:Jupyter Notebook34 2 23
mever-team/ausil
Authors official Tensorflow implementation of the "Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning" [ICPR 2020]
Language:Python33 4 03
gkakogeorgiou/mados
[ISPRS Journal of Photogrammetry and Remote Sensing] Detecting Marine Pollutants and Sea Surface Features with Deep Learning in Sentinel-2 Imagery
Language:Python32 2 06
psaltaath/tadn-mot
Transformer based Decision Networks for MOT
Language:Python22 3 16
MKLab-ITI/multimedia-geotagging
Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It includes the participation in the MediaEval Placing Task 2014.
Language:Java13 12 04
conghui1002/DG-UCDIR
Language:Python10 1 21

billpsomas

billpsomas's Stars

meta-llama/llama

WongKinYiu/yolov9

facebookresearch/ConvNeXt

Guang000/Awesome-Dataset-Distillation

omerbt/MultiDiffusion

kennethleungty/Neural-Network-Architecture-Diagrams

google/break-a-scene

layumi/University1652-Baseline

ChenDelong1999/RemoteCLIP

OpenGVLab/unmasked_teacher

georgeretsi/smirk

MKLab-ITI/visil

miccunifi/SEARLE

samar-khanna/DiffusionSat

MKLab-ITI/ndvr-dml

billpsomas/simpool

vimar-gu/MinimaxDiffusion

navervision/CompoDiff

shashankvkt/DoRA_ICLR24

pals-ttic/adapting-CLIP

MKLab-ITI/intermediate-cnn-features

ExplainableML/Vision_by_Language

gkakogeorgiou/spot

nikosips/met

wysoczanska/clip-diy

mever-team/ausil

gkakogeorgiou/mados

psaltaath/tadn-mot

MKLab-ITI/multimedia-geotagging

conghui1002/DG-UCDIR