billpsomas

Deep Learning | Computer Vision | Research

National Technical University of AthensAthens, Greece

billpsomas's Stars

rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook33.7k 364 1074.1k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.3k 114 3901.4k
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook11.1k 64 259944
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python8.9k 62 1.5k567
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.1k 52 625478
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
Language:Python5.4k 87 317472
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language:Python4.7k 40 467459
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language:Python4k 86 103367
ultralytics/JSON2YOLO
Convert JSON annotations into YOLO format.
Language:Python873 8 56232
Qinying-Liu/Awesome-Open-Vocabulary-Semantic-Segmentation
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
466 17 425
inbarhub/DDPM_inversion
Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.
Language:Python287 2 1213
MKLab-ITI/visil
Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
Language:Python209 10 2238
VamosC/CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
Language:Python124 4 2515
MKLab-ITI/ndvr-dml
Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
Language:Python118 6 1618
YonghaoXu/SEANet
[AAAI 2019] Self-Ensembling Attention Networks: Addressing Domain Shift for Semantic Segmentation
Language:Python102 3 812
vishaal27/SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
Language:Python94 3 95
billpsomas/rscir
Official PyTorch implementation and benchmark dataset for IGARSS 2024 ORAL paper: "Composed Image Retrieval for Remote Sensing"
Language:Python69 2 61
MKLab-ITI/intermediate-cnn-features
Feature extraction from videos based on intermediate layers of a Convolutional Neural Network.
Language:Python63 6 314
afiaka87/retrieval-augmented-diffusion
Retrieval augmented diffusion from CompVis.
Language:Jupyter Notebook51 2 07
gkordo/s2vs
Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]
Language:Python38 2 42
mever-team/ausil
Authors official Tensorflow implementation of the "Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning" [ICPR 2020]
Language:Python32 4 03
aimagelab/freeda
FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)
Language:Python29 6 22
mkoshkina/jersey-number-pipeline
A General Framework for Jersey Number Recognition in Sports Video
Language:Python25 1 23
MKLab-ITI/multimedia-geotagging
Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It includes the participation in the MediaEval Placing Task 2014.
Language:Java13 12 04
mever-team/visloc-estimation
Authors official PyTorch implementation of the "Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation" [ICMR 2021] and "Leveraging Selective Prediction for Reliable Image Geolocation" [MMM 2022]
Language:Python9 3 02
ttt-matching-based-vos/ttt_matching_vos
Authors official PyTorch implementation of the "Test-time Training for Matching-based Video Object Segmentation" [NeurIPS 2023]
Language:Python9 2 00
billpsomas/mars_crater_detection
Official PyTorch implementation for IGARSS 2024 paper: "Evaluation of Resource-Efficient Crater Detectors on Embedded Systems"
Language:C++6 3 00
Selefth/fair_neighborhood
Language:Jupyter Notebook4 1 00
yetigurbuz/generalized-sum-pooling
Official Tensorflow and PyTorch Implementation of "Generalized Sum Pooling for Metric Learning"
Language:Python4 2 01
YonghaoXu/UT-KD
Language:Python3 2 0

billpsomas

billpsomas's Stars

rasbt/LLMs-from-scratch

IDEA-Research/Grounded-Segment-Anything

facebookresearch/segment-anything-2

voxel51/fiftyone

OpenGVLab/InternVL

princeton-vl/infinigen

AILab-CVC/YOLO-World

ali-vilab/AnyDoor

ultralytics/JSON2YOLO

Qinying-Liu/Awesome-Open-Vocabulary-Semantic-Segmentation

inbarhub/DDPM_inversion

MKLab-ITI/visil

VamosC/CLIP4STR

MKLab-ITI/ndvr-dml

YonghaoXu/SEANet

vishaal27/SuS-X

billpsomas/rscir

MKLab-ITI/intermediate-cnn-features

afiaka87/retrieval-augmented-diffusion

gkordo/s2vs

mever-team/ausil

aimagelab/freeda

mkoshkina/jersey-number-pipeline

MKLab-ITI/multimedia-geotagging

mever-team/visloc-estimation

ttt-matching-based-vos/ttt_matching_vos

billpsomas/mars_crater_detection

Selefth/fair_neighborhood

yetigurbuz/generalized-sum-pooling

YonghaoXu/UT-KD