KathPra

PhD researcher in computer vision.

Germany

KathPra's Stars

meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.9k3k
ExplainableML/WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"
Language:Python514
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Language:Python1.2k53
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.2k976
hendrycks/natural-adv-examples
A Harder ImageNet Test Set (CVPR 2021)
Language:Python59051
google-research/vision_transformer
Language:Jupyter Notebook10.3k1.3k
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
Language:Python1.5k118
hendrycks/imagenet-r
ImageNet-R(endition) and DeepAugment (ICCV 2021)
Language:Python25318
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Language:Jupyter Notebook2.3k152
mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Language:Python24915
facebookresearch/grid-feats-vqa
Grid features pre-training code for visual question answering
Language:Python26846
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python2.3k167
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Language:Jupyter Notebook1.4k379
feifengwhu/TFRCVG
training fast r-cnn in visual genome
Language:Jupyter Notebook51
KathPra/Pixel-Power
Language:Jupyter Notebook1
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python83.4k22.5k
eliahuhorwitz/DeepSIM
Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)
Language:Python42050
ssundaram21/dreamsim
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)
Language:Python38317
SSAW14/BeyondtheSpectrum
Implementation for the IJCAI2021 work "Beyond the Spectrum: Detecting Deepfakes via Re-synthesis"
Language:Python355
ilmoi/MML-Book
Code / solutions for Mathematics for Machine Learning (MML Book)
Language:Jupyter Notebook988165
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python8.3k762
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Language:Python2.3k111
cristinabustos16/CLIP-E
Code for the paper
Language:Jupyter Notebook61
IsaacBravo/ClimateVision-Website
This repository host the website for the research project Climate Vision.
Language:JavaScript2
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47.4k5.6k
JohannesBuchner/imagehash
A Python Perceptual Image Hashing Module
Language:Python3.2k328
di-dimitrov/propaganda-techniques-in-memes
Language:Python17
shashankskagnihotri/cospgd
The official repository for CosPGD: a unified white-box adversarial attack for pixel-wise prediction tasks.
Language:Python11
KathPra/Datasets_ClimateVisions
This repo contains the data used in "Towards Understanding Climate Change Perceptions: A Social Media Dataset"
14
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook9.1k805

KathPra

KathPra's Stars

meta-llama/llama3

ExplainableML/WaffleCLIP

facebookresearch/MetaCLIP

mlfoundations/open_clip

hendrycks/natural-adv-examples

google-research/vision_transformer

facebookresearch/ConvNeXt-V2

hendrycks/imagenet-r

google-research/big_vision

mertyg/vision-language-models-are-bows

facebookresearch/grid-feats-vqa

baaivision/EVA

peteanderson80/bottom-up-attention

feifengwhu/TFRCVG

KathPra/Pixel-Power

pytorch/pytorch

eliahuhorwitz/DeepSIM

ssundaram21/dreamsim

SSAW14/BeyondtheSpectrum

ilmoi/MML-Book

facebookresearch/ImageBind

UX-Decoder/Semantic-SAM

cristinabustos16/CLIP-E

IsaacBravo/ClimateVision-Website

facebookresearch/segment-anything

JohannesBuchner/imagehash

di-dimitrov/propaganda-techniques-in-memes

shashankskagnihotri/cospgd

KathPra/Datasets_ClimateVisions

facebookresearch/dinov2