KathPra's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
ExplainableML/WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
mlfoundations/open_clip
An open source implementation of CLIP.
hendrycks/natural-adv-examples
A Harder ImageNet Test Set (CVPR 2021)
google-research/vision_transformer
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
hendrycks/imagenet-r
ImageNet-R(endition) and DeepAugment (ICCV 2021)
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
facebookresearch/grid-feats-vqa
Grid features pre-training code for visual question answering
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
feifengwhu/TFRCVG
Training Fast R-CNN on Visual Genome
KathPra/Pixel-Power
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
eliahuhorwitz/DeepSIM
Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)
ssundaram21/dreamsim
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight); When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)
SSAW14/BeyondtheSpectrum
Implementation for the IJCAI2021 work "Beyond the Spectrum: Detecting Deepfakes via Re-synthesis"
ilmoi/MML-Book
Code / solutions for Mathematics for Machine Learning (MML Book)
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
cristinabustos16/CLIP-E
Code for the paper
IsaacBravo/ClimateVision-Website
This repository hosts the website for the research project Climate Vision.
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
JohannesBuchner/imagehash
A Python Perceptual Image Hashing Module
di-dimitrov/propaganda-techniques-in-memes
shashankskagnihotri/cospgd
The official repository for CosPGD: a unified white-box adversarial attack for pixel-wise prediction tasks.
KathPra/Datasets_ClimateVisions
This repo contains the data used in "Towards Understanding Climate Change Perceptions: A Social Media Dataset"
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.