KathPra

PhD researcher in computer vision.

Germany

KathPra's Stars

google-research/google-research
Google Research
Language:Jupyter Notebook34.7k 751 1.3k8k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python32.9k 317 9514.8k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook15.9k 193 4052.3k
lmcinnes/umap
Uniform Manifold Approximation and Projection
Language:Python7.6k 127 804813
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python7.2k 46 311732
vikhyat/moondream
tiny vision language model
Language:Jupyter Notebook7k 65 154549
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python3.9k 30 268348
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
Language:Jupyter Notebook2.5k 25 233218
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language:Python2.4k 24 351193
fastai/imagenette
A smaller subset of 10 easily classified classes from Imagenet, and a little more French
Language:Jupyter Notebook994 12 2475
beichenzbc/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Language:Python734 13 9837
LAION-AI/CLIP_benchmark
CLIP-like model evaluation
Language:Jupyter Notebook650 12 6580
LAION-AI/CLIP-based-NSFW-Detector
Language:Python329 4 1329
MILVLG/bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
Language:Jupyter Notebook296 2 9576
berkeley-hipie/HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
Language:Jupyter Notebook276 7 2220
bjoern-andres/graph
Graphs and Graph Algorithms in C++, including Minimum Cost (Lifted) Multicuts
Language:C++237 24 1488
facebookresearch/isc2021
Code for the Image similarity challenge.
Language:Python195 8 442
lyakaap/ISC21-Descriptor-Track-1st
The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.
Language:Python138 4 1319
chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
Language:Python115 7 84
benchaplin/hungarian-algorithm
Python 3 implementation of the Hungarian Algorithm
Language:Python72 1 712
aimagelab/safe-clip
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
Language:Python52 7 20
neuroexplicit-saar/Discover-then-Name
Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.
Language:Python33 7 52
ZhouYuxuanYX/MultiMax
This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning""
Language:Python17 3 00
zhutong0219/ITIN
Multimodal Sentiment Analysis with Image-Text Interaction Network
Language:Python13 1 3
ZhouYuxuanYX/Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs
Language:Python80
IsaacBravo/streamlit-app
This is an interactive app that allow users play around with the clip model to analyze images
Language:Python3 1 01
ZhouYuxuanYX/Maximum-Suppression-Regularization
Language:Python21
HY-Wong/Thesis
Language:Python1 1 01
shashankskagnihotri/adv_mmsegmentation
Language:Python1 1 00
shashankskagnihotri/pruneshift-public
Language:Python1 1 00

KathPra

KathPra's Stars

google-research/google-research

huggingface/pytorch-image-models

meta-llama/llama-recipes

lmcinnes/umap

IDEA-Research/GroundingDINO

vikhyat/moondream

rom1504/img2dataset

rom1504/clip-retrieval

webdataset/webdataset

fastai/imagenette

beichenzbc/Long-CLIP

LAION-AI/CLIP_benchmark

LAION-AI/CLIP-based-NSFW-Detector

MILVLG/bottom-up-attention.pytorch

berkeley-hipie/HIPIE

bjoern-andres/graph

facebookresearch/isc2021

lyakaap/ISC21-Descriptor-Track-1st

chs20/RobustVLM

benchaplin/hungarian-algorithm

aimagelab/safe-clip

neuroexplicit-saar/Discover-then-Name

ZhouYuxuanYX/MultiMax

zhutong0219/ITIN

ZhouYuxuanYX/Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs

IsaacBravo/streamlit-app

ZhouYuxuanYX/Maximum-Suppression-Regularization

HY-Wong/Thesis

shashankskagnihotri/adv_mmsegmentation

shashankskagnihotri/pruneshift-public