bbhasnat's Stars
openai/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
Heldenkombinat/Logodetect
Find logos in images and videos in just one-shot. Never be embarrassed again to say that you have a small data situation!
exadel-inc/CompreFace
Leading free and open-source face recognition system
KevinMusgrave/pytorch-adapt
Domain adaptation made easy. Fully featured, modular, and customizable.
qanastek/HugsVision
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
google/automl
Google Brain AutoML
lukemelas/EfficientNet-PyTorch
A PyTorch implementation of EfficientNet
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
kobiso/CBAM-keras
CBAM implementation on Keras
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
osmr/imgclsmob
Sandbox for training deep learning networks
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
chinhsuanwu/mobilevit-pytorch
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
CyberZHG/keras-self-attention
Attention mechanism for processing sequential data that considers the context for each timestamp.
pytorch/captum
Model interpretability and understanding for PyTorch
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!