futureisatyourhand

I believe that one day I will succeed.

Institute of Computing Technology, Chinese Academy of Sciences, Beijing

futureisatyourhand's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 47.9k 308 6755.7k
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment across server, mobile, embedded, and IoT devices)
Language: Python 44.7k 444 9.4k 7.9k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 32.4k 313 9324.8k
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language: Python 22.6k 632 2665.5k
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language: Python 20.3k 257 722.5k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language: Python 10.9k 70 107691
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language: Python 10.7k 45 4181.6k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10k 99 667975
autogluon/autogluon
Fast and Accurate ML in 3 Lines of Code
Language: Python 8.1k 97 1.5k 930
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language: Python 5.1k 49 453386
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language: Jupyter Notebook 4.8k 34 199648
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Language: Python 4.4k 58 901755
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Language: Python 2.5k 43 391156
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language: Python 2.3k 30 162167
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language: Python 1.6k 13 141199
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
Language: Python 1.5k 7 73120
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language: C++ 1.5k 37 185176
minivision-ai/Silent-Face-Anti-Spoofing
Silent liveness detection (Silent-Face-Anti-Spoofing)
Language: Python 1.4k 30 118451
microsoft/RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
Language: Python 717 10 10152
weijiaheng/Advances-in-Label-Noise-Learning
A curated (most recent) list of resources for Learning with Noisy Labels
687 16 158
chunbolang/BAM
Official PyTorch Implementation of Learning What Not to Segment: A New Perspective on Few-Shot Segmentation (CVPR'22 Oral & TPAMI'23).
Language: Python 249 7 6443
Jyouhou/UnrealText
Synthetic Scene Text from 3D Engines
Language: C++ 247 10 2839
wenwenyu/TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
Language: Jupyter Notebook 184 13 1916
LAION-AI/scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
Language: Jupyter Notebook 154 8 112
wangsr126/MAE-Lite
Official implementation of the ICML 2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"
Language: Python 116 4 89
zejiangh/MILAN
PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" (https://arxiv.org/pdf/2208.06049.pdf).
Language: Python 79 2 06
bytedance/oclip
Language: Python 50 3 116
futureisatyourhand/atlas300_ascend310_fewshotdetection
Language: Python 4 1 00
futureisatyourhand/Top-Related-Meta-Learning-Method-for-Few-Shot-Detection
Code for the paper at https://arxiv.org/pdf/2007.06837.pdf
Language: Python 3 1 21
futureisatyourhand/self-supervised-learning
Self-supervised image classification and object detection
Language: Python 2 1 00
