NielsRogge
ML @HuggingFace. Interested in deep learning, NLP. Contributed 40+ models to HuggingFace Transformers
HuggingFaceBelgium
Pinned Repositories
awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
coco-eval
A tiny package supporting distributed computation of COCO metrics for PyTorch models.
CogVLM
a state-of-the-art-level open visual language model
Description2Process
Transforming textual descriptions into process models using deep learning
diffusion-notes
Some notes I took when learning about diffusion models.
tapas_utils
A package containing utils for the PyTorch version of the Tapas algorithm.
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
unilm
UniLM - Unified Language Model Pre-training / Pre-training for NLP and Beyond
Vision-Transformer-papers
This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
NielsRogge's Repositories
NielsRogge/awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
NielsRogge/CogVLM
a state-of-the-art-level open visual language model
NielsRogge/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
NielsRogge/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
NielsRogge/alignment-handbook
Robust recipes for to align language models with human and AI preferences
NielsRogge/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
NielsRogge/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
NielsRogge/table-transformer
Model training and evaluation code for our dataset PubTables-1M, developed to support the task of table extraction from unstructured documents.
NielsRogge/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
NielsRogge/AudioSep
Official implementation of "Separate Anything You Describe"
NielsRogge/azure-search-openai-demo
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
NielsRogge/big_vision
Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.
NielsRogge/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
NielsRogge/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
NielsRogge/FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
NielsRogge/open_lm
A repository for research on medium sized language models.
NielsRogge/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
NielsRogge/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more
NielsRogge/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
NielsRogge/scikit-image
Image processing in Python
NielsRogge/CompoundSplit
Compound splitter for German
NielsRogge/datacomp
DataComp: In search of the next generation of multimodal datasets
NielsRogge/DDColor
[ICCV 2023] Official implementation of "DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders"
NielsRogge/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
NielsRogge/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
NielsRogge/mamba
NielsRogge/mmdetection
OpenMMLab Detection Toolbox and Benchmark
NielsRogge/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
NielsRogge/T-MARS
Code for T-MARS data filtering
NielsRogge/Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model