transformers

There are 6040 repositories under transformers topic.

microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI
Language:Jupyter Notebook102k 897 19453.9k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook78.2k 668 20311.6k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python64.2k 484 1386.5k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python62.1k 302 7.6k7.5k
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python24.4k 158 2743.4k
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Language:MDX23.3k 158 4.1k2.5k
amusi/CVPR2025-Papers-with-Code
CVPR 2025 论文和开源项目合集
21.4k 293 2122.8k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python20k 107 1.3k2.1k
arc53/DocsGPT
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Language:Python17.3k 96 5761.9k
stas00/ml-engineering
Machine Learning Engineering Open Book
Language:Python15.7k 133 38957
huggingface/transformers.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Language:JavaScript14.8k 94 8691k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python14.1k 171 1.2k3.3k
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
Language:Python14.1k 137 261970
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
Language:Python12.8k 96 3.8k3.1k
neuml/txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Language:Python11.8k 108 927757
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook11.3k 146 4801.7k
qubvel-org/segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Language:Python11k 82 7151.8k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python10.7k 132 1.2k1.6k
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Language:Rust10.2k 123 1.1k991
niedev/RTranslator
Open source real-time translation app for Android that runs locally
Language:C++9.3k 70 122835
openvinotoolkit/openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Language:C++9.2k 185 3.2k2.8k
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Jupyter Notebook8.5k 100 161542
intel/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
Language:Python8.4k 259 3k1.4k
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Language:Python8.3k 42 716809
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Language:Python8.3k 176 137963
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python7.9k 134 52681
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Language:Python7.7k 72 126848
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python7.3k 127 4601.1k
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Language:Python7.2k 52 1.8k864
SkalskiP/courses
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Language:Python6.2k 99 8565
microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Language:Python6k 75 504838
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language:Python5.7k 54 266485
lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Language:Python5.6k 92 277646
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Jupyter Notebook5.6k 36 372527
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language:Python5.4k 48 190429
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python5.4k 106 146463

transformers

microsoft/generative-ai-for-beginners

rasbt/LLMs-from-scratch

labmlai/annotated_deep_learning_paper_implementations

hiyouga/LLaMA-Factory

lucidrains/vit-pytorch

deepset-ai/haystack

amusi/CVPR2025-Papers-with-Code

huggingface/peft

arc53/DocsGPT

stas00/ml-engineering

huggingface/transformers.js

NVIDIA/Megatron-LM

BlinkDL/RWKV-LM

PaddlePaddle/PaddleNLP

neuml/txtai

NielsRogge/Transformers-Tutorials

qubvel-org/segmentation_models.pytorch

speechbrain/speechbrain

huggingface/tokenizers

niedev/RTranslator

openvinotoolkit/openvino

FoundationVision/VAR

intel/ipex-llm

OpenRLHF/OpenRLHF

EleutherAI/gpt-neo

lucidrains/PaLM-rlhf-pytorch

jessevig/bertviz

EleutherAI/gpt-neox

MaartenGr/BERTopic

SkalskiP/courses

microsoft/presidio

lucidrains/x-transformers

lucidrains/DALLE-pytorch

OFA-Sys/Chinese-CLIP

imoneoi/openchat

huggingface/alignment-handbook