gpt-2

There are 739 repositories under gpt-2 topic.

BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python11.8k 136 195814
codota/TabNine
AI Code Completions
Language:Shell10.5k 138 571504
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python9.5k 63 102601
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook8.3k 127 4151.3k
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Language:Python8.2k 178 139939
Morizeyao/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
Language:Python7.4k 162 2511.7k
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models，高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Language:Python4.4k 89 10430
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python3.6k 112 64280
jaymody/picoGPT
An unnecessarily tiny implementation of GPT-2 in NumPy.
Language:Python3.1k 29 10401
yangjianxin1/GPT2-chitchat
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI**)
Language:Python2.9k 41 117679
dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Language:Python2.9k 75 263524
stochasticai/xTuring
Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
Language:Python2.5k 32 101197
guillaume-be/rust-bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Language:Rust2.5k 39 207203
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Language:Python2.4k 78 159371
BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
Language:Python2.4k 64 377439
microsoft/DialoGPT
Large-scale pretraining for dialogue
Language:Python2.3k 55 82342
lxe/simple-llm-finetuner
Simple UI for LLM Model Finetuning
Language:Jupyter Notebook2k 20 49134
VHellendoorn/Code-LMs
Guide to using pre-trained large language models of source code
Language:Python1.7k 43 35238
huggingface/transfer-learning-conv-ai
🦄 State-of-the-Art Conversational AI with Transfer Learning
Language:Python1.7k 86 109426
thu-coai/CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Language:Python1.7k 28 108248
imcaspar/gpt2-ml
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Language:Python1.7k 38 89334
codota/tabnine-vscode
Visual Studio Code client for Tabnine. https://marketplace.visualstudio.com/items?itemName=TabNine.tabnine-vscode
Language:TypeScript1.3k 36 361171
explosion/spacy-transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Language:Python1.3k 31 0161
mishalhossin/Discord-AI-Chatbot
This Discord chatbot is incredibly versatile. Powered incredibly fast Groq API
Language:Python1.3k 29 156403
cedrickchee/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
1k 39 0127
turtlesoupy/this-word-does-not-exist
This Word Does Not Exist
Language:Python1k 9 4884
guinmoon/LLMFarm
llama and other large language models on iOS and MacOS offline using GGML library.
Language:Swift1k 13 6659
Tencent/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Language:Python990 20 36139
graykode/gpt-2-Pytorch
Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation
Language:Python947 27 18223
tg12/gpt_jailbreak_status
This is a repository that aims to provide updates on the status of jailbreaking the OpenAI GPT language model.
Language:HTML890 36 767
asyml/texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Language:Python744 24 138118
codota/tabnine-vim
Vim client for TabNine. https://vimawesome.com/plugin/tabnine-vim
Language:Python668 19 10036
re-search/DocProduct
Medical Q&A with Deep Language Models
Language:Jupyter Notebook561 25 32158
voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Language:Python526 11 2365
yangjianxin1/CPM
Easy-to-use CPM for Chinese text generation（基于CPM的中文文本生成）
Language:Python524 9 27135
codota/tabnine-intellij
Jetbrains IDEs client for TabNine. Compatible with all IntelliJ-based IDEs. https://plugins.jetbrains.com/plugin/12798-tabnine
Language:Kotlin509 18 10857

gpt-2

BlinkDL/RWKV-LM

codota/TabNine

microsoft/LoRA

NielsRogge/Transformers-Tutorials

EleutherAI/gpt-neo

Morizeyao/GPT2-Chinese

lonePatient/awesome-pretrained-chinese-nlp-models

FoundationVision/VAR

jaymody/picoGPT

yangjianxin1/GPT2-chitchat

dbiir/UER-py

stochasticai/xTuring

guillaume-be/rust-bert

asyml/texar

BrikerMan/Kashgari

microsoft/DialoGPT

lxe/simple-llm-finetuner

VHellendoorn/Code-LMs

huggingface/transfer-learning-conv-ai

thu-coai/CDial-GPT

imcaspar/gpt2-ml

codota/tabnine-vscode

explosion/spacy-transformers

mishalhossin/Discord-AI-Chatbot

cedrickchee/awesome-transformer-nlp

turtlesoupy/this-word-does-not-exist

guinmoon/LLMFarm

Tencent/TencentPretrain

graykode/gpt-2-Pytorch

tg12/gpt_jailbreak_status

asyml/texar-pytorch

codota/tabnine-vim

re-search/DocProduct

voidful/TextRL

yangjianxin1/CPM

codota/tabnine-intellij