My hands-on learning experience with transformers (GPTs) - 中文 Readme
- GPT Basics - definition, code implementation, use guide
- GPT Advance - high level overview, research papers, applications
- The Anotated Transformer - Havard
- Huggingface Transformer - provides APIs and tools to easily download and train state-of-the-art pretrained models
- GPT Apps - real life applications of GPT (e.g ChatGPT, Jarvis)
Readings
- Attension is All You Need Paper - Google
- The Illustrated Transformer - by Jay Alammar
- The Annotated Transformer - Harvard NLP
- Techniques for training large neural networks - Open AI
- Huggingface Transformer
- The State of GPT - Andrej Karpathy
- CS231n Convolutional Neural Networks for Visual Recognition - Stanford
- Intro to Deep Learning - MIT
- Layer Normalization - Google
- Intro To Natural Language Processing - Transformers
- Matrix multiplication - Wikepedia
Repos
- nanoGPT - by Andrej Karpathy
- The Annotated Transformer - Harvard
- Tensor2Tensor - Tensorflow implementation of the transformer
MIT License