Nano Transformers, Transformer, GPT, GPT-2, GPT-3 (AST) etc.

Primary Language: Jupyter Notebook

nano-transformers

Nano-Transformers is a project for learning about Transformers and for quick reference on Transformer-related knowledge.

This project aims to

  • help you easily understand Transformers through detailed yet simple code
  • help you easily write Keras code, with fewer complaints about TensorFlow
  • help you easily get investment from VCs if you are working on web3

2017 Transformer

Nano Transformer
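
As a quick reference for the operation at the core of every model listed here, the following is a minimal sketch of scaled dot-product attention in plain Python. It is not the Keras code from the notebooks: the function names `softmax` and `attention` and the `causal` flag are illustrative assumptions, and it handles a single head with no learned projections.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention(q, k, v, causal=False):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.

    q, k, v are lists of float vectors, one vector per position.
    With causal=True, position i only attends to positions <= i,
    which is the masking used by GPT-style decoders.
    (Hypothetical helper for illustration, not the repository's API.)
    """
    d_k = len(k[0])
    out = []
    for i, qi in enumerate(q):
        scores = []
        for j, kj in enumerate(k):
            if causal and j > i:
                # Masked positions get -inf so their softmax weight is 0.
                scores.append(float("-inf"))
            else:
                dot = sum(a * b for a, b in zip(qi, kj))
                scores.append(dot / math.sqrt(d_k))
        weights = softmax(scores)
        # Weighted sum of the value vectors, one output channel at a time.
        out.append([
            sum(w * vj[c] for w, vj in zip(weights, v))
            for c in range(len(v[0]))
        ])
    return out


x = [[1.0, 0.0], [0.0, 1.0]]
full = attention(x, x, x)                  # encoder-style: every position sees all
masked = attention(x, x, x, causal=True)   # decoder-style: no looking ahead
```

With the causal mask, the first position can only attend to itself, so `masked[0]` equals the first value vector; without the mask it becomes a blend of both.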

2018 GPT-1

Nano GPT-1

2019 GPT-2

Nano GPT-2

2020 GPT-3

Nano GPT-3

2022 InstructGPT

InstructGPT, not fully done yet.

  • Tried to elicit emergent abilities from a small model trained on a small dataset. It failed: the model cannot reason, but it memorizes well.

References