GPT-J-6B-Inference-Demo

Model parallel transformers in JAX and Haiku

Primary language: Jupyter Notebook

Last updated: Oct 20th 2021

The latest notebook is gradient_demo_nick_20211018.ipynb. It runs on an A100 GPU, but the inference step takes about 2000 s (over 30 minutes).
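A minimal sketch of how the inference step could be timed to see where the time goes. With JAX-jitted functions the first call typically includes compilation, so it may help to time the first call separately from later ones. The helper `time_calls` and the stand-in `fake_infer` are hypothetical names used only for illustration; they are not taken from the notebook, and the real inference call would need to be substituted in.

```python
import time


def time_calls(fn, *args, repeats=3):
    """Return (first_call_seconds, [later_call_seconds, ...]) for fn(*args)."""

    def run_once():
        start = time.perf_counter()
        out = fn(*args)
        # JAX dispatches work asynchronously, so wait for the result if the
        # returned object exposes block_until_ready (JAX arrays do).
        block = getattr(out, "block_until_ready", None)
        if callable(block):
            block()
        return time.perf_counter() - start

    first = run_once()  # with a jitted function, this may include compilation
    later = [run_once() for _ in range(repeats)]
    return first, later


# Hypothetical stand-in for the notebook's inference call.
def fake_infer(prompt):
    return prompt.upper()


first, later = time_calls(fake_infer, "hello", repeats=3)
print(f"first call: {first:.6f} s")
print("later calls:", ", ".join(f"{t:.6f} s" for t in later))
```

If the first call is far slower than the later ones, most of the measured time is probably one-time setup rather than steady-state inference. If all calls are similarly slow, the cost is in the step itself.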