A collection (in progress) of example high-performance large language model implementations, written with JAX.
Current contents include:
For multi-host cluster setup and distributed training, see multi_host_README.md and the tpu_toolkit.sh script.