AI-Hypercomputer/maxtext

Inconsistent environment variable names

gabeweisz opened this issue · 0 comments

MaxText uses the environment variables JAX_COORDINATOR_IP, JAX_COORDINATOR_PORT, NNODES, and NODE_RANK for multi-system GPU training, but uses JAX_COORDINATOR_ADDRESS, a fixed port, JAX_PROCESS_COUNT, and a combination of several environment variables for multi-system CPU training. It would be great if both configurations used the same environment variables.
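
To illustrate what a shared lookup could look like, here is a minimal sketch, not MaxText's actual code. The helper `resolve_distributed_config`, the `DistributedConfig` tuple, and the `default_port` value are all hypothetical names and values chosen for the example. It accepts either naming scheme for the coordinator and process count. For the rank it reads only NODE_RANK, since the combination of variables the CPU path uses is not reproduced here.

```python
import os
from typing import Mapping, NamedTuple, Optional, Sequence


class DistributedConfig(NamedTuple):
    """Hypothetical container for the resolved settings."""
    coordinator_address: str  # "host:port"
    num_processes: int
    process_id: int


def _first_set(env: Mapping[str, str], names: Sequence[str]) -> str:
    """Return the value of the first variable in `names` that is set."""
    for name in names:
        if name in env:
            return env[name]
    raise KeyError(f"none of these variables are set: {', '.join(names)}")


def resolve_distributed_config(
    env: Optional[Mapping[str, str]] = None,
    default_port: int = 1234,  # assumption: placeholder, not MaxText's fixed port
) -> DistributedConfig:
    """Resolve coordinator, process count, and rank from either naming scheme."""
    env = os.environ if env is None else env

    if "JAX_COORDINATOR_ADDRESS" in env:
        # CPU-style: an address, with the port appended if it is missing.
        address = env["JAX_COORDINATOR_ADDRESS"]
        if ":" not in address:
            address = f"{address}:{default_port}"
    else:
        # GPU-style: separate IP and port variables.
        ip = env["JAX_COORDINATOR_IP"]
        port = env.get("JAX_COORDINATOR_PORT", str(default_port))
        address = f"{ip}:{port}"

    num_processes = int(_first_set(env, ["JAX_PROCESS_COUNT", "NNODES"]))
    # Assumption: the rank comes from NODE_RANK in both configurations.
    process_id = int(env["NODE_RANK"])
    return DistributedConfig(address, num_processes, process_id)
```

The three resolved values line up with the `coordinator_address`, `num_processes`, and `process_id` arguments of `jax.distributed.initialize`, so a single call site could serve both the GPU and CPU paths.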