ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate Unofficial MLX Implementation of "ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate".