/mlx_sharding

Distributed Inference for MLX LLMs

Primary Language: Python
