/mlx_parallm

Fast parallel LLM inference for MLX

Primary LanguageJupyter Notebook

Issues