Mlx on Local First AI

https://localfirstai.eu/tags/mlx/

Same Hardware. Different Runtime. Same Result.
Tue, 09 Jun 2026
https://localfirstai.eu/posts/2026-06-09-mlx-vs-ollama-runtime/

MLX and Ollama both run gemma4:26b on a Mac Mini M4 Pro. Neither hits a cliff through 40K tokens. The Flash Attention cliff from Exp 007 was an Ollama implementation artefact, not a hardware property, and an independent runtime now confirms it.