Performance on Local First AI

Performance on Local First AIhttps://localfirstai.eu/tags/performance/Recent content in Performance on Local First AIHugoen-usTue, 28 Apr 2026 00:00:00 +0000The Memory Bandwidth Cliff: Lessons from an AI Runawayhttps://localfirstai.eu/posts/2026-04-28-incident_003_alpha_post/Tue, 28 Apr 2026 00:00:00 +0000https://localfirstai.eu/posts/2026-04-28-incident_003_alpha_post/An investigation into the super-quadratic prefill latency and memory bandwidth bottleneck observed on the Gemma 4 26B stack.The Control Plane and the Data Plane: Managing the AI Thinking Taxhttps://localfirstai.eu/posts/2026-04-22-control-plane-vs-data-plane/Thu, 23 Apr 2026 00:00:00 +0000https://localfirstai.eu/posts/2026-04-22-control-plane-vs-data-plane/How to distinguish between agent reasoning and model thinking to prevent system-melting runaway generations.Should We Stop Asking Local LLMs to Think?https://localfirstai.eu/posts/should-we-stop-asking-local-llms-to-think/Tue, 21 Apr 2026 00:00:00 +0000https://localfirstai.eu/posts/should-we-stop-asking-local-llms-to-think/What Adam Smith, neuroscience, and a melting Mac Mini taught me about the real division of cognitive labour.