We Didn't Notice

On June 11, 2026, the US government issued an export control directive requiring Anthropic to suspend access to its two most capable AI models — Claude Fable 5 and Claude Mythos 5 — for all foreign nationals, worldwide, effective immediately. Within hours, every product built on those models lost access: Cognition’s Devin, Agent Arena, and any Spanish agency that had integrated Claude into its workflow. No migration window. No prior notice. The model was available, and then it was not.

The technical community converged quickly on one lesson: reliance on a single frontier API now carries explicit geopolitical risk. Practitioners who had built workflows around US-hosted models discovered, abruptly, that their dependency was not on a software service. It was on a foreign export policy.

Here is what CasaSol experienced on June 11: nothing.

The CasaSol enrichment pipeline has no Anthropic API call. Gemma 4 26B runs locally, under Ollama, on a Mac Mini M4 Pro inside the office. The Router — a smaller model on the same hardware — classifies the input. Raw listing data comes in as voice notes and text, is enriched across 15 structured fields, sanitised for GDPR compliance, and written to a local SQLite database and a ChromaDB vector store. The entire enrichment cycle is offline by design, not as a concession to limited connectivity.

The MCP server exposes that corpus to Claude Desktop for buyer queries. Claude Desktop, in this setup, is a query interface — it sends a search request, the MCP server retrieves the relevant enriched listings, and returns structured data. The intelligence itself — the enriched corpus built from months of agent observations — never left the agency’s hardware. When Anthropic suspended access for foreign nationals, agents using a Fable 5 session found their query interface disrupted. The corpus, the enrichment, and the private seller intelligence: untouched.

This is the distinction worth holding onto. The intelligence is not stored in the model. The model is a processing step. The corpus is the product.

There is a second dimension to June 11 that is less discussed. Alongside the access suspension, Anthropic updated its data-handling terms for Mythos-class models to impose 30-day prompt and output retention for trust-and-safety review. This applies to Claude Console zero-data-retention workspaces, Claude Enterprise and Claude Code with ZDR configurations, AWS Bedrock integrations, and Google Cloud Agent Platform and Microsoft Foundry deployments — organisations that had specifically arranged zero-retention in their contracts. An agency that had processed client negotiation notes, undisclosed defect records, or owner financial circumstances through a Bedrock-connected Claude workflow discovered, on June 11, that it now had a data retention exposure it did not have the day before. Under GDPR, the lawful basis for that processing — and the retention it now entails — is not self-evident.

CasaSol’s architecture cannot produce this failure mode. There is no cloud processor. There is no data in transit. A retention policy change at Anthropic’s headquarters in San Francisco has no legal surface in an agency’s back office in Nueva Andalucía.

The standard argument against local inference is that cloud models are better. In April 2026, the local AI community ranked Gemma 4 second among all open-weight models for inference quality — behind only Qwen 3.5 — across 544 tracked sources. That gap, where it exists, is a trade-off CasaSol accepts deliberately. A model with superior benchmark scores that can be suspended overnight by foreign export controls, and which now retains data for 30 days even for customers who opted out of retention, is not a neutral capability. It is a dependency whose cost had simply not been invoiced yet.

The lesson that emerged from community discussion after June 11 is not new, but the event made it unavoidable: treat frontier APIs as unstable dependencies, maintain model portability, and verify outputs continuously with harnesses. These are standard engineering positions. It is unusual to discover their value only after an incident.

None of this is an argument against frontier models. The most capable AI systems available today are closed and cloud-hosted, and they will remain that way. The point is narrower: capability and deployment dependency are separate decisions, and conflating them is an engineering error with a cost that is now measurable.

A local inference setup running an open-weight model can be wrong, slow, or behind the capability frontier. It cannot be externally suspended. It cannot retain data outside its physical location. It cannot change its terms of service overnight.

The next Chronos experiment — exp_018 — will treat this as an evidence question rather than an assertion. Three scenarios: planned Ollama service interruption, planned model weight unavailability, and complete network isolation. The hypothesis is that all three produce service degradation without data loss, and that recovery is a configuration decision rather than a migration. That should be true. Publishing the result either confirms it, or surfaces exactly where the architecture falls short. Both outcomes are informative.

Sovereignty is not a preference. It is operational continuity — the ability to keep running when a dependency you did not control changes its terms.