Ten days ago, the U.S. Department of Commerce issued an export control directive that forced Anthropic to disable Fable 5 and Mythos 5 for every user worldwide. European companies woke up one morning and their most capable AI models were simply gone. No warning. No transition period. Just a switch flipped in Washington.
That single event turned āsovereign AIā from an abstract policy talking point into an urgent business requirement. And it made Apertus, the Swiss open foundation model, suddenly very relevant to a lot of people whoād never heard of it before.
What Apertus actually is
Apertus (Latin for āopenā) is a family of large language models developed by the Swiss AI Initiative, a collaboration between EPFL, ETH Zurich, and CSCS (the Swiss National Supercomputing Centre). It was first released in September 2025 and has since expanded into multiple sizes.
The model family currently includes:
- 0.5B parameters (edge/mobile deployment)
- 1.5B parameters (lightweight tasks)
- 4B parameters (the newest v1.1, released June 2026)
- 8B parameters (general purpose)
- 70B parameters (highest capability)
Itās a decoder-only transformer, pretrained on 15 trillion tokens using a staged curriculum of web, code, and math data. The architecture introduces a novel xIELU activation function, and the model was trained from scratch with the AdEMAMix optimizer. Post-training includes supervised fine-tuning and alignment via QRPO.
The entire thing is released under Apache 2.0. Not a custom license with weird restrictions. Not a ācommunity licenseā that bans commercial use over revenue thresholds. Proper Apache 2.0.
Who built it and why
This isnāt a startup project. Itās a national initiative funded through the ETH Domain (the Swiss federal research system). EPFL brings the machine learning research expertise. ETH Zurich contributes systems and engineering. CSCS provided the Alps supercomputer, using up to 4,096 GPUs for training.
Swisscom, Switzerlandās national telecom, deploys it on their sovereign Swiss AI platform for enterprise customers who need data residency guarantees.
The motivation was clear from the start: Europe needed a foundation model that was genuinely open, legally clean, and could be deployed on European infrastructure without depending on American companies or their governmentās policy decisions. What happened with Fable 5 proved the thesis correct, even if no one expected it to happen quite this dramatically.
What makes it different from other open models
Three things set Apertus apart from Llama, Mistral, and the rest of the open model ecosystem:
1. Fully open training data
This is the big one. When Meta says Llama is āopen,ā they mean the weights are available. The training data? Completely opaque. You have no idea what went in. Mistral is similar.
Apertus publishes everything. The architecture, the weights, the training recipes, and crucially, the training data reconstruction scripts. You can inspect and reproduce the entire pipeline. This matters for compliance, for research, and for trust.
2. GDPR and EU AI Act compliance by design
The training data was curated with European regulation in mind from day one. They respected opt-out consent of data owners, even applying it retroactively (honoring robots.txt directives from January 2025 even for data collected earlier). They avoid memorization of training data through what they call the āGoldfish objective.ā
The model ships with EU AI Act transparency documentation and a Code of Practice. There are dedicated contact addresses for PII removal requests and copyright takedown requests. This is the level of institutional responsibility youād expect from a federal research institution, not a startup trying to ship fast.
3. Genuine multilingual coverage
Apertus natively supports over 1,800 languages. Not as an afterthought. The training data uses 40% non-English content by design. For European organizations working across languages (EU institutions, multilateral organizations, companies operating in 20+ markets), this is genuinely useful.
The Fable 5 ban and why it changed everything
On June 12, 2026, the U.S. Department of Commerce ordered Anthropic to suspend access to Fable 5 and Mythos 5 for all foreign nationals, whether inside or outside the United States. Unable to verify nationality inside shared cloud infrastructure, Anthropic disabled both models worldwide.
This was unprecedented. It was the first time export controls had been used to shut down a major labās frontier AI systems. The immediate effect was that every European company using Anthropicās latest models lost access overnight.
The implications are broader than one model going offline. If the U.S. government can (and will) use export controls to restrict AI model access, then any European organization building on American AI models is accepting geopolitical risk. It doesnāt matter how good GPT-5.5 or Claudeās next model is if your government might lose access tomorrow.
This is where Apertus fits. Itās not the most capable model. Letās be honest about that. Itās not competing with frontier models on English-language reasoning or complex coding tasks. But it has something those models donāt: it canāt be turned off by a foreign government. It runs on European infrastructure, itās governed by European institutions, and its Apache 2.0 license means no one can revoke access once youāve downloaded it.
What Apertus is good at
For an open model of its size class, Apertus performs well on:
- Multilingual tasks: translation, summarization, and Q&A across European languages. The Ticino canton in Switzerland already uses it for in-house AI translation.
- General text generation: drafting, editing, reformulating content in multiple languages.
- Research and education: fully reproducible training pipeline means itās excellent for academic work.
- Compliance-sensitive deployments: when you need to demonstrate exactly what went into your model and where the data came from.
- Edge deployment: the 0.5B and 4B models run on constrained hardware, with INT3/INT4 quantized checkpoints available for Apple devices.
What Apertus isnāt good at
Being honest here. You shouldnāt pick Apertus if:
- You need absolute state-of-the-art English reasoning. GPT-5.5, Claude, and even Llama 4 Maverick will outperform the 70B Apertus on English benchmarks.
- You need frontier coding capabilities. Specialized coding models are still better for code generation.
- You need multimodal capabilities. Apertus is text-only for now. No image understanding, no audio.
- You only care about performance and donāt have compliance requirements. In that case, the biggest Llama or Mistral model will serve you better.
The strategic case for European organizations
Hereās my take. The Fable 5 ban was a wake-up call, but it shouldnāt have been a surprise. The trend toward using AI models as geopolitical leverage was visible for years. Export controls on chips. Restrictions on model weights. And now, access revocation for deployed models.
European organizations should be doing two things:
-
Diversifying their AI supply chain. Donāt build your entire product on a single American provider. Even if you keep using GPT or Claude for your primary workload, you need a fallback that canāt be switched off externally.
-
Building competence with open European models. Apertus isnāt the only option (Mistral is French, after all), but itās the most completely open one. If you invest in understanding and fine-tuning Apertus now, youāre building institutional knowledge that stays useful as the model improves.
Is Apertus a replacement for GPT-5.5? No. Is it a strategically important piece of infrastructure for European digital sovereignty? Absolutely. And given that itās Apache 2.0 and completely open, the barrier to trying it is essentially zero.
How to get started
All Apertus models are available on HuggingFace under the swiss-ai organization. The model is natively supported in the HuggingFace Transformers library (v4.56.0+), so you can load it with a few lines of Python. For production deployment, vLLM supports it out of the box with an OpenAI-compatible API.
If you want to try the newest release, swiss-ai/Apertus-v1.1-4B-Instruct is the latest model (June 2026). Itās small enough to run on a consumer GPU and gives you a good feel for the modelās capabilities.
FAQ
Is Apertus free to use commercially?
Yes. Itās released under Apache 2.0, which allows unrestricted commercial use, modification, and redistribution. Thereās no revenue cap or usage restrictions like youāll find with some Meta or Google model licenses.
How does Apertus compare to ChatGPT or Claude?
It doesnāt compete at the frontier level. Apertus 70B is roughly comparable to other open models in the 70B class (Llama 3.1 70B era). It wonāt match GPT-5.5 or Claude Fable on complex reasoning. Its strengths are openness, compliance, and multilingual coverage.
Can I fine-tune Apertus on my own data?
Absolutely. The open training recipes and Apache 2.0 license make fine-tuning straightforward. The 4B and 8B models are particularly practical for fine-tuning on a single GPU.
Is Apertus affected by the U.S. export controls?
No. Apertus is developed by Swiss federal research institutions, released under Apache 2.0, and can be deployed on European infrastructure. It has no dependency on U.S. companies or infrastructure.
What languages does Apertus support?
Over 1,800 languages natively. The training data was specifically curated with 40% non-English content, covering European languages, Asian languages, African languages, and indigenous languages. For European business use cases, it covers all EU official languages.
Where can I deploy Apertus?
Anywhere. You can run it on your own servers, on European cloud providers, or through Swisscomās sovereign AI platform. Since you download the weights directly, thereās no dependency on any external API that could be revoked.