
TL;DR
Switzerland's fully open foundation model promises transparent training data and EU compliance. The HN crowd has questions about actual performance.
The Swiss AI Initiative just shipped Apertus, a fully open foundation model developed by EPFL, ETH Zurich, and the Swiss National Supercomputing Centre (CSCS). The pitch: complete transparency, EU AI Act compliance, and training data you can actually inspect.
With 497 points and 167 comments on Hacker News, the response is... complicated.
Apertus ships in two sizes - 8B and 70B parameters - trained on 15 trillion tokens across 1,000+ languages. The headline differentiator is openness: training data, code, weights, methods, and alignment principles are all documented and reproducible.
From the official site:
Fully open model: open weights + open data + full training details including all data and training recipes.
The model was trained on CSCS's Alps supercomputer using up to 4,096 GPUs. Swisscom serves as the strategic partner, and you can access it through Hugging Face or the Public AI network.
Key claims:
The HN discussion splits into a few camps:
The core appeal is scientific reproducibility. One commenter put it well:
As long as the following remains true, this release ends up a bigger contribution to science at large than most other models trained "behind closed doors."
Several users pointed out that Apertus joins a small club of genuinely open models - alongside Allen AI's OLMo 3.1, MBZUAI's K2 Think V2, and Nvidia's Nemotron (though Nemotron has some proprietary data).
The most upvoted criticism is simple: is the model actually good?
One user shared benchmarks from Artificial Analysis showing Apertus trailing Nemotron significantly. The comparison link shows Nvidia's Nemotron-3-Ultra-550B and Super-120B outperforming both Apertus variants on standard intelligence benchmarks.
Another commenter reported:
The previous version of this model has been pretty bad, but claimed to adhere to copyright laws. However, based on my testing, that's not true either.
The multilingual claims also drew skepticism. One user noted that for "simple questions like 'how to say X in language Y' or 'how to conjugate verb X in language Y'" - the model "keeps hallucinating words that do not exist."
This is where things get philosophically interesting. Simon Willison noted that Apertus "uses fineweb, which is derived from Common Crawl, which is an unlicensed scrape of web pages."
This prompted a subthread about whether "sovereign AI" can really claim ethical high ground while using scraped data. One response:
You don't need a license to scrape the public web and analyze it, turn it into tokens and other transformations. Let's not expand copyright beyond the horrible monster it already is.
Willison's counterpoint:
That doesn't mean much to the many people I know of who refuse to use a technology that they see as being unethically created using the work of others without compensating them.
Several comments touched on why European AI independence matters. One user argued:
Frankly, I'm surprised there's not more urgency on the part of Europeans to reduce dependence on US tech.
The argument: even if Apertus isn't state-of-the-art today, having European-controlled AI infrastructure matters for data sovereignty and regulatory compliance.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 22, 2026 • 5 min read
Jun 22, 2026 • 8 min read
Jun 22, 2026 • 5 min read
Jun 22, 2026 • 6 min read
If you're evaluating Apertus for production use, here's what the evidence suggests:
Strengths:
Weaknesses:
Best use cases:
One commenter who actually uses Apertus in production:
I use it extensively. It is not ready for agentic use, but as a generic driving model for RAG use cases, it is pretty competent. You can build useful software with it.
The debate in this thread mirrors a broader tension in AI development: should we prioritize transparency and ethical sourcing even at the cost of capability? Or does "open" only matter if the model is actually competitive?
For now, Apertus represents a credible attempt at the transparency-first approach. Whether the capability gap closes depends on continued funding and research. The Swiss AI Initiative has the institutional backing - EPFL and ETH Zurich are serious players - but catching up to labs spending billions requires more than good intentions.
If you want a model where you can trace every training decision, Apertus delivers. If you need frontier performance, you're still looking at Nemotron, DeepSeek, or the closed models.
The market has room for both.
Read next
New benchmark data shows GPT-5.5 hallucinates 86% of the time when it does not know the answer - versus 28% for the open-weights GLM-5.2. The numbers challenge the assumption that bigger models equal more reliable output.
6 min readModern LLMs now use MoE routing, mixed attention variants, and fused vision encoders. The simple transformer stack is gone - here's what replaced it and why it matters for developers.
6 min readEpic Games open-sourced Lore, a centralized version control system designed for binary-heavy game projects. It uses Merkle trees, on-demand file hydration, and native chunked storage to handle terabyte-scale repos that Git struggles with.
7 min readTechnical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.

New benchmark data shows GPT-5.5 hallucinates 86% of the time when it does not know the answer - versus 28% for the open...

Modern LLMs now use MoE routing, mixed attention variants, and fused vision encoders. The simple transformer stack is go...

A YC W25 startup open-sources CADAM, a browser-based tool that converts natural language to parametric OpenSCAD models....

Epic Games open-sourced Lore, a centralized version control system designed for binary-heavy game projects. It uses Merk...

A developer discovered that Claude Code's thinking output is summarized, not the raw reasoning. Here's what Anthropic's...

A bug in OpenAI's Codex CLI writes excessive trace logs to SQLite, potentially consuming 640TB/year of SSD writes. The i...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.