2 items
2 posts
Fable 5 and Mythos 5 are gone for now. Here is the honest ranking of what to use today, from Opus 4.8 to GPT-5.5 to open-weight models, by task.
Benchmarks are useful, but frontend work fails in places leaderboards barely measure. Here is how Web Dev Arena turns AI model comparison into a practical UI evaluation workflow.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.