4 items
4 posts
New benchmark data shows GPT-5.5 hallucinates 86% of the time when it does not know the answer - versus 28% for the open-weights GLM-5.2. The numbers challenge the assumption that bigger models equal more reliable output.
Claude Opus 4.7 vs GPT-5.5 for real TypeScript work. Benchmarks, pricing, model families, and practical differences.
OpenAI is turning ChatGPT into a hub. The new Apps feature lets you access external services directly inside conversations.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.