8 items
3 posts, 1 tool, 4 guides
The Multi-Stream LLMs paper argues that agents are bottlenecked by single chat streams. The practical takeaway is not to rebuild everything today, but to design agent runtimes around separated channels.
A trending refusal-direction paper is a reminder that model safety cannot be treated as a thin refusal layer. Builders need layered controls around the model.
Researcher, auditor, reviewer, and other ready-made subagent types.
Prevent bloating the main conversation with research or exploration.
A new study from nrehiew quantifies a problem every Claude Code, Cursor, and Codex user has felt: models making huge diffs for tiny fixes. Here is why it happens, why tests do not catch it, and what to do about it.
Set up Codex Chronicle on macOS, manage permissions, and understand privacy, security, and troubleshooting.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.