AI AGENTS

204 items

198 posts, 2 tools, 4 guides

BlogJun 23, 2026

F3 Is a Reminder That File Formats Are Becoming Runtime Contracts

F3 is trending on Hacker News as a research prototype for a future-proof columnar file format. The useful takeaway is not to replace Parquet tomorrow. It is that data files are starting to carry more of their own runtime contract.

Data Engineering File Formats Wasm Developer Tools AI Agents

BlogJun 23, 2026

GitHub Copilot CLI, BYOK, and AI Credits: The New Cost-Control Stack

GitHub's June Copilot updates point beyond autocomplete: CLI access, bring-your-own-key model routing, AI credit metrics, and external agent providers make Copilot a governed agent platform.

GitHub Copilot AI Coding Developer Tools Pricing AI Agents

BlogJun 23, 2026

LangChain Rubrics Make Agent Evals Part of the Runtime

LangChain's rubrics for Deep Agents point at a practical agent pattern: self-correction works only when rubrics are versioned, executable, and sampled against human review.

LangChain Agent Evals AI Agents Developer Tools Reliability

BlogJun 23, 2026

Mistral OCR 4 and Unlimited OCR Make Document Parsing an Agent Runtime Choice

Mistral OCR 4 and Baidu's Unlimited OCR both hit Hacker News today. The useful takeaway for developers is that OCR is no longer just text extraction. It is becoming a runtime decision for document agents.

OCR Document AI AI Agents Mistral Open Source

BlogJun 23, 2026

OpenAI Agent Builder and Evals Are Shutting Down: Move the Agent Stack Into Code

OpenAI's June deprecations put Agent Builder, hosted Evals, and reusable prompts on a November 30 shutdown path. Here is the practical migration plan: Agents SDK, repo-owned prompts, and eval receipts.

OpenAI Agents SDK Agent Builder Evals AI Agents

BlogJun 23, 2026

OpenAI Daybreak Shows the AppSec Bottleneck Is Patching, Not Finding

OpenAI's Daybreak and Patch the Planet point at the real agentic AppSec shift: security agents only matter when they produce validated, reviewable patches maintainers can actually merge.

OpenAI Security AI Agents Codex AppSec

BlogJun 23, 2026

OpenMontage Shows the Real Future of AI Video: Agents, Not Editors

OpenMontage is trending because it treats video production like a repo-shaped agent workflow: scripts, assets, render pipelines, review loops, and coding agents working across the whole process.

AI Agents Video Open Source Claude Code Codex

BlogJun 23, 2026

Prompt Injection Is Really Role Confusion

New role-confusion research explains why prompt injection keeps surviving better prompts. Models do not reliably perceive which text is instruction, tool output, user content, or their own reasoning.

Prompt Injection AI Security AI Agents LLM Security Developer Tools

BlogJun 23, 2026

TikZ Editor Is a WYSIWYG LaTeX Figure Tool Built Almost Entirely by Codex

A developer used OpenAI Codex to build a fully open-source WYSIWYG editor for TikZ figures. The technical approach and reception on Hacker News offer a useful case study in what agent-built software looks like when shipped.

News Hacker News Codex AI Agents Developer Tools Open Source

BlogJun 22, 2026

Microsoft Agent Framework Developer Guide: AutoGen + Semantic Kernel Unified

Microsoft merged AutoGen and Semantic Kernel into a single production-ready SDK. Here is everything developers need to know: architecture, installation, migration paths, pricing, and when to use it over LangGraph or CrewAI.

Microsoft AI Agents AutoGen Semantic Kernel Agent Frameworks Python .NET

BlogJun 22, 2026

Oak: A New Version Control System Built for AI Agents

Oak rethinks version control for agentic workflows with virtual mounts, faster snapshots, and lower VCS-related token overhead. Here's what the HN community thinks about this Show HN.

News Hacker News Developer Tools AI Agents Version Control

BlogJun 22, 2026

Fugu Ultra's Frontier Performance Claim, Explained Without the Hype

Sakana says Fugu Ultra stands with Fable, Mythos, GPT-5.5, Gemini, and Opus by orchestrating models instead of being one giant model. Here is what the benchmarks show, what is novel, and what still needs proof.

ai-benchmarks ai-models model-routing ai-agents

BlogJun 22, 2026

Sakana Fugu Ultra: The Model Router Making the Frontier Look Less Proprietary

Sakana Fugu Ultra is not just another giant model. It is a learned orchestration layer that routes work across expert models, matches frontier benchmark claims, and makes a serious case for multi-model AI systems.

ai-models model-routing ai-agents open-models

BlogJun 21, 2026

Agentic AI Reliability Is a Systems Problem

The Bayer and Thoughtworks PRINCE case study is a useful reminder that reliable agentic AI comes from context routing, traces, evals, monitoring, and human review, not from a better prompt alone.

AI Agents Agent Infrastructure RAG Evals Developer Workflow

BlogJun 21, 2026

AI Coding Agents Move the Bottleneck to Review Queues

As coding agents get easier to delegate to, the scarce resource shifts from code generation to review capacity, CI minutes, environment reliability, and merge discipline.

AI Coding AI Agents Developer Tools GitHub Copilot Agent Infrastructure

BlogJun 20, 2026

Agent Evals Need Baseline Receipts

Hex's data-agent lab shows the practical eval pattern AI teams should copy: compare candidates against stable baselines, keep receipts, and judge changes by task behavior.

AI Agents Agent Infrastructure Developer Tools Evals Data Agents

BlogJun 20, 2026

Cloudflare Temporary Accounts: Let Agents Deploy Without OAuth Flows

Cloudflare shipped wrangler deploy --temporary on June 19, 2026. AI agents can now deploy Workers, D1 databases, and KV stores without browser auth flows. Here is how it works.

AI Agents Cloudflare Infrastructure Developer Tools

BlogJun 20, 2026

The Definitive Guide to Loop Engineering in Claude Code and Codex

Goal, loop, routine. Three verbs, two tools, one hard part. A complete field guide to running agentic loops in Claude Code and Codex, the real commands, the patterns people actually run, and the two failure modes that burn money.

Loop Engineering Claude Code Codex AI Agents Automation Developer Workflow

BlogJun 19, 2026

MCP Goes Stateless: The 2026-07-28 Migration Guide

The MCP 2026-07-28 release candidate drops sessions entirely. Here is what changes, what breaks, and how to migrate your MCP servers before the July 28 deadline.

MCP Model Context Protocol AI Agents Migration Guide TypeScript

BlogJun 19, 2026

Zero-Touch OAuth Is the MCP Feature Enterprises Were Waiting For

MCP's new enterprise-managed authorization flow is not just less login friction. It moves agent tool access into identity, policy, and audit systems enterprises already understand.

MCP AI Agents AI Security Developer Workflow Enterprise AI

PreviousPage 3 of 11Next

Get Smarter About AI Dev

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.

One email per weekReal code, not theoryFree forever

Browse All Tags