DEVELOPER WORKFLOW

66 articles

All TopicsDeveloper WorkflowAI Agents AI Coding Claude Code Codex Security AI Security

LATEST

Harness Handbook Shows the Missing Map for Coding Agents

A July 2026 paper from Tencent Hunyuan turns agent harnesses into behavior-level maps. The useful lesson for builders is simple: code search is not enough when one behavior spans prompts, tools, state, permissions, and runtime policy.

July 16, 2026•8 min read

Read Article

New8 min read

SkillHone Shows Why Agent Skills Need Decision History

SkillHone is a July 2026 paper about evolving agent skills across sessions. The useful takeaway for developers is simple: do not save only the latest SKILL.md. Save the decisions that explain why it changed.

AI Agents Agent Skills AI Coding Developer Workflow Evals

New9 min read

Long-Horizon Terminal Bench Shows Why Coding Agents Still Stall

Long-Horizon-Terminal-Bench tests coding agents on 46 terminal tasks that can run for 90 minutes. The takeaway is not that agents are useless. It is that evals need to measure endurance, recovery, and partial progress.

AI Agents AI Coding Evals Benchmarks Developer Workflow

New9 min read

Microsoft's CLI Coding Agent Study: The Rollout Pattern Teams Should Copy

A Microsoft field study found that CLI coding-agent adoption spreads through peers and managers, while adopters merged roughly 24% more pull requests. The lesson is not to buy more seats. It is to instrument rollout, retention, cost, and review quality from day one.

AI Coding Coding Agents Claude Code GitHub Copilot Developer Workflow

New8 min read

Dockerless Verification Is The Next Coding Agent Bottleneck

ByteDance's Dockerless paper asks whether coding-agent patches can be verified without spinning up per-repo environments. The practical answer is not replace CI. It is use cheaper evidence before CI.

AI Agents AI Coding Developer Workflow CI/CD Research

New8 min read

Vera Shows Agent Safety Needs Test Oracles, Not Vibes

A new Vera paper tests Codex, Claude Code, OpenClaw, and Hermes with executable safety cases. The useful lesson is not panic. It is evidence-grounded agent QA.

AI Security AI Agents Codex Claude Code Developer Workflow

New7 min read

Program-as-Weights Turns Prompts Into Local Fuzzy Functions

The Program-as-Weights paper is a useful signal for developers: some LLM calls may move from per-request API prompts into compact local artifacts that behave like reusable fuzzy functions.

AI Coding Local AI LLM Research Developer Workflow

7 min read

Non-Developers Using AI Agents Need Platform Engineering

OpenAI's workplace agent data points to a practical shift: non-developers are starting to use agents for real work, so engineering teams need paved paths, policy, and receipts.

AI Agents OpenAI Platform Engineering Developer Workflow Enterprise AI

8 min read

Agent PR Governance: The New Rules for Copilot Reviews

GitHub's June Copilot review updates point to a practical policy stack for agent-authored pull requests: validation, review depth, repo instructions, attribution, and release-note accountability.

GitHub Copilot AI Code Review AI Agents Developer Workflow Governance

8 min read

Agent Sandbox Architecture: How to Choose the Right Runtime Boundary

AI agents are getting their own computers. Here is how to choose a sandbox architecture: filesystem isolation, network policy, secrets boundaries, snapshots, and when shell access is overkill.

AI Agents Security Agent Infrastructure Sandboxes Developer Workflow

8 min read

Agent Workflows as Code: Why State Machines Beat Prompt Checklists

Aharness, LangChain's custom harness pattern, and OpenAI's code-first migration all point to the same next step: agent processes need typed gates, validated evidence, and controlled transitions.

AI Agents Codex Agent Infrastructure Developer Workflow TypeScript

7 min read

Agentic AI Reliability Is a Systems Problem

The Bayer and Thoughtworks PRINCE case study is a useful reminder that reliable agentic AI comes from context routing, traces, evals, monitoring, and human review, not from a better prompt alone.

AI Agents Agent Infrastructure RAG Evals Developer Workflow

16 min read

The Definitive Guide to Loop Engineering in Claude Code and Codex

Goal, loop, routine. Three verbs, two tools, one hard part. A complete field guide to running agentic loops in Claude Code and Codex, the real commands, the patterns people actually run, and the two failure modes that burn money.

Loop Engineering Claude Code Codex AI Agents Automation Developer Workflow

Showing 12 of 65 articles

Keep exploring Developer Workflow

- Developer Workflow Topic Hub - tools and guides for Developer Workflow from the Developers Digest directory
- Compare Tools - dive deeper across the Developers Digest knowledge base
- Developers Digest on YouTube - video tutorials covering Developer Workflow and more

Explore 736 topics

Browse All Topics

DEVELOPER WORKFLOW

Harness Handbook Shows the Missing Map for Coding Agents

Keep exploring Developer Workflow

Get Smarter About AI Dev

DEVELOPER WORKFLOW

Harness Handbook Shows the Missing Map for Coding Agents

Keep exploring Developer Workflow

Get Smarter About AI Dev