TL;DR
How to use AI agents to plan, scaffold, build, test, and deploy a SaaS product. Parallel development patterns, real workflow examples, and the operational details that determine whether your AI-assisted build succeeds or fails.
Building a SaaS product used to take a team of three to five engineers several months to reach launch. In 2026, a single developer with AI agents can ship a production-ready SaaS in days to weeks. This is not hyperbole. It is the documented reality of how products are being built right now.
But the gap between "can ship in days" and "does ship in days" is entirely about workflow. Developers who use AI agents like faster typing assistants get incremental speedups. Developers who restructure their entire workflow around agent capabilities get 10x to 50x improvements.
This is the complete workflow for building a SaaS product with AI agents, from the first idea to the first paying customer.
The planning phase is where most AI-assisted builds either succeed or fail, and most developers skip it entirely. They open their terminal, start prompting, and let the agent figure it out. This produces working code that goes in the wrong direction.
Before touching code, write a Product Requirements Document. This is not a 40-page enterprise document. It is 1 to 2 pages that answer five questions: what the product does, who it is for, which features matter and in what priority order, what stack it runs on, and what the data model looks like.
The PRD serves two audiences: you (for clarity) and the AI agent (for context). When you start coding, the PRD becomes part of your project context. The agent reads it and understands what it is building and why.
# CronWatch PRD
Cron job monitoring SaaS. Developers add their cron jobs, set alert
rules, and get notified when jobs fail, run late, or produce errors.
## Users
Developers and DevOps engineers running scheduled tasks in production.
## Core Features (Priority Order)
1. Dashboard showing all monitored cron jobs with status
2. Webhook endpoint that cron jobs ping on start/complete/fail
3. Alert rules: email and Slack when job fails or misses window
4. Run history with logs and duration tracking
5. Team support: invite members, share dashboards
6. Pricing: Starter ($9, 10 jobs), Pro ($29, 50 jobs), Business ($79, unlimited)
## Stack
- Next.js 16 (App Router, Server Components)
- Neon (Postgres via Drizzle ORM)
- Clerk (auth, organizations)
- Tailwind (styling)
- Deployed on Coolify (Hetzner VPS)
## Data Model
- users: id, clerkId, email, plan, createdAt
- cronJobs: id, userId, name, schedule, webhookUrl, status, lastPingAt
- cronRuns: id, jobId, startedAt, completedAt, status, logs
- alerts: id, jobId, type, channel, config
This PRD is 30 lines. It took 15 minutes to write. It will save hours of misdirected agent work because every coding session starts with the agent knowing exactly what it is building.
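The PRD's data model translates directly into types the agent can work against. A minimal sketch in TypeScript, with field names taken from the PRD; in the real project these would be Drizzle table definitions rather than plain interfaces:

```typescript
// Plain TypeScript mirror of the PRD data model. Shown as interfaces
// to keep the sketch dependency-free; the project itself would define
// these as Drizzle tables in src/db/schema.ts.
type JobStatus = "healthy" | "late" | "failed";

interface User {
  id: string;
  clerkId: string;
  email: string;
  plan: "starter" | "pro" | "business";
  createdAt: Date;
}

interface CronJob {
  id: string;
  userId: string;
  name: string;
  schedule: string; // cron expression, e.g. "0 3 * * *"
  webhookUrl: string;
  status: JobStatus;
  lastPingAt: Date | null;
}

interface CronRun {
  id: string;
  jobId: string;
  startedAt: Date;
  completedAt: Date | null;
  status: "running" | "complete" | "failed";
  logs: string;
}

interface Alert {
  id: string;
  jobId: string;
  type: "failure" | "missed_window";
  channel: "email" | "slack";
  config: Record<string, string>;
}
```

Writing the model down this precisely is what lets every agent session agree on field names without re-negotiating them.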
Do not give the agent the entire PRD and say "build this." That is the equivalent of handing a new hire a 50-page spec on their first day. Instead, decompose the PRD into iterations of 2 to 4 features each.
Iteration 1: Foundation
- Project scaffold (Next.js + Clerk + Neon + Drizzle)
- Database schema for users, cronJobs, cronRuns
- Auth flow: sign up, sign in, dashboard shell
- Basic dashboard showing empty state
Iteration 2: Core Feature
- Webhook endpoint for cron job pings
- Job status tracking (healthy, late, failed)
- Run history table with logs
- Dashboard populated with real data
Iteration 3: Alerts
- Alert rules configuration UI
- Email notification system
- Slack webhook integration
- Alert history and acknowledgment
Iteration 4: Billing and Launch
- Stripe integration via Autumn credits
- Pricing page with plan comparison
- Plan-gated feature limits
- Landing page, deploy, go live
Each iteration is a self-contained unit of work that produces a deployable increment. The agent can complete an iteration in a focused session. You review, adjust, and move to the next iteration.
Scaffolding is the phase where AI agents provide the most dramatic speedup. What takes a developer 2 to 4 hours of manual setup takes an agent 5 minutes.
Read the PRD in CLAUDE.md. Set up Iteration 1:
1. Initialize Next.js 16 with TypeScript, Tailwind, App Router
2. Install and configure Clerk (auth provider)
3. Install and configure Drizzle ORM with Neon Postgres
4. Create the database schema from the PRD data model
5. Set up the project structure:
- src/db/schema.ts (Drizzle schema)
- src/db/queries/ (query functions)
- src/actions/ (server actions)
- src/components/ (React components)
6. Create a basic authenticated dashboard shell
7. Push the schema to the database
8. Commit with message "scaffold: next.js + clerk + neon + drizzle"
The agent handles dependency installation, configuration file creation, environment variable setup, schema definition, and the initial page structure. The output is a working, authenticated application with a database connection.
Even with a clear prompt, scaffolding can go wrong. Watch for these:
Dependency version conflicts. AI agents sometimes install incompatible package versions. Review the package.json before moving on. Run the dev server and verify it starts cleanly.
Auth configuration gaps. Clerk needs environment variables, middleware configuration, and provider wrapping. Verify the full auth flow works: sign up, sign in, redirect to dashboard, sign out.
Database connection. Push the schema and verify tables exist. Run a simple query to confirm the connection works through the ORM.
Type safety. Check that TypeScript strict mode is enabled and that the schema types propagate to queries and server actions. Type errors caught here prevent cascading issues later.
The scaffold phase should end with a running application that you can sign into and see an empty dashboard. If it does not run, fix it before proceeding. Technical debt compounds faster with AI agents because they build on top of whatever exists.
This is where the workflow diverges from traditional development. Instead of building features sequentially, you build them in parallel using multiple agent sessions.
Identify features within an iteration that have no dependencies on each other. Assign each to a separate agent session or worktree.
For Iteration 2 of CronWatch:
Agent 1 (worktree: feature/webhook-endpoint):
Build the webhook endpoint for cron job pings.
- POST /api/webhook/:jobId with start/complete/fail events
- Update job status and create cronRun records
- Handle duplicate pings and out-of-order events
Agent 2 (worktree: feature/run-history):
Build the run history table component.
- Query cronRuns for a given job
- Display start time, duration, status, truncated logs
- Pagination for jobs with many runs
- Empty state for jobs with no runs yet
Agent 3 (worktree: feature/dashboard-populated):
Populate the dashboard with real job data.
- Query all cronJobs for the authenticated user
- Display status badges (healthy/late/failed)
- Last ping timestamp and next expected ping
- Quick actions: pause, delete, view history
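The trickiest requirement in Agent 1's brief is handling duplicate pings idempotently. A framework-free sketch of that logic, using an in-memory store and an illustrative `processPing` function; the real endpoint would be a Next.js route handler writing through Drizzle:

```typescript
// Minimal sketch of the webhook ping logic: a duplicate ping for the
// same job and event arriving within 1 second is treated as a retry
// and ignored, so retried requests never create a second cronRun.
type PingEvent = "start" | "complete" | "fail";

interface Ping {
  jobId: string;
  event: PingEvent;
  receivedAt: number; // epoch ms
}

const lastPing = new Map<string, Ping>(); // last ping seen per job
const runs: Ping[] = [];                  // stands in for cronRun inserts

function processPing(
  jobId: string,
  event: PingEvent,
  now: number
): "recorded" | "duplicate" {
  const prev = lastPing.get(jobId);
  if (prev && prev.event === event && now - prev.receivedAt < 1000) {
    return "duplicate"; // same event within the window: idempotent no-op
  }
  const ping = { jobId, event, receivedAt: now };
  lastPing.set(jobId, ping);
  runs.push(ping);
  return "recorded";
}
```

The same comparison also gives you a hook for out-of-order events: a `complete` arriving before its `start` can be detected by checking `prev` before updating state.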
Each agent works in its own Git worktree, making changes that do not conflict with the others. When all three finish, you merge the worktrees in sequence, resolving any conflicts.
Sequential development has a fundamental bottleneck: the developer's context window. You can only think about one feature at a time. While you build the webhook endpoint, the dashboard and run history are blocked.
Parallel agent development removes this bottleneck. Three features that would take three sequential days take one parallel day. The developer's job shifts from writing code to reviewing and integrating code.
The constraint on parallelism is dependency. Features that depend on each other's output cannot run in parallel. The webhook endpoint must exist before the dashboard can display real data. But within a dependency layer, everything is parallelizable.
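The "within a dependency layer, everything is parallelizable" rule can be made mechanical. A sketch, with feature names from the CronWatch iterations and an illustrative `parallelBatches` function:

```typescript
// Group features into batches: each batch contains only features whose
// dependencies all landed in earlier batches, so everything within a
// batch can run as simultaneous agent sessions.
function parallelBatches(deps: Record<string, string[]>): string[][] {
  const done = new Set<string>();
  const batches: string[][] = [];
  let remaining = Object.keys(deps);
  while (remaining.length > 0) {
    const ready = remaining.filter(f => deps[f].every(d => done.has(d)));
    if (ready.length === 0) throw new Error("dependency cycle");
    batches.push(ready);
    ready.forEach(f => done.add(f));
    remaining = remaining.filter(f => !done.has(f));
  }
  return batches;
}

// The dashboard needs the webhook endpoint's output to show real data;
// run history only needs the schema, which already exists on main.
const batches = parallelBatches({
  "webhook-endpoint": [],
  "run-history": [],
  "dashboard-populated": ["webhook-endpoint"],
});
```

Here the first batch is the webhook endpoint and run history in parallel, and the dashboard follows in a second batch once the endpoint's contract exists.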
Git worktrees are the mechanism that makes parallel development practical. Each worktree is a separate checkout of the same repository, allowing multiple branches to be active simultaneously.
# Create worktrees for parallel features
git worktree add ../cronwatch-webhook feature/webhook-endpoint
git worktree add ../cronwatch-history feature/run-history
git worktree add ../cronwatch-dashboard feature/dashboard-populated
# Run agents in each worktree
cd ../cronwatch-webhook && claude
cd ../cronwatch-history && claude
cd ../cronwatch-dashboard && claude
When agents complete their work:
# Merge back to main
git checkout main
git merge feature/webhook-endpoint
git merge feature/run-history
git merge feature/dashboard-populated
# Clean up worktrees
git worktree remove ../cronwatch-webhook
git worktree remove ../cronwatch-history
git worktree remove ../cronwatch-dashboard
The key discipline is keeping features genuinely independent within a parallel batch. If Agent 2 needs to import a type that Agent 1 is creating, they are not independent. Move the shared type to the base branch before starting the parallel batch.
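For example, in the Iteration 2 batch both the webhook endpoint and the run-history table need the ping event shape, so that type belongs on main before the worktrees are created. The file path and type names here are illustrative:

```typescript
// src/types/ping.ts -- committed to main before the parallel batch,
// so every feature branch imports the same shape instead of each
// agent inventing its own copy that conflicts at merge time.
export type PingEvent = "start" | "complete" | "fail";

export interface WebhookPing {
  jobId: string;
  event: PingEvent;
  durationMs?: number;
}
```

A two-minute commit like this on the base branch is much cheaper than reconciling three divergent definitions during the merge.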
Within each feature (whether parallel or sequential), the agent follows a consistent build loop.
Start every feature with a plan prompt. The agent reads the project context, understands the current state of the codebase, and outlines its approach before writing code.
Plan the webhook endpoint feature. Read the schema, understand the
existing patterns in src/actions/ and src/db/queries/, and outline:
1. What files to create or modify
2. What the API contract looks like
3. How you'll handle edge cases (duplicate pings, invalid jobId)
4. What tests you'll write
Review the plan before approving execution. This is your highest-leverage intervention point. Catching an architectural mistake at the planning stage costs 30 seconds. Catching it after implementation costs an hour.
Approve the plan and let the agent implement. For most features, this is hands-off. The agent creates files, writes code, and makes incremental progress.
Watch the terminal output but do not interrupt unless the agent is going clearly wrong. Frequent interruptions break the agent's multi-step reasoning chain and produce worse results than letting it complete and then correcting.
After implementation, verify the feature works.
Run the dev server. Navigate to the webhook endpoint.
Send a test POST request with curl:
curl -X POST http://localhost:3000/api/webhook/test-job-id \
-H "Content-Type: application/json" \
-d '{"event": "complete", "duration": 1234}'
Verify the response and check the database for the new cronRun record.
Automated verification is better than manual verification. If the agent wrote tests, run them. If it did not, have it write tests before moving on.
Read the code the agent wrote. Not every line, but the important parts: data access patterns, error handling, security boundaries, and type definitions.
AI-generated code has consistent failure patterns:
Loose typing (`any` scattered through interfaces), missing authorization checks, swallowed errors, unvalidated input, and race conditions in concurrent updates. These are the things to look for during review. The structural code (routing, component layout, query construction) is usually correct.
Commit after every feature. Do not batch multiple features into one commit. Atomic commits make it possible to revert a single feature without losing others.
Commit this feature with message:
"add webhook endpoint for cron job pings"
Testing in an AI-agent workflow is different from traditional testing because the agent can both write and run tests.
Write tests for the webhook endpoint:
1. Happy path: valid ping creates a cronRun record
2. Invalid jobId returns 404
3. Duplicate ping within 1 second is idempotent
4. Missing event field returns 400
5. Unauthorized request returns 401
Use vitest. Mock the database layer. Run the tests and fix any failures.
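The five cases in the prompt reduce to checks against a small validator. A sketch of the logic under test, with `validatePing` and the job set as illustrative names; the real project would run these as vitest cases with the database layer mocked, but the assertions are framework-free here:

```typescript
// Request validation for the webhook endpoint, matching the prompt's
// cases: missing auth -> 401, unknown job -> 404, missing event -> 400,
// otherwise 200 and the handler proceeds to create a cronRun.
const knownJobs = new Set(["job-1", "job-2"]); // stands in for a DB lookup

function validatePing(
  jobId: string,
  body: { event?: string },
  authHeader: string | undefined
): number {
  if (!authHeader) return 401;
  if (!knownJobs.has(jobId)) return 404;
  if (!body.event) return 400;
  return 200;
}
```

Note the check order: auth first, existence second, shape last, so an unauthenticated caller cannot probe which job IDs exist.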
The agent writes the tests, runs them, sees failures, and fixes them. This test-write-fix loop is one of the strongest applications of AI agents because the feedback loop is immediate and unambiguous. The test either passes or it does not.
Unit tests verify individual functions. Integration tests verify that the system works end to end. Have the agent write integration tests that exercise the full stack:
Write an integration test that:
1. Creates a test user via Clerk test helpers
2. Creates a cron job via the API
3. Sends a webhook ping
4. Verifies the dashboard shows the updated status
5. Cleans up test data
Integration tests are harder for agents to write correctly because they depend on external services (database, auth provider). Provide explicit instructions about how to set up test fixtures and mock external dependencies.
AI agents are better at writing tests than most developers assume. They are worse at knowing which tests matter. An agent will happily write 50 unit tests for a utility function and zero tests for the authorization logic that protects user data.
Your job as the developer is to direct testing effort toward the highest-risk code. Security boundaries, payment processing, data mutations, and authorization checks need thorough tests. Rendering logic and formatting utilities need minimal tests.
Billing is the phase where most AI-built projects stall. The agent can scaffold Stripe integration, but the business logic around plans, limits, and upgrades requires careful specification.
Implement Stripe billing via Autumn SDK:
Plans:
- Starter ($9/mo): 10 monitored jobs, email alerts only
- Pro ($29/mo): 50 monitored jobs, email + Slack alerts
- Business ($79/mo): Unlimited jobs, all alert channels, priority support
Implementation:
1. Pricing page at /pricing with plan comparison
2. Checkout flow: pricing -> Stripe checkout -> redirect to dashboard
3. Plan enforcement: check user's plan before creating jobs
4. Upgrade/downgrade flow in account settings
5. Middleware: redirect unauthenticated users to /pricing, not /dashboard
6. Owner bypass: my account (isOwner flag) skips all billing checks
The specification must be explicit about enforcement. "Check user's plan before creating jobs" tells the agent to add authorization logic. Without this, the agent builds a pricing page that looks correct but does not actually gate features.
The most common mistake in AI-built billing systems is decorative pricing. The pricing page exists. The checkout flow works. But the application does not actually enforce limits. Users on the free tier can access pro features because no middleware checks their subscription status.
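Enforcement, not the pricing page, is the hard requirement. A sketch of the server-side checks, with limits taken from the PRD's plans and illustrative function names:

```typescript
// Server-side plan enforcement: every job-creation and alert-channel
// mutation calls these checks before touching the database. If these
// calls are absent from the mutation path, the pricing is decorative.
const JOB_LIMITS: Record<string, number> = {
  starter: 10,
  pro: 50,
  business: Infinity,
};

const SLACK_PLANS = new Set(["pro", "business"]);

function canCreateJob(plan: string, currentJobCount: number): boolean {
  return currentJobCount < (JOB_LIMITS[plan] ?? 0);
}

function canUseSlackAlerts(plan: string): boolean {
  return SLACK_PLANS.has(plan);
}
```

The checks must live in server actions or API routes, not in the UI: hiding a button client-side gates nothing.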
Verify plan gating explicitly:
Test plan gating:
1. Sign in as a Starter user
2. Try to create an 11th cron job
3. Verify the app shows an upgrade prompt, not a success message
4. Try to add a Slack alert
5. Verify the app shows "Pro plan required"
If the agent did not implement enforcement, it will fail these tests. Fix it before launch.
Deployment is the easiest phase to automate and the one where most developers waste the most time.
Deploy to production:
1. Environment variables:
- Verify all env vars are set in Coolify/Vercel
- DATABASE_URL, CLERK_SECRET_KEY, STRIPE_SECRET_KEY, etc.
2. Database:
- Run migrations against production database
- Verify schema matches local
3. Build:
- Run production build locally first: npm run build
- Fix any build errors before pushing
4. DNS:
- Add A record pointing to server IP
- Add CNAME for www subdomain
- Wait for propagation (usually < 5 minutes with Cloudflare)
5. Push:
- git push origin main
- Monitor build logs in Coolify
- Verify deployment at production URL
6. Smoke test:
- Sign up flow
- Create a cron job
- Send a webhook ping
- Verify alert delivery
The agent can execute most of this checklist autonomously. The developer verifies the smoke tests and handles any DNS issues that arise.
The first 24 hours after deployment are critical. Set up basic monitoring:
Add a health endpoint at /api/health that checks:
1. Database connection (run a simple query)
2. Clerk auth service availability
3. Response time under 200ms
Add error tracking via a simple error boundary that logs
to the server. We'll add proper monitoring later.
Start simple. A health endpoint that returns 200 or 500 is enough for launch. Add Sentry, DataDog, or equivalent later when you have users generating real traffic.
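A minimal health check matching the prompt above. The probe functions are injected so the sketch runs without a real database or auth service; the route handler itself would just return this status:

```typescript
// Health check: 200 only if every dependency probe succeeds within the
// time budget; any failure or timeout collapses to a single 500.
type Probe = () => Promise<void>;

async function healthStatus(
  probes: Record<string, Probe>,
  timeoutMs = 200
): Promise<200 | 500> {
  try {
    await Promise.all(
      Object.values(probes).map(p =>
        Promise.race([
          p(),
          new Promise<never>((_, reject) =>
            setTimeout(() => reject(new Error("timeout")), timeoutMs)
          ),
        ])
      )
    );
    return 200;
  } catch {
    return 500;
  }
}
```

Deliberately binary: uptime monitors only need pass or fail, and per-dependency detail can wait until you add real observability tooling.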
Launching is not the end. It is the beginning of the feedback loop that actually matters.
The first real users will find issues that no amount of testing catches. The workflow for handling feedback with AI agents: paste the bug report into an agent session, have the agent reproduce and investigate the issue, let it implement a fix, review the change, and deploy.
This loop can turn around bug fixes in minutes instead of hours. The agent handles the tedious parts (reproduction, investigation, fix implementation) while you handle the judgment parts (is this the right fix, does it introduce regressions, should we prioritize this).
New features follow the same pattern as the initial build. Write a spec, decompose into tasks, execute with agents, review, deploy. Each iteration builds on the foundation that previous iterations created.
The key advantage of AI-assisted iteration is speed. A feature that would take a traditional sprint (two weeks) can ship in a day. This speed means you can respond to user feedback faster, which means users see their requests implemented quickly, which means higher retention.
Starting an agent session without CLAUDE.md, without a PRD, without architecture documentation. The agent guesses at conventions and makes decisions you would not make. Every session requires corrections that compound into technical debt.
Fix: Invest 30 minutes in project context before writing any code. Update the context as the project evolves.
Trusting agent output without reading it. The agent writes correct-looking code that has subtle bugs: missing auth checks, race conditions, unhandled edge cases. These ship to production and become user-facing bugs.
Fix: Review every feature before merging. Focus on security boundaries, data mutations, and error handling.
Giving the agent a 500-word prompt that describes an entire feature with all its edge cases. The agent loses track of requirements in the middle and produces incomplete implementations.
Fix: Decompose complex features into sequential steps. Each prompt should describe one clear unit of work.
Shipping features without tests because the agent did not write them automatically. When the next iteration modifies shared code, existing features break silently.
Fix: Make tests part of every feature prompt. "Implement X and write tests for Y" should be the standard format.
Asking the agent to build performance optimizations, caching layers, and scaling infrastructure before you have users. This wastes time on problems that do not exist yet.
Fix: Launch with the simplest implementation that works. Optimize when monitoring shows you where the bottlenecks actually are.
A SaaS product that took a 4-person team three months to build now takes one developer with AI agents two to four weeks. The cost breakdown:
| Item | Traditional | AI-Assisted |
|---|---|---|
| Developer salaries (3 months) | $60,000 to $150,000 | $10,000 to $25,000 (1 developer) |
| AI tooling | $0 | $200 to $400/month |
| Infrastructure | $100 to $500/month | $50 to $200/month |
| Time to first revenue | 3 to 6 months | 2 to 4 weeks |
The math is not close. AI-assisted development does not just reduce cost. It compresses the time to revenue, which changes the economics of what is worth building. Products that were not viable with a three-month development cycle become viable with a two-week cycle.
This is not a future state. This is how SaaS products are being built and shipped today. The workflow is learnable, the tools are available, and the only barrier is adopting a new way of working.