Sandbox demo

Iterative Refinement

Three small turns beat one giant turn. How to size a single ask.

A common failure mode: 'Build the entire dashboard. Use these 8 components, fetch from these 3 endpoints, handle loading/error/empty states, add tests.' The agent gets 60% of it right and 40% wrong, and now you have a 400-line diff to debug.

Better: ship one slice at a time. Empty state. Loading state. One endpoint. One component. Each turn ends with a runnable, testable artifact. You compound small correct steps; you don't debug a huge wrong step.
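The slice-at-a-time idea can be sketched in a few lines of Python. This is a toy model, not how any real agent tool works: `Slice`, `run_slices`, and the lambda stand-ins are hypothetical names invented for illustration. Each `apply` stands in for one agent turn and each `check` stands in for running that slice's tests.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Slice:
    """One small ask plus the check that proves it worked (hypothetical structure)."""
    name: str
    apply: Callable[[dict], None]  # stands in for one agent turn
    check: Callable[[dict], bool]  # stands in for running the slice's tests


def run_slices(slices: List[Slice], state: dict) -> List[str]:
    """Apply slices in order and stop at the first one whose check fails."""
    done = []
    for s in slices:
        s.apply(state)
        if not s.check(state):
            # Only this one slice needs debugging, not the whole feature.
            break
        done.append(s.name)
    return done


# Toy stand-ins for the dashboard slices named above.
# The third slice is deliberately broken to show where the loop stops.
slices = [
    Slice("empty state", lambda st: st.update(empty=True), lambda st: st.get("empty") is True),
    Slice("loading state", lambda st: st.update(loading=True), lambda st: st.get("loading") is True),
    Slice("one endpoint", lambda st: st.update(endpoint="broken"), lambda st: st.get("endpoint") == "ok"),
    Slice("one component", lambda st: st.update(component=True), lambda st: st.get("component") is True),
]

completed = run_slices(slices, {})
print(completed)  # ['empty state', 'loading state']
```

The loop halts at the broken endpoint slice with two verified slices already in hand, so the thing left to debug is one small step instead of the whole dashboard.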

Sandbox transcript
Claude Code · sandbox replay

The transcript is canned: it makes no real API calls. The point is to internalize the loop: user → plan → tool → result → answer.
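As a rough illustration of that loop, here is a minimal Python sketch of a canned replay. It is not the lesson's actual transcript: the step texts, the `run_tests` call shown inside the tool step, and the `replay` and `follows_loop` helpers are all invented for this example, and nothing here calls a real API.

```python
from typing import Iterator, List, Tuple

# The five stages named in the text, in order.
STAGES = ("user", "plan", "tool", "result", "answer")

# A canned transcript as (stage, text) pairs. Contents are invented for illustration.
TRANSCRIPT: List[Tuple[str, str]] = [
    ("user", "Add an empty state to the dashboard."),
    ("plan", "Edit one component, then run its test."),
    ("tool", "run_tests('dashboard_empty_state')"),  # hypothetical tool call, shown as text only
    ("result", "1 passed"),
    ("answer", "Empty state added; the test passes."),
]


def replay(transcript: List[Tuple[str, str]]) -> Iterator[str]:
    """Yield one formatted line per step, with a step counter."""
    total = len(transcript)
    for i, (stage, text) in enumerate(transcript, start=1):
        yield f"{i} / {total} [{stage}] {text}"


def follows_loop(transcript: List[Tuple[str, str]]) -> bool:
    """True if the stages appear exactly in the user → plan → tool → result → answer order."""
    return tuple(stage for stage, _ in transcript) == STAGES


lines = list(replay(TRANSCRIPT))
for line in lines:
    print(line)
```

One turn here covers exactly one pass through the loop, which is the unit the lesson suggests sizing an ask around.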

Check your understanding
Q1. Why does sizing tasks small make agentic work more reliable?