Iterative Refinement

Three small turns beat one giant turn. This lesson covers how to size a single ask.

A common failure mode: 'Build the entire dashboard. Use these 8 components, fetch from these 3 endpoints, handle loading/error/empty states, add tests.' The agent gets 60% of it right and 40% wrong, and now you have a 400-line diff to debug.

Better: ship one slice at a time. Empty state. Loading state. One endpoint. One component. Each turn ends with a runnable, testable artifact. You compound small correct steps; you don't debug a huge wrong step.
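
One way to picture the slice-by-slice approach is a loop that runs one small ask at a time and stops at the first failing check. This is a minimal sketch, not a real agent harness: the `Slice` type, the `run_slices` helper, and the lambda checks are all hypothetical stand-ins for "run the tests" or "load the page" after each turn.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Slice:
    name: str                  # one small ask, e.g. "empty state"
    check: Callable[[], bool]  # hypothetical verification run after the turn


def run_slices(slices: List[Slice]) -> List[str]:
    """Run slices in order and stop at the first one whose check fails."""
    done: List[str] = []
    for s in slices:
        if not s.check():
            # Only one small step needs debugging, not the whole feature.
            break
        done.append(s.name)
    return done


# Illustrative data: the third slice simulates a turn that went wrong.
slices = [
    Slice("empty state", lambda: True),
    Slice("loading state", lambda: True),
    Slice("one endpoint", lambda: False),
    Slice("one component", lambda: True),
]

completed = run_slices(slices)
print(completed)
```

The design choice is the early `break`: when a check fails, everything before it is already known to work, so the search space for the bug is a single turn.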

Sandbox transcript

Claude Code · sandbox replay

The transcript is canned — no real API calls. The point is to internalize the loop: user → plan → tool → result → answer.
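
The loop can be sketched as a replay over a fixed list of steps. The steps below are invented for illustration and are not the lesson's actual transcript; `CANNED_TURN` and `replay` are hypothetical names, and nothing here calls a real API.

```python
from typing import Iterator, List, Tuple

Step = Tuple[str, str]  # (role, text)

# Invented example turn following user -> plan -> tool -> result -> answer.
CANNED_TURN: List[Step] = [
    ("user", "Add an empty state to the dashboard."),
    ("plan", "Locate the dashboard view and render a placeholder when there is no data."),
    ("tool", "edit the dashboard view file"),
    ("result", "1 file changed"),
    ("answer", "Empty state added; ready for the next slice."),
]


def replay(steps: List[Step]) -> Iterator[str]:
    """Yield each canned step with its position, one at a time."""
    total = len(steps)
    for i, (role, text) in enumerate(steps, start=1):
        yield f"[{i}/{total}] {role}: {text}"


lines = list(replay(CANNED_TURN))
for line in lines:
    print(line)
```

Each turn in this sketch ends with an answer that hands back a small, checkable result, which is what keeps the next ask small as well.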

Check your understanding

Q1. Why does sizing tasks small make agentic work more reliable?
