अध्याय छह

Agents
बनाना

Calculator और coworker के
बीच का फ़र्क़।

Chatbot तुम्हारे बोलने का इंतज़ार करता है। Agent नहीं करता। उसे एक goal दो — "Mars colonization पर research करो और एक report लिखो" — और वो काम को steps में तोड़ता है, सही tools चुनता है, एक-एक करके execute करता है, अपने results check करता है, और काम पूरा होने तक चलता रहता है। ये फ़र्क़ — instructions का जवाब देने और एक goal pursue करने में — यही calculator और coworker का फ़र्क़ है।

Agents एक loop पर चलते हैं: goal लो, steps plan करो, tools से execute करो, results observe करो, progress evaluate करो, adjust करो। वो एक बार plan बनाकर blindly execute नहीं करते — सबसे अच्छे agents gracefully recover करते हैं जब step तीन बिगड़ जाए। लेकिन इस power का एक dark side है: agents सिर्फ़ ग़लत जवाब नहीं देते, वो उन पर act करते हैं, फिर टूटे हुए result से अगला decision लेते हैं। Failures compound होती हैं।

Agent Loop

हर agent एक ही पाँच-step cycle पर चलता है। ये steps समझना — यही फ़र्क़ है कि तुम कुछ काम का बनाओ या कुछ ऐसा जो infinite loop में तुम्हारे API credits जला दे।

Goal

एक clear objective लो। "Mars पर research करो" goal नहीं है। "Mars colonization की challenges का 500 शब्दों का summary लिखो, sources के साथ" — ये goal है।

Plan

Goal को concrete steps में तोड़ो। अच्छे planners flexible plans बनाते हैं। बुरे planners rigid scripts बनाते हैं जो पहले surprise पर टूट जाती हैं।

Execute

हर step को सही tool से run करो — search engines, code interpreters, file readers, APIs। Agents सिर्फ़ सोचते नहीं। वो करते हैं।

Observe

क्या हुआ record करो और results check करो। Memory के बिना, agent शायद वही failing action बार-बार दोहराए।

Evaluate

ये काम किया? Goal के क़रीब पहुँचे? कुछ ग़लत है तो plan adjust करो और दोबारा loop करो। यही agent को script से अलग करता है।

Key insight

Chatbot एक single turn है। Agent एक पूरी conversation है — ख़ुद से, अपने tools से, और दुनिया से। सबसे अच्छे agent builders autonomy maximize नहीं करते। वो सोच-समझकर checkpoints design करते हैं जहाँ agent रुकता है और इंसान को decide करने देता है कि आगे बढ़ें या नहीं।

अब ख़ुद बनाने का time है। एक tank agent design करो — उसकी capabilities चुनो, budget allocate करो, strategy लिखो — फिर arena में deploy करो और उसे autonomously लड़ते देखो।

Agent Arena

Configure

Capabilities

5 pts left

Armor

Speed

Power

Rate

Range

Strategy (system prompt)

Close distance. Fire constantly. Prioritize damage over survival.

Opponent

Share this course

Goal ये नहीं है कि हर failure रोको — goal ये है कि failure recoverable हो।

अगला: तुम एक असली agent use करोगे — जो तुम्हारा code पढ़ता है, changes लिखता है, tests run करता है, और अपनी ग़लतियाँ ख़ुद fix करता है। मिलो coding agents से।

Agent Swarm

New tool unlocked!

Agentsबनाना

Agent Loop

Agent Swarm

Agent Swarm

Agents
बनाना