EB-6
Evaluation, Failures & Tuning
Agents fail differently from traditional software — usually silently — so measuring them requires a discipline of its own. *Reference module for the structured format: every topic below is split into **Theory**, **Use cases**, and **Practical exercises** (concept-check → applied), and the block closes with a put-into-practice **capstone**.*
Theme: Validation →