Evaluation Concepts Who this is for: teams testing agent behavior. Topics Regression evaluation. Test cases. Criteria. LLM-as-judge tradeoffs. Trace review. Debugging failed runs. Next Steps Evaluation and Debugging Evaluation Project