Back to course

Testing Agents: Evaluating Accuracy

Agentic AI Engineering (Build, don't just prompt)

Why Testing is Hard

Agents are non-deterministic. We'll use LangSmith to trace calls and evaluate if the agent reached the correct conclusion.