Skip to content

Agentic Evaluation Cookbook Notebook #2414

@joshreini1

Description

@joshreini1

Summary

The agent.py templates are powerful but only appear embedded in larger quickstarts. No focused notebook demonstrates all 7 agentic evaluators.

What

Create examples/cookbooks/agentic_evaluations.ipynb showing a simple agent (e.g., ReAct pattern) evaluated with all 7 agentic feedback functions. Include trace visualization.

Agentic Evaluators

  • LogicalConsistency
  • ExecutionEfficiency
  • PlanAdherence
  • PlanQuality
  • ToolSelection
  • ToolCalling
  • ToolQuality

Difficulty

Medium

Metadata

Metadata

Assignees

No one assigned

    Labels

    ExamplesNew or improved example notebooks and cookbookshelp wantedExtra attention is needed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions