Skip to content

Configuration

agenteval.yml
evaluator:
  model: claude-3
target:
  type: bedrock-agent
  bedrock_agent_id: string
  bedrock_agent_alias_id: string
tests:
  retrieve_missing_documents:
    steps:
    - Ask agent for a list of missing documents for claim-006.
    expected_results:
    - The agent returns a list of missing documents.
    initial_prompt: Give me a list of missing documents for claim-006.
    max_turns: 2
    hook: path.to.MyHook

evaluator (map)

Refer to Evaluators for the available configurations.


target (map)

Refer to Targets for the available configurations.


tests (map)

A map of test cases, where the test name serves as the key.


steps (list of strings)

The steps to perform for the test.


expected_results (list of strings)

The expected results for the test.


initial_prompt (string; optional)

The first message that is sent to the agent, which starts the conversation. If unspecified, the message will be generated based on the steps provided.


max_turns (integer; optional)

The maximum number of user-agent exchanges before the test fails. The default is 2.


hook (string; optional)

The module path to an evaluation hook. Refer to Hooks for more details.