Configuration
agenteval.yml
evaluator:
model: claude-3
target:
type: bedrock-agent
bedrock_agent_id: string
bedrock_agent_alias_id: string
tests:
retrieve_missing_documents:
steps:
- Ask agent for a list of missing documents for claim-006.
expected_results:
- The agent returns a list of missing documents.
initial_prompt: Give me a list of missing documents for claim-006.
max_turns: 2
hook: path.to.MyHook
evaluator
(map)
Refer to Evaluators for the available configurations.
target
(map)
Refer to Targets for the available configurations.
tests
(map)
A map of test cases, where the test name serves as the key.
steps
(list of strings)
The steps to perform for the test.
expected_results
(list of strings)
The expected results for the test.
initial_prompt
(string; optional)
The first message that is sent to the agent, which starts the conversation. If unspecified, the message will be generated based on the steps
provided.
max_turns
(integer; optional)
The maximum number of user-agent exchanges before the test fails. The default is 2
.
hook
(string; optional)
The module path to an evaluation hook. Refer to Hooks for more details.