curl --request POST \
--url https://api.example.com/v2/{entity}/{project}/eval_results/query \
--header 'Authorization: Basic <encoded-value>' \
--header 'Content-Type: application/json' \
--data '
{
"evaluation_call_ids": [
"<string>"
],
"evaluation_run_ids": [
"<string>"
],
"require_intersection": false,
"include_raw_data_rows": false,
"resolve_row_refs": false,
"include_rows": true,
"include_summary": false,
"summary_require_intersection": true,
"limit": 123,
"offset": 0
}
'{
"rows": [
{
"row_digest": "<string>",
"raw_data_row": null,
"evaluations": [
{
"evaluation_call_id": "<string>",
"trials": [
{
"predict_and_score_call_id": "<string>",
"predict_call_id": "<string>",
"model_output": null,
"scores": {},
"model_latency_seconds": 123,
"total_tokens": 123,
"scorer_call_ids": {}
}
]
}
]
}
],
"total_rows": 123,
"summary": {
"row_count": 0,
"evaluations": [
{
"evaluation_call_id": "<string>",
"trial_count": 0,
"scorer_stats": [
{
"scorer_key": "<string>",
"path": "<string>",
"trial_count": 0,
"numeric_count": 0,
"numeric_mean": 123,
"pass_true_count": 0,
"pass_known_count": 0,
"pass_rate": 123,
"pass_signal_coverage": 123
}
],
"evaluation_ref": "<string>",
"model_ref": "<string>",
"display_name": "<string>",
"trace_id": "<string>",
"started_at": "<string>"
}
]
},
"warnings": [
"<string>"
]
}Eval Results Query
Read grouped evaluation result rows for one or more evaluations.
curl --request POST \
--url https://api.example.com/v2/{entity}/{project}/eval_results/query \
--header 'Authorization: Basic <encoded-value>' \
--header 'Content-Type: application/json' \
--data '
{
"evaluation_call_ids": [
"<string>"
],
"evaluation_run_ids": [
"<string>"
],
"require_intersection": false,
"include_raw_data_rows": false,
"resolve_row_refs": false,
"include_rows": true,
"include_summary": false,
"summary_require_intersection": true,
"limit": 123,
"offset": 0
}
'{
"rows": [
{
"row_digest": "<string>",
"raw_data_row": null,
"evaluations": [
{
"evaluation_call_id": "<string>",
"trials": [
{
"predict_and_score_call_id": "<string>",
"predict_call_id": "<string>",
"model_output": null,
"scores": {},
"model_latency_seconds": 123,
"total_tokens": 123,
"scorer_call_ids": {}
}
]
}
]
}
],
"total_rows": 123,
"summary": {
"row_count": 0,
"evaluations": [
{
"evaluation_call_id": "<string>",
"trial_count": 0,
"scorer_stats": [
{
"scorer_key": "<string>",
"path": "<string>",
"trial_count": 0,
"numeric_count": 0,
"numeric_mean": 123,
"pass_true_count": 0,
"pass_known_count": 0,
"pass_rate": 123,
"pass_signal_coverage": 123
}
],
"evaluation_ref": "<string>",
"model_ref": "<string>",
"display_name": "<string>",
"trace_id": "<string>",
"started_at": "<string>"
}
]
},
"warnings": [
"<string>"
]
}Documentation Index
Fetch the complete documentation index at: https://wb-21fd5541-dbrian-docs-2514-bedrock-agents.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Basic authentication header of the form Basic <encoded-value>, where <encoded-value> is the base64-encoded string username:password.
Body
Evaluation root call IDs to include.
Alias for evaluation call IDs from the Evaluation Runs API.
When true, only include rows present in all requested evaluations.
When true, populate raw_data_row on each result row. Inline rows are returned as their dict value; dataset-referenced rows are returned as the ref string unless resolve_row_refs is also true.
When true (requires include_raw_data_rows=True), resolve dataset-row reference strings to actual row data via a table lookup. When false, dataset-row refs are returned as-is.
When true, include grouped row/trial data in rows and compute total_rows for the requested row-level view.
When true, include aggregated scorer/evaluation summary data in summary.
Optional intersection behavior for the summary section. When null, the value of require_intersection is used.
Optional row-level page size applied after grouping and intersection.
Optional row-level page offset applied after grouping and intersection.
Was this page helpful?