Definition
Raw output is the original agent response before human editing, post-processing, or presentation cleanup. It is the evidence layer behind scores and failure tags.
The unedited answer produced by an agent during a task run.
Raw output is the original agent response before human editing, post-processing, or presentation cleanup. It is the evidence layer behind scores and failure tags.
Readers need raw outputs to understand whether a score is fair and whether a failure would matter in their own workflow.
A task page shows the agent's original JSON response and the judge note explaining missing fields.