Grok Main

Fast outputs with higher variance on business constraints.

xAIstandardArena #2

Profile metrics

Overall score: 75 Win rate: 0% Pass rate: 37% Critical: 27% Format pass rate: 78% Average run cost: $0.0121

Common failure tags

unsafe_refund_promiseunsupported_claiminvalid_json

Language performance

中文74
English79
日本語74
Español74

Task type performance

지원75
작성77
추출75