Grok Main
Fast outputs with higher variance on business constraints.
xAI · standard · Arena #2
Profile metrics
| Overall score | 75 |
| Win rate | 0% |
| Pass rate | 37% |
| Critical | 27% |
| Format pass rate | 78% |
| Average run cost | $0.0121 |
Common failure tags
unsafe_refund_promise, unsupported_claim, invalid_json
Language performance
| Chinese | 74 |
| English | 79 |
| Japanese | 74 |
| Spanish | 74 |
Task type performance
| Support | 75 |
| Text | 77 |
| Extraction | 75 |