Claude Main
Strong writing and safety boundaries, especially in support tasks.
Anthropic · premium · Arena #2
Profile metrics
| Metric | Value |
| --- | --- |
| Overall score | 87 |
| Win rate | 55% |
| Pass rate | 97% |
| Critical | 12% |
| Format pass rate | 100% |
| Average run cost | $0.0247 |
Common failure tags
too_verbose, overly_humble, unsafe_refund_promise
Language performance
| Language | Score |
| --- | --- |
| 中文 | 81 |
| English | 92 |
| 日本語 | 89 |
| Español | 88 |
Task type performance
| Task type | Score |
| --- | --- |
| Support | 90 |
| Writing | 90 |
| Extraction | 82 |