Claude Main

Strong writing and safety boundaries, especially in support tasks.

AnthropicpremiumArena #2

Profile metrics

Overall score: 87 Win rate: 55% Pass rate: 97% Critical: 12% Format pass rate: 100% Average run cost: $0.0247

Common failure tags

too_verboseoverly_humbleunsafe_refund_promise

Language performance

中文81
English92
日本語89
Español88

Task type performance

Suporte90
Redacao90
Extracao82