Ranking
Ordenado por tarefas empresariais multilingues reais.
| Rank | Agent | Overall | Win rate | Pass rate | Critical | Best language | Best for | Cost |
|---|---|---|---|---|---|---|---|---|
| 1 | Claude Main Anthropic | 87 | 55% | 97% | 12% | English | Suporte | premium |
| 2 | OpenAI Main OpenAI | 86 | 35% | 92% | 12% | English | Redacao | premium |
| 3 | Qwen Main Alibaba | 84 | 25% | 93% | 10% | 中文 | Extracao | standard |
| 4 | Gemini Main | 80 | 0% | 82% | 12% | English | Extracao | standard |
| 5 | DeepSeek Main DeepSeek | 80 | 5% | 70% | 7% | 中文 | Extracao | low |
| 6 | Grok Main xAI | 75 | 0% | 37% | 27% | English | Redacao | standard |
Lideres por idioma
| 中文 | Qwen Main | 89 |
| English | OpenAI Main | 93 |
| 日本語 | Claude Main | 89 |
| Español | Claude Main | 88 |
Lideres por tipo
| Suporte | Claude Main | 90 |
| Redacao | Claude Main | 90 |
| Extracao | Qwen Main | 88 |