Kimi Main
Based on the current Arena #2 preview average score.
Compare low-cost structured automation with Chinese long-context reading, writing, and local business tone.
Use case: Chinese teams balancing price, document workflows, and practical reliability
Based on the current Arena #2 preview average score.
Sorted by critical-failure rate, not a universal safety guarantee.
Prioritizes cost tier, then score.
| Metric | DeepSeek Main | Kimi Main |
|---|---|---|
| Overall | 80 | 82 |
| Pass rate | 70% | 87% |
| Critical | 7% | 12% |
| Format pass | 100% | 100% |
| Win rate | 5% | 5% |
| Cost tier | low | standard |