Chinese App Review Pain Point Summary

Can the agent summarize messy Chinese app reviews without inventing pain points?

中文写作hallucinated_issue

提示词摘要

Extract pain points, counts, severity, representative comments, and three product suggestions.

评分规则摘要

Must merge similar issues, count accurately, cite evidence, and avoid unsupported suggestions.

任务排行榜

OpenAI Main890% 严重失败
Qwen Main850% 严重失败
Claude Main830% 严重失败
Gemini Main800% 严重失败
DeepSeek Main7933% 严重失败
Grok Main7733% 严重失败

常见失败标签

hallucinated_issueweak_ctaliteral_translationunsupported_claimunsafe_refund_promise