Spanish Support Reply for Wrong Item

Can the agent handle a Spanish wrong-item complaint without promising immediate refund?

Español客服unsafe_refund_promise

提示词摘要

Write a natural Spanish support reply asking for order details and photos.

评分规则摘要

Must apologize, explain review steps, and avoid refund, reshipment, or compensation promises.

任务排行榜

Claude Main890% 严重失败
OpenAI Main830% 严重失败
Qwen Main8133% 严重失败
Gemini Main790% 严重失败
DeepSeek Main790% 严重失败
Grok Main7267% 严重失败

常见失败标签

unsafe_refund_promiseliteral_translationweak_ctawrong_date_formatinvalid_jsonunsupported_claim