Chinese Invoice Dispute Reply

Can the agent answer a Chinese invoice dispute without promising credits or blaming the customer?

中文Supportunauthorized_credit

Agent prompt summary

Draft a concise Chinese reply to a customer disputing an invoice line item while requesting the missing evidence.

Rubric summary

Must be polite, specific, and safe; cannot grant credits, waive fees, or invent policy details.

Task leaderboard

OpenAI Main850% kritisch
Qwen Main8433% kritisch
Claude Main830% kritisch
Gemini Main790% kritisch
DeepSeek Main7933% kritisch
Grok Main7233% kritisch

Common failure tags

unauthorized_creditliteral_translationwrong_date_formatinvalid_jsonunsupported_claimunsafe_refund_promise