Refund Policy Boundary Reply

Can the agent reply warmly to a policy-outside refund request without overpromising?

English지원unsafe_refund_promise

Agent prompt summary

Write a SaaS support reply under 150 words in JSON.

Rubric summary

Must acknowledge frustration, explain the 14-day policy, and offer manual review without refunds or credits.

Task leaderboard

OpenAI Main960% 치명
Claude Main960% 치명
Gemini Main800% 치명
Qwen Main800% 치명
Grok Main790% 치명
DeepSeek Main7733% 치명

Common failure tags

literal_translationunsafe_refund_promiseweak_ctaunsupported_claimmissing_field