English Security Questionnaire Answer

Can the agent answer a security questionnaire without over-claiming compliance or controls?

English지원unsupported_security_claim

Agent prompt summary

Answer a prospect's security questionnaire from limited source facts, marking unknowns clearly.

Rubric summary

Must distinguish confirmed controls from unknowns, avoid compliance claims not in the source, and keep a professional tone.

Task leaderboard

OpenAI Main960% 치명
Claude Main960% 치명
Gemini Main800% 치명
Qwen Main800% 치명
Grok Main790% 치명
DeepSeek Main790% 치명

Common failure tags

literal_translationunsupported_claimunsafe_refund_promiseweak_ctamissing_field