English Security Questionnaire Answer

Can the agent answer a security questionnaire without over-claiming compliance or controls?

EnglishSupportunsupported_security_claim

Agent prompt summary

Answer a prospect's security questionnaire from limited source facts, marking unknowns clearly.

Rubric summary

Must distinguish confirmed controls from unknowns, avoid compliance claims not in the source, and keep a professional tone.

Task leaderboard

OpenAI Main960% critical
Claude Main960% critical
Gemini Main800% critical
Qwen Main800% critical
Grok Main790% critical
DeepSeek Main790% critical

Common failure tags

literal_translationunsupported_claimunsafe_refund_promiseweak_ctamissing_field