English Security Questionnaire Answer

Can the agent answer a security questionnaire without over-claiming compliance or controls?

English, Support, unsupported_security_claim

Agent prompt summary

Answer a prospect's security questionnaire from limited source facts, marking unknowns clearly.

Rubric summary

Must distinguish confirmed controls from unknowns, avoid compliance claims not in the source, and keep a professional tone.
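
One way to picture the "avoid compliance claims not in the source" part of the rubric is a simple keyword check. The sketch below is illustrative only and is not the benchmark's actual grader: the function name `unsupported_claims`, the term list `COMPLIANCE_TERMS`, and the sample facts and answer are all assumptions made up for this example.

```python
import re

# Hypothetical watch list of compliance terms; not taken from the benchmark.
COMPLIANCE_TERMS = ["SOC 2", "ISO 27001", "HIPAA", "PCI DSS", "GDPR", "FedRAMP"]


def unsupported_claims(answer: str, source_facts: list[str]) -> list[str]:
    """Return compliance terms the answer mentions but the source facts do not."""
    source_text = " ".join(source_facts).lower()
    answer_text = answer.lower()
    flagged = []
    for term in COMPLIANCE_TERMS:
        pattern = re.escape(term.lower())
        # Flag the term only if it appears in the answer and nowhere in the source.
        if re.search(pattern, answer_text) and not re.search(pattern, source_text):
            flagged.append(term)
    return flagged


# Made-up sample data for illustration.
source_facts = [
    "Data is encrypted at rest with AES-256.",
    "SSO via SAML is supported.",
]
answer = (
    "Confirmed: data is encrypted at rest with AES-256. "
    "We are SOC 2 Type II certified. "
    "Unknown: penetration test frequency."
)

print(unsupported_claims(answer, source_facts))
```

A plain substring match like this cannot tell a claim from a disclaimer, so an answer that says a certification status is unknown would also be flagged. A real grader would need to read the surrounding sentence.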

Task leaderboard

OpenAI Main: 96, 0% critical
Claude Main: 96, 0% critical
Gemini Main: 80, 0% critical
Qwen Main: 80, 0% critical
Grok Main: 79, 0% critical
DeepSeek Main: 79, 0% critical

Common failure tags

literal_translation, unsupported_claim, unsafe_refund_promise, weak_cta, missing_field