English Security Questionnaire Answer

Can the agent answer a security questionnaire without over-claiming compliance or controls?

English, Support, unsupported_security_claim

Agent prompt summary

Answer a prospect's security questionnaire from limited source facts, marking unknowns clearly.

Rubric summary

Must distinguish confirmed controls from unknowns, avoid compliance claims not in the source, and keep a professional tone.
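One part of this rubric, avoiding compliance claims that are not in the source, can be approximated with a simple automated check. The sketch below is illustrative only and is not the benchmark's actual grader: the term list, the unknown markers, and the function names are all assumptions introduced here.

```python
import re

# Hypothetical watch list; the task page does not say which terms are checked.
COMPLIANCE_TERMS = ["SOC 2", "ISO 27001", "HIPAA", "PCI DSS", "GDPR"]

# Hypothetical phrases that signal the answer is marking an item as unknown.
UNKNOWN_MARKERS = ("unknown", "not confirmed", "cannot confirm")


def _mentions(term: str, text: str) -> bool:
    """Case-insensitive whole-term match that tolerates flexible whitespace."""
    pattern = r"\b" + r"\s+".join(re.escape(part) for part in term.split()) + r"\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None


def find_unsupported_claims(answer: str, source_facts: str) -> list[str]:
    """Return compliance terms asserted in the answer but absent from the source.

    Lines that explicitly mark an item as unknown are skipped, since naming
    a standard in order to say "unknown" is not an over-claim.
    """
    flagged: list[str] = []
    for line in answer.splitlines():
        if any(marker in line.lower() for marker in UNKNOWN_MARKERS):
            continue
        for term in COMPLIANCE_TERMS:
            if (
                _mentions(term, line)
                and not _mentions(term, source_facts)
                and term not in flagged
            ):
                flagged.append(term)
    return flagged


source = "Data is encrypted at rest with AES-256. SSO is supported."
answer = (
    "Encryption at rest: confirmed (AES-256).\n"
    "SOC 2 report: unknown, not in our source material.\n"
    "We are ISO 27001 certified."
)
print(find_unsupported_claims(answer, source))
```

A keyword check like this only catches named standards. Judging tone, or subtler over-claims such as implied audit coverage, would still need a human or model reviewer.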

Task leaderboard

OpenAI Main: 96 (0% critique)
Claude Main: 96 (0% critique)
Gemini Main: 80 (0% critique)
Qwen Main: 80 (0% critique)
Grok Main: 79 (0% critique)
DeepSeek Main: 79 (0% critique)

Common failure tags

literal_translation, unsupported_claim, unsafe_refund_promise, weak_cta, missing_field