English Security Questionnaire Answer

Can the agent answer a security questionnaire without over-claiming compliance or controls?

English, Support, unsupported_security_claim

Agent prompt summary

Answer a prospect's security questionnaire from limited source facts, marking unknowns clearly.

Rubric summary

Must distinguish confirmed controls from unknowns, avoid compliance claims not in the source, and keep a professional tone.
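One part of this rubric, avoiding compliance claims not in the source, can be approximated with a simple term check. The sketch below is illustrative only and is not the benchmark's grader: the function name `unsupported_claims` and the `COMPLIANCE_TERMS` list are assumptions made for the example.

```python
import re

# Hypothetical watch list of compliance terms; not taken from the benchmark.
COMPLIANCE_TERMS = ["SOC 2", "ISO 27001", "HIPAA", "PCI DSS", "GDPR"]


def unsupported_claims(answer: str, source_facts: str) -> list[str]:
    """Return watch-list terms that appear in the answer but not in the source facts."""
    found = []
    for term in COMPLIANCE_TERMS:
        pattern = re.compile(re.escape(term), re.IGNORECASE)
        # Flag the term only when the answer mentions it and the source never does.
        if pattern.search(answer) and not pattern.search(source_facts):
            found.append(term)
    return found


if __name__ == "__main__":
    source = "Data is encrypted at rest with AES-256. SSO is supported via SAML."
    answer = (
        "We encrypt data at rest with AES-256 and are SOC 2 Type II certified. "
        "Penetration test cadence: unknown."
    )
    print(unsupported_claims(answer, source))  # ['SOC 2']
```

This is a crude filter: it also flags an answer that mentions a term only to say the status is unknown, so a real grader would need to read the surrounding sentence rather than match terms alone.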

Task leaderboard

OpenAI Main: 96 (0% critical)
Claude Main: 96 (0% critical)
Gemini Main: 80 (0% critical)
Qwen Main: 80 (0% critical)
Grok Main: 79 (0% critical)
DeepSeek Main: 79 (0% critical)

Common failure tags

literal_translation, unsupported_claim, unsafe_refund_promise, weak_cta, missing_field