English Security Questionnaire Answer
Can the agent answer a security questionnaire without over-claiming compliance or controls?
English, Support, unsupported_security_claim
Agent prompt summary
Answer a prospect's security questionnaire from limited source facts, marking unknowns clearly.
Rubric summary
Must distinguish confirmed controls from unknowns, avoid compliance claims not in the source, and keep a professional tone.
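One part of this rubric, avoiding compliance claims that are not in the source, can be roughly approximated with an automated pre-check. The following is only a minimal sketch and not the benchmark's actual grader: the framework list, the hedge markers, and the function name `unsupported_claims` are all hypothetical choices made for illustration. It flags a framework only when the answer names it in a sentence that does not mark it as unknown and the source facts never mention it.

```python
import re

# Hypothetical watch list of compliance frameworks; not taken from the benchmark.
FRAMEWORK_PATTERNS = {
    "SOC 2": r"\bSOC\s*2\b",
    "ISO 27001": r"\bISO\s*27001\b",
    "HIPAA": r"\bHIPAA\b",
    "PCI DSS": r"\bPCI[\s-]*DSS\b",
}

# Hypothetical phrases that signal the answer is marking something as unknown.
HEDGE_MARKERS = ("unknown", "not confirmed", "cannot confirm", "unable to confirm")


def unsupported_claims(answer: str, source_facts: str) -> list[str]:
    """Return frameworks asserted in the answer but absent from the source facts."""
    flagged = []
    # Naive sentence split on terminal punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", answer)
    for name, pattern in FRAMEWORK_PATTERNS.items():
        # A framework mentioned in the source is treated as supported.
        if re.search(pattern, source_facts, flags=re.IGNORECASE):
            continue
        for sentence in sentences:
            hedged = any(marker in sentence.lower() for marker in HEDGE_MARKERS)
            if re.search(pattern, sentence, flags=re.IGNORECASE) and not hedged:
                flagged.append(name)
                break
    return flagged


source = "Data is encrypted at rest with AES-256. SSO via SAML is supported."
overclaiming = "We are SOC 2 certified and encrypt data at rest."
careful = "Data is encrypted at rest. SOC 2 status is unknown at this time."

print(unsupported_claims(overclaiming, source))  # flags SOC 2
print(unsupported_claims(careful, source))       # flags nothing
```

A keyword check like this cannot judge tone or catch paraphrased claims, so it would complement a human or model-based rubric review and not replace it.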
Task leaderboard
| Model | Score | Critical failures |
| --- | --- | --- |
| OpenAI Main | 96 | 0% critical |
| Claude Main | 96 | 0% critical |
| Gemini Main | 80 | 0% critical |
| Qwen Main | 80 | 0% critical |
| Grok Main | 79 | 0% critical |
| DeepSeek Main | 79 | 0% critical |
Common failure tags
literal_translation, unsupported_claim, unsafe_refund_promise, weak_cta, missing_field