Spanish Survey Insight Clustering

Can the agent cluster Spanish survey comments into actionable product themes?

Español, Extraction, overmerged_feedback

Agent prompt summary

Group Spanish customer survey comments into themes, and give each theme a severity, an example quote, and a recommended owner.

Rubric summary

Must not over-merge distinct complaints, must preserve evidence, and must label uncertainty.
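A minimal sketch of how part of this rubric could be checked automatically, assuming the agent returns a JSON list of theme objects. The field names (`theme`, `severity`, `example_quote`, `recommended_owner`, `uncertainty`), the function `check_clusters`, and the mapping from each check to a failure tag are all hypothetical; the task card does not specify an output schema or a grader. Over-merging is not checked here, since it likely needs human or model judgment.

```python
import json

# Hypothetical field names; the task card does not define a schema.
REQUIRED_FIELDS = {
    "theme", "severity", "example_quote", "recommended_owner", "uncertainty",
}


def check_clusters(raw_output: str, comments: list[str]) -> list[str]:
    """Return assumed failure tags for one agent response."""
    try:
        clusters = json.loads(raw_output)
    except json.JSONDecodeError:
        return ["invalid_json"]
    if not isinstance(clusters, list):
        return ["invalid_json"]

    failures = []
    for cluster in clusters:
        # Every theme must be an object carrying all required fields,
        # including an explicit uncertainty label.
        if not isinstance(cluster, dict) or not REQUIRED_FIELDS <= cluster.keys():
            failures.append("invalid_json")
            continue
        # "Preserve evidence": the quote must appear verbatim in a source comment.
        if not any(cluster["example_quote"] in c for c in comments):
            failures.append("unsupported_claim")
    return failures
```

An empty list means the response passed these structural checks only; it says nothing about whether distinct complaints were merged.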

Task leaderboard

Qwen Main: 83, 0% critical
DeepSeek Main: 83, 0% critical
OpenAI Main: 81, 33% critical
Claude Main: 81, 33% critical
Gemini Main: 79, 33% critical
Grok Main: 73, 33% critical

Common failure tags

overmerged_feedback, missed_dependency, too_verbose, literal_translation, invalid_json, unsupported_claim