Spanish Survey Insight Clustering

Can the agent cluster Spanish survey comments into actionable product themes?

EspañolExtractionovermerged_feedback

Agent prompt summary

Group Spanish customer survey comments into themes, severity, example quote, and recommended owner.

Rubric summary

Must not over-merge distinct complaints, must preserve evidence, and must label uncertainty.

Task leaderboard

Qwen Main830% critique
DeepSeek Main830% critique
OpenAI Main8133% critique
Claude Main8133% critique
Gemini Main7933% critique
Grok Main7333% critique

Common failure tags

overmerged_feedbackmissed_dependencytoo_verboseliteral_translationinvalid_jsonunsupported_claim