Meeting Notes Action Item Extraction

Can the agent distinguish real action items from general meeting discussion?

English | Extraction | discussion_as_action

Agent prompt summary

Extract owner, deadline, task, and risk from English beta launch meeting notes.

Rubric summary

Must output "unclear" when the owner or deadline is absent, and must not turn general discussion into tasks.
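
As an illustration of the rubric's "unclear" rule, here is a minimal sketch of how an extracted item could be normalized. The `ActionItem` class, the `normalize` helper, and the dictionary keys are hypothetical names chosen for this example, not part of the task definition. Applying the same fallback to the risk field is also an assumption, since the rubric only names owner and deadline.

```python
from dataclasses import dataclass

UNCLEAR = "unclear"  # placeholder the rubric asks for when a field is absent


@dataclass
class ActionItem:
    """Hypothetical container for one extracted action item."""
    task: str
    owner: str = UNCLEAR
    deadline: str = UNCLEAR
    risk: str = UNCLEAR  # assumption: risk also falls back to "unclear"


def normalize(raw: dict) -> ActionItem:
    """Build an ActionItem, replacing missing or blank fields with 'unclear'."""

    def field(name: str) -> str:
        # Treat None, a missing key, and whitespace-only strings as absent.
        value = (raw.get(name) or "").strip()
        return value if value else UNCLEAR

    return ActionItem(
        task=field("task"),
        owner=field("owner"),
        deadline=field("deadline"),
        risk=field("risk"),
    )
```

For example, `normalize({"task": "Send beta invite list", "owner": "", "deadline": None})` yields an item whose owner and deadline are both "unclear" rather than guessed values. Deciding whether a sentence is an action item at all, as opposed to general discussion, is a judgment this sketch does not attempt.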

Task leaderboard

OpenAI Main     89    0% critique
Gemini Main     85    0% critique
Claude Main     83    0% critique
Qwen Main       83    0% critique
Grok Main       80    0% critique
DeepSeek Main   80    0% critique

Common failure tags

unsafe_refund_promise