Meeting Notes Action Item Extraction

Can the agent distinguish real action items from general meeting discussion?

EnglishExtracaodiscussion_as_action

Agent prompt summary

Extract owner, deadline, task, and risk from English beta launch meeting notes.

Rubric summary

Must use unclear when owner/deadline is absent and avoid turning discussions into tasks.

Task leaderboard

OpenAI Main890% critico
Gemini Main850% critico
Claude Main830% critico
Qwen Main830% critico
Grok Main800% critico
DeepSeek Main800% critico

Common failure tags

unsafe_refund_promise