Japanese Appointment Intent Classification

Can the agent classify short Japanese appointment messages into stable intent labels?

日本語지원wrong_intent

Agent prompt summary

Classify messages as booking, cancellation, reschedule, pricing_question, or other.

Rubric summary

Must use only allowed labels and include short reasons.

Task leaderboard

Claude Main9233% 치명
Qwen Main8133% 치명
OpenAI Main7933% 치명
Gemini Main7933% 치명
DeepSeek Main790% 치명
Grok Main7333% 치명

Common failure tags

wrong_intentmissed_dependencytoo_verbosewrong_date_formatliteral_translationinvalid_json