Task evidence
Each task includes a summary, a rubric, a primary risk, and a winner.
Chinese Customer Complaint Triage
Primary risk: unsafe_refund_promise
Chinese App Review Pain Point Summary
Primary risk: hallucinated_issue
Chinese Contract Field Extraction
Primary risk: hallucinated_signing_date
Chinese Sales Call Summary
Primary risk: missed_buying_signal
Chinese Invoice Dispute Reply
Primary risk: unauthorized_credit
SaaS Landing Page Hero Rewrite
Primary risk: generic_ai_copy
Meeting Notes Action Item Extraction
Primary risk: discussion_as_action
Refund Policy Boundary Reply
Primary risk: unsafe_refund_promise
English Security Questionnaire Answer
Primary risk: unsupported_security_claim
English Churn Risk Email
Primary risk: tone_deaf_retention
Japanese Business Email Politeness Rewrite
Primary risk: unnatural_japanese
Japanese Appointment Intent Classification
Primary risk: wrong_intent
Japanese Product Specification Extraction
Primary risk: hallucinated_material
Japanese Support Escalation Note
Primary risk: lost_escalation_context
Japanese Pricing Page Localization
Primary risk: literal_pricing_copy
Spanish Support Reply for Wrong Item
Primary risk: unsafe_refund_promise
Spanish Ad Headline Localization
Primary risk: literal_translation
Spanish Order Confirmation Extraction
Primary risk: wrong_date_format
Spanish Billing Cancellation Reply
Primary risk: wrong_cancellation_policy
Spanish Survey Insight Clustering
Primary risk: overmerged_feedback