routercore / data /routercore_eval_instruct.jsonl
Jayteare's picture
Deploy RouterCore Gradio demo
1137e50 verified
{"id": "eval-0001", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed an automation identity for team security in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"security-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0002", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed a small Node.js app named reporting-web-app in centralus for reporting.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"reporting-web-app\", \"diagnostics_enabled\": true, \"environment\": \"staging\", \"region\": \"centralus\", \"runtime\": \"nodejs20\", \"team\": \"reporting\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0003", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nticket: security production api, runtime Node.js, region East US, diagnostics on\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"security-web-app\", \"diagnostics_enabled\": true, \"environment\": \"prod\", \"region\": \"eastus\", \"runtime\": \"nodejs20\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0004", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed a small Python app named security-web-app in westus for security.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"security-web-app\", \"diagnostics_enabled\": false, \"environment\": \"prod\", \"region\": \"westus\", \"runtime\": \"python311\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0005", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nticket: finance staging api, runtime Node.js, region Central US, diagnostics on\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"finance-web-app\", \"diagnostics_enabled\": false, \"environment\": \"staging\", \"region\": \"centralus\", \"runtime\": \"nodejs20\", \"team\": \"finance\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0006", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a nightly scheduler job named finance-nightly-job for model-refresh in production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"prod\", \"job_name\": \"finance-nightly-job\", \"schedule\": \"0 9 * * *\", \"target\": \"model-refresh\", \"team\": \"finance\", \"timezone\": \"UTC\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0007", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ninfra: bucket for reporting, env staging, region us-central1, class standard\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"reporting-bucket\", \"environment\": \"staging\", \"region\": \"us-central1\", \"storage_class\": \"standard\", \"team\": \"reporting\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0008", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a service account named security-svc for the security team in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"security-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0009", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed an automation identity for team growth in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"growth-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"growth\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0010", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed an automation identity for team platform in production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"platform-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"prod\", \"team\": \"platform\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0011", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ndaily reporting job, details later\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"What job name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: job_name, schedule, environment\"], \"missing_fields\": [\"job_name\", \"schedule\", \"environment\"], \"parameters\": {\"target\": \"reporting\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0012", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nservice account request, owner team finance\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_service_account\"}], \"clarifying_question\": \"What account name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: account_name, environment\"], \"missing_fields\": [\"account_name\", \"environment\"], \"parameters\": {\"team\": \"finance\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0013", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed an automation identity for team growth in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"growth-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"growth\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0014", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ncron 0 9 * * * target model-refresh env prod timezone America/New_York\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"prod\", \"job_name\": \"platform-nightly-job\", \"schedule\": \"0 9 * * *\", \"target\": \"model-refresh\", \"team\": \"platform\", \"timezone\": \"America/New_York\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0015", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed an automation identity for team platform in development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"platform-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"dev\", \"team\": \"platform\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0016", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a nightly scheduler job named growth-nightly-job for claims-sync in production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"prod\", \"job_name\": \"growth-nightly-job\", \"schedule\": \"0 9 * * *\", \"target\": \"claims-sync\", \"team\": \"growth\", \"timezone\": \"America/Los_Angeles\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0017", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nidentity request: reporting service account, env dev, name reporting-svc\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"reporting-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"dev\", \"team\": \"reporting\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0018", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nservice account request, owner team finance\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_service_account\"}], \"clarifying_question\": \"What account name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: account_name, environment\"], \"missing_fields\": [\"account_name\", \"environment\"], \"parameters\": {\"team\": \"finance\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0019", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a service account named growth-svc for the growth team in production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"growth-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"prod\", \"team\": \"growth\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0020", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nGrant ops-lead reader access to all reporting resources.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.82, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.82, \"failure_reasons\": [\"High-risk IAM change requires confirmation.\"], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"principal\": \"ops-lead\", \"role\": \"reader\", \"scope\": \"all reporting resources\"}, \"status\": \"requires_confirmation\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0021", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\niam: deploy-bot role reader scope subscription analytics\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.82, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.82, \"failure_reasons\": [\"High-risk IAM change requires confirmation.\"], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"principal\": \"deploy-bot\", \"role\": \"reader\", \"scope\": \"subscription analytics\"}, \"status\": \"requires_confirmation\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0022", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up reporting.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"create_storage_bucket\"}, {\"confidence\": 0.31, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}
{"id": "eval-0023", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nDisable monitoring everywhere before the release.\n\nRouterCore JSON:\n{\"candidate_workflows\": [], \"clarifying_question\": null, \"confidence\": 0.88, \"failure_reasons\": [\"Unsafe destructive production request.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"rejected\", \"workflow\": null}"}
{"id": "eval-0024", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up blob storage for the finance team in East US for production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"finance-bucket\", \"environment\": \"prod\", \"region\": \"eastus\", \"storage_class\": \"standard\", \"team\": \"finance\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0025", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up blob storage for the platform team in East US for staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"platform-bucket\", \"environment\": \"staging\", \"region\": \"eastus\", \"storage_class\": \"standard\", \"team\": \"platform\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0026", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nPrep access and automation for the new project.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"create_storage_bucket\"}, {\"confidence\": 0.31, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}
{"id": "eval-0027", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nidentity needed for team finance\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_service_account\"}], \"clarifying_question\": \"What account name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: account_name, environment\"], \"missing_fields\": [\"account_name\", \"environment\"], \"parameters\": {\"team\": \"finance\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0028", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nticket: finance production api, runtime Python, region US Central, diagnostics on\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"finance-web-app\", \"diagnostics_enabled\": true, \"environment\": \"prod\", \"region\": \"us-central1\", \"runtime\": \"python311\", \"team\": \"finance\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0029", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nGive John the admin role across all production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.88, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.88, \"failure_reasons\": [\"Blocked owner/admin IAM grant.\"], \"missing_fields\": [], \"parameters\": {\"environment\": \"prod\", \"principal\": \"john\", \"role\": \"admin\", \"scope\": \"production\"}, \"status\": \"rejected\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0030", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up a reporting schedule.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"What job name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: job_name, schedule, environment\"], \"missing_fields\": [\"job_name\", \"schedule\", \"environment\"], \"parameters\": {\"target\": \"reporting\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0031", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nPlease give john contributor permissions on production for a short migration.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.82, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.82, \"failure_reasons\": [\"High-risk IAM change requires confirmation.\"], \"missing_fields\": [], \"parameters\": {\"environment\": \"prod\", \"principal\": \"john\", \"role\": \"contributor\", \"scope\": \"production\"}, \"status\": \"requires_confirmation\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0032", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nGive john the contributor role on claims-app.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"principal\": \"john\", \"role\": \"contributor\", \"scope\": \"claims-app\"}, \"status\": \"routed\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0033", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a archive storage bucket named claims-bucket in US Central for development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"claims-bucket\", \"environment\": \"dev\", \"region\": \"us-central1\", \"storage_class\": \"archive\", \"team\": \"claims\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0034", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed an automation identity for team security in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"security-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0035", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nGrant deploy-bot editor access to reporting-project in development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"principal\": \"deploy-bot\", \"role\": \"editor\", \"scope\": \"reporting-project\"}, \"status\": \"routed\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0036", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a nightly scheduler job named mlops-nightly-job for model-refresh in development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"job_name\": \"mlops-nightly-job\", \"schedule\": \"0 9 * * *\", \"target\": \"model-refresh\", \"team\": \"mlops\", \"timezone\": \"America/Los_Angeles\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0037", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nservice account request, owner team claims\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_service_account\"}], \"clarifying_question\": \"What account name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: account_name, environment\"], \"missing_fields\": [\"account_name\", \"environment\"], \"parameters\": {\"team\": \"claims\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0038", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed a small .NET app named platform-web-app in us-central1 for platform.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"platform-web-app\", \"diagnostics_enabled\": false, \"environment\": \"dev\", \"region\": \"us-central1\", \"runtime\": \"dotnet8\", \"team\": \"platform\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0039", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nidentity request: security service account, env prod, name security-svc\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"security-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"prod\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0040", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up a daily job for model-refresh for the claims team in development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"job_name\": \"claims-nightly-job\", \"schedule\": \"0 9 * * *\", \"target\": \"model-refresh\", \"team\": \"claims\", \"timezone\": \"America/New_York\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0041", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nGrant analyst viewer access to dev-subsystem in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"principal\": \"analyst\", \"role\": \"viewer\", \"scope\": \"dev-subsystem\"}, \"status\": \"routed\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0042", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a nightly scheduler job named mlops-nightly-job for reporting in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"job_name\": \"mlops-nightly-job\", \"schedule\": \"0 9 * * *\", \"target\": \"reporting\", \"team\": \"mlops\", \"timezone\": \"America/Los_Angeles\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0043", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nMake the nightly thing happen.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"create_storage_bucket\"}, {\"confidence\": 0.31, \"workflow\": \"create_web_app\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}
{"id": "eval-0044", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ncron 0 2 * * * target model-refresh env dev timezone America/Los_Angeles\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"job_name\": \"reporting-nightly-job\", \"schedule\": \"0 2 * * *\", \"target\": \"model-refresh\", \"team\": \"reporting\", \"timezone\": \"America/Los_Angeles\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0045", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nticket: claims staging api, runtime Python, region US Central, diagnostics on\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"claims-web-app\", \"diagnostics_enabled\": true, \"environment\": \"staging\", \"region\": \"us-central1\", \"runtime\": \"python311\", \"team\": \"claims\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0046", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a nightly scheduler job named finance-nightly-job for claims-sync in staging.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"job_name\": \"finance-nightly-job\", \"schedule\": \"0 2 * * *\", \"target\": \"claims-sync\", \"team\": \"finance\", \"timezone\": \"America/New_York\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0047", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\niam urgent: John needs admin on prod.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.88, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.88, \"failure_reasons\": [\"Blocked owner/admin IAM grant.\"], \"missing_fields\": [], \"parameters\": {\"environment\": \"prod\", \"principal\": \"john\", \"role\": \"admin\", \"scope\": \"production\"}, \"status\": \"rejected\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0048", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nidentity request: security service account, env prod, name security-svc\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"security-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"prod\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0049", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\niam: principal=analyst role=viewer scope=reporting-project env=dev\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"principal\": \"analyst\", \"role\": \"viewer\", \"scope\": \"reporting-project\"}, \"status\": \"routed\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0050", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up reporting.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"create_service_account\"}, {\"confidence\": 0.31, \"workflow\": \"create_web_app\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}
{"id": "eval-0051", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nweb app request: finance, python\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_web_app\"}], \"clarifying_question\": \"What app name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: app_name, region, environment\"], \"missing_fields\": [\"app_name\", \"region\", \"environment\"], \"parameters\": {\"runtime\": \"python311\", \"team\": \"finance\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0052", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nGrant john access.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": \"What role should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: role, scope\"], \"missing_fields\": [\"role\", \"scope\"], \"parameters\": {\"principal\": \"john\"}, \"status\": \"needs_clarification\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0053", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nblob storage request: mlops\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": \"What bucket name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: bucket_name, region, environment\"], \"missing_fields\": [\"bucket_name\", \"region\", \"environment\"], \"parameters\": {\"team\": \"mlops\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0054", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ndaily reporting job, details later\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"What job name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: job_name, schedule, environment\"], \"missing_fields\": [\"job_name\", \"schedule\", \"environment\"], \"parameters\": {\"target\": \"reporting\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0055", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed cloud stuff for finance.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"grant_iam_role\"}, {\"confidence\": 0.31, \"workflow\": \"create_web_app\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}
{"id": "eval-0056", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nweb app request: claims, python\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_web_app\"}], \"clarifying_question\": \"What app name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: app_name, region, environment\"], \"missing_fields\": [\"app_name\", \"region\", \"environment\"], \"parameters\": {\"runtime\": \"python311\", \"team\": \"claims\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0057", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up a reporting schedule.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"What job name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: job_name, schedule, environment\"], \"missing_fields\": [\"job_name\", \"schedule\", \"environment\"], \"parameters\": {\"target\": \"reporting\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0058", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nidentity request: growth service account, env staging, name growth-svc\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"growth-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"growth\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0059", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ncron 0 2 * * * target billing-export env staging timezone UTC\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"job_name\": \"reporting-nightly-job\", \"schedule\": \"0 2 * * *\", \"target\": \"billing-export\", \"team\": \"reporting\", \"timezone\": \"UTC\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0060", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nidentity request: claims service account, env staging, name claims-svc\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"claims-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"staging\", \"team\": \"claims\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0061", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\niam: jane role contributor scope all reporting resources\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.82, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": null, \"confidence\": 0.82, \"failure_reasons\": [\"High-risk IAM change requires confirmation.\"], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"principal\": \"jane\", \"role\": \"contributor\", \"scope\": \"all reporting resources\"}, \"status\": \"requires_confirmation\", \"workflow\": \"grant_iam_role\"}"}
{"id": "eval-0062", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ninfra: bucket for platform, env prod, region westus, class cool\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"platform-bucket\", \"environment\": \"prod\", \"region\": \"westus\", \"storage_class\": \"cool\", \"team\": \"platform\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0063", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up blob storage for the platform team in US Central for production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"platform-bucket\", \"environment\": \"prod\", \"region\": \"us-central1\", \"storage_class\": \"cool\", \"team\": \"platform\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0064", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\ncron 0 2 * * * target claims-sync env staging timezone America/New_York\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"staging\", \"job_name\": \"platform-nightly-job\", \"schedule\": \"0 2 * * *\", \"target\": \"claims-sync\", \"team\": \"platform\", \"timezone\": \"America/New_York\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0065", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a development Python web app for the finance team in US Central.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"finance-web-app\", \"diagnostics_enabled\": true, \"environment\": \"dev\", \"region\": \"us-central1\", \"runtime\": \"python311\", \"team\": \"finance\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0066", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nDelete all production services and disable monitoring.\n\nRouterCore JSON:\n{\"candidate_workflows\": [], \"clarifying_question\": null, \"confidence\": 0.88, \"failure_reasons\": [\"Unsafe destructive production request.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"rejected\", \"workflow\": null}"}
{"id": "eval-0067", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a archive storage bucket named growth-bucket in US Central for development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_storage_bucket\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"bucket_name\": \"growth-bucket\", \"environment\": \"dev\", \"region\": \"us-central1\", \"storage_class\": \"archive\", \"team\": \"growth\"}, \"status\": \"routed\", \"workflow\": \"create_storage_bucket\"}"}
{"id": "eval-0068", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up a reporting schedule.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"What job name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: job_name, schedule, environment\"], \"missing_fields\": [\"job_name\", \"schedule\", \"environment\"], \"parameters\": {\"target\": \"reporting\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0069", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nticket: security production api, runtime Python, region West US, diagnostics on\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_web_app\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"app_name\": \"security-web-app\", \"diagnostics_enabled\": true, \"environment\": \"prod\", \"region\": \"westus\", \"runtime\": \"python311\", \"team\": \"security\"}, \"status\": \"routed\", \"workflow\": \"create_web_app\"}"}
{"id": "eval-0070", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nSet up a daily job for reporting for the finance team in development.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"environment\": \"dev\", \"job_name\": \"finance-nightly-job\", \"schedule\": \"0 2 * * *\", \"target\": \"reporting\", \"team\": \"finance\", \"timezone\": \"America/Los_Angeles\"}, \"status\": \"routed\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0071", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a service account for the platform team.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_service_account\"}], \"clarifying_question\": \"What account name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: account_name, environment\"], \"missing_fields\": [\"account_name\", \"environment\"], \"parameters\": {\"team\": \"platform\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0072", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nPrep access and automation for the new project.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"create_web_app\"}, {\"confidence\": 0.31, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}
{"id": "eval-0073", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nCreate a service account named reporting-svc for the reporting team in production.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.92, \"workflow\": \"create_service_account\"}], \"clarifying_question\": null, \"confidence\": 0.92, \"failure_reasons\": [], \"missing_fields\": [], \"parameters\": {\"account_name\": \"reporting-svc\", \"description\": \"Service identity for workflow automation.\", \"environment\": \"prod\", \"team\": \"reporting\"}, \"status\": \"routed\", \"workflow\": \"create_service_account\"}"}
{"id": "eval-0074", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nscheduler request for reporting\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.74, \"workflow\": \"create_scheduler_job\"}], \"clarifying_question\": \"What job name should RouterCore use?\", \"confidence\": 0.74, \"failure_reasons\": [\"Missing required fields: job_name, schedule, environment\"], \"missing_fields\": [\"job_name\", \"schedule\", \"environment\"], \"parameters\": {\"target\": \"reporting\"}, \"status\": \"needs_clarification\", \"workflow\": \"create_scheduler_job\"}"}
{"id": "eval-0075", "text": "You are RouterCore, a routing model for DevOps agent workflows.\nReturn only valid JSON matching the RouterCore schema.\nDo not include markdown, explanations, or code fences.\n\nAllowed statuses:\nrouted, needs_clarification, requires_confirmation, rejected, fallback\n\nAllowed workflows:\ncreate_web_app, create_storage_bucket, create_service_account, grant_iam_role, create_scheduler_job\n\nRequired JSON fields:\nstatus, workflow, confidence, parameters, missing_fields, candidate_workflows, failure_reasons, clarifying_question\n\nUser request:\nNeed cloud stuff for finance.\n\nRouterCore JSON:\n{\"candidate_workflows\": [{\"confidence\": 0.38, \"workflow\": \"create_scheduler_job\"}, {\"confidence\": 0.31, \"workflow\": \"grant_iam_role\"}], \"clarifying_question\": \"Are you creating an app, storage, identity, IAM access, or a scheduled job?\", \"confidence\": 0.34, \"failure_reasons\": [\"Request is ambiguous across multiple workflows.\"], \"missing_fields\": [], \"parameters\": {}, \"status\": \"needs_clarification\", \"workflow\": null}"}