ConflictEnv Final Reasoning Model

This is the final fine-tuned model for the ConflictEnv executive assistant task. It was trained with GRPO (Group Relative Policy Optimization) to resolve complex scheduling conflicts, with an emphasis on reasoning before acting.

Usage

Start each prompt with Scenario: ... Details: ... The model responds with a <thought> block containing its reasoning, followed by a JSON action.
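As a rough illustration of the format described above, the following Python sketch builds a prompt and splits a completion into its reasoning and its action. Several details are assumptions not stated in this card: that the Scenario and Details fields sit on separate lines, that the reasoning is closed with a `</thought>` tag, and that the action is a single JSON object. The helper names `build_prompt` and `parse_response` and the keys in the sample action are hypothetical.

```python
import json
import re


def build_prompt(scenario: str, details: str) -> str:
    """Format a prompt in the 'Scenario: ... Details: ...' shape.

    Assumption: the two fields are separated by a newline. The card only
    specifies the order of the two labels, not the separator.
    """
    return f"Scenario: {scenario}\nDetails: {details}"


# Assumption: the reasoning block is closed with </thought>.
_THOUGHT_RE = re.compile(r"<thought>(.*?)</thought>", re.DOTALL)


def parse_response(text: str):
    """Split a completion into (thought, action).

    Raises ValueError if either part is missing.
    """
    match = _THOUGHT_RE.search(text)
    if match is None:
        raise ValueError("no <thought> block found")
    thought = match.group(1).strip()

    # Look for the JSON action only in the text after the thought block.
    rest = text[match.end():]
    start = rest.find("{")
    if start == -1:
        raise ValueError("no JSON object found after the <thought> block")

    # raw_decode parses one JSON value and tolerates trailing text.
    action, _ = json.JSONDecoder().raw_decode(rest[start:])
    return thought, action
```

For example, `parse_response('<thought>Two meetings overlap.</thought>\n{"action": "reschedule"}')` yields the reasoning string and a dict. The `"action"` key here is illustrative only; the actual action schema is not documented in this card.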

Safetensors
Model size: 2B params
Tensor type: F16

Space using purvansh01/conflict-env-final 1