You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Project | Code | Collection

OpenThinkerAgent-32B-SFT-100K

OpenThoughts-Agent is an open-source effort to curate the best datasets for training agents. Our release includes datasets, models and our research codebase.

OpenThinkerAgent-32B-SFT-100K is post-trained from Qwen/Qwen3-32B with full-parameter SFT on the 100,000-example OpenThoughts-Agent-SFT-100K dataset (Top-4 task sources, GLM-4.7-AWQ teacher in the terminus-2 harness, ≥5-turn trace filter).

Performance

Evaluated in the terminus-2 harness (pass@1, mean over 3 stochastic re-runs):

Model Harness SWE-Bench-Verified-100 OpenThoughts-TBLite Terminal-Bench 2.0
Qwen/Qwen3-32B Terminus-2 26.7 13.7 7.5
OpenThinkerAgent-32B-SFT-100K Terminus-2 55.7 41.3 26.2

Data

The model is trained on OpenThoughts-Agent-SFT-100K: (task, agent-trajectory) pairs from the Top-4 task sources (SWE-Smith, StackExchange-SuperUser, StackExchange-Tezos with synthetic augmentation, IssueTasks). Trajectories are generated by GLM-4.7-AWQ in the terminus-2 harness and filtered to traces with at least 5 model turns.

Training hyperparameters

  • learning_rate: 4e-05
  • lr_scheduler_type: cosine, warmup_ratio 0.1
  • global_batch_size: 96
  • num_epochs: 5
  • cutoff_len: 32768
  • precision: bf16, DeepSpeed ZeRO-3

Links

Citation

@misc{openthoughts-agent,
  author = {Team, OpenThoughts-Agent},
  title = {{OpenThoughts-Agent: Data Recipes for Agentic Models}},
  howpublished = {https://www.openthoughts.ai/blog/agent},
  year = {2026}
}
Downloads last month
14
Safetensors
Model size
677k params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Artificial-Production-Units/OpenThinkerAgent-32B-SFT-100K

Base model

Qwen/Qwen3-32B
Finetuned
(507)
this model

Dataset used to train Artificial-Production-Units/OpenThinkerAgent-32B-SFT-100K