AgenticRL-blackbox-260401_231502_cd7fb81c

This repository contains Hugging Face exports for checkpoints from the Miles/Megatron run 260401_231502_cd7fb81c.

Each checkpoint is uploaded under its own subdirectory to avoid overwriting sharded weight filenames.

Checkpoints

  • global_step_9/: exported from iter_0000009
  • global_step_19/: exported from iter_0000019
  • global_step_29/: exported from iter_0000029
  • global_step_39/: exported from iter_0000039
  • global_step_49/: exported from iter_0000049
  • global_step_59/: exported from iter_0000059
  • global_step_69/: exported from iter_0000069
  • global_step_79/: exported from iter_0000079
  • global_step_89/: exported from iter_0000089
  • global_step_99/: exported from iter_0000099
  • global_step_109/: exported from iter_0000109
  • global_step_119/: exported from iter_0000119
  • global_step_129/: exported from iter_0000129
  • global_step_139/: exported from iter_0000139
  • global_step_149/: exported from iter_0000149
  • global_step_159/: exported from iter_0000159
  • global_step_169/: exported from iter_0000169
  • global_step_179/: exported from iter_0000179
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for PGCodeLLM/AgenticRL-blackbox-260401_231502_cd7fb81c

Base model

Qwen/Qwen3-32B
Finetuned
(514)
this model