DiffuCoder-7B-Instruct

The DiffuCoder-7B-Instruct model builds on the DiffuCoder-7B-Base checkpoint with instruction-tuning to better follow code-related prompts.

  • Training recipe: We introduce a new pad token and train the model conditionally, at a fixed sequence length, on OpenCoder-SFT data for 5 epochs.

  • Benchmarks: Demonstrates stronger instruction-following capabilities than the Base model.
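As a rough illustration of the fixed-length conditional setup in the training recipe, the sketch below pads a prompt/response pair to a fixed length and marks which positions come after the prompt. It is not the released training code: the function name, the `PAD_ID` value, and the choice to treat padded positions as training targets are all assumptions made for illustration.

```python
from typing import List, Tuple

PAD_ID = 0  # hypothetical id for the newly introduced pad token


def build_fixed_length_example(
    prompt_ids: List[int],
    response_ids: List[int],
    max_len: int,
    pad_id: int = PAD_ID,
) -> Tuple[List[int], List[bool]]:
    """Concatenate prompt and response, then truncate or pad to max_len.

    Returns (input_ids, is_target). is_target is False on prompt positions
    (the condition) and True everywhere else, including padding. Treating
    pad positions as targets is an assumption of this sketch.
    """
    ids = (prompt_ids + response_ids)[:max_len]
    input_ids = ids + [pad_id] * (max_len - len(ids))
    prompt_len = min(len(prompt_ids), max_len)
    is_target = [False] * prompt_len + [True] * (max_len - prompt_len)
    return input_ids, is_target


# Example: a 2-token prompt and a 3-token response padded to length 8.
input_ids, is_target = build_fixed_length_example([5, 6], [7, 8, 9], max_len=8)
```

With every example sharing the same length, batches need no dynamic padding, and the prompt positions can be held fixed while the remaining positions are learned.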

More details and usage examples:

Acknowledgement

To power this HuggingFace model release, we reuse Dream's modeling architecture and generation utils.

Format: Safetensors
Model size: 7.62B params
Tensor type: BF16

Model tree for apple/DiffuCoder-7B-Instruct

Base model: Qwen/Qwen2.5-7B