ahmedabdelwahed
/

Mojiz-DPO-1e-5-4000-steps-beta-1e-1

text2text-generation

Inference Endpoints

Model card Files Files and versions Community

This model was aligned using DPO with a 1e-5 learning rate for 4000 steps

Downloads last month: 14

Safetensors

Model size

582M params

Tensor type

F32

·

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.