valerielucro
/

mistral_gsm8k_preference_dataset_v2_beta_3

Inference Endpoints

Model card Files Files and versions Community

mistral_gsm8k_preference_dataset_v2_beta_3

1 contributor

History: 2 commits

valerielucro's picture

second iteration Qlora with DPO on full gsm8k preference dataset version 2.1 and 1 epoch and rank 64, beta 0.3

28fd9e7 verified 5 months ago

.gitattributes

1.52 kB

initial commit 5 months ago
README.md

5.18 kB

second iteration Qlora with DPO on full gsm8k preference dataset version 2.1 and 1 epoch and rank 64, beta 0.3 5 months ago
adapter_config.json

753 Bytes

second iteration Qlora with DPO on full gsm8k preference dataset version 2.1 and 1 epoch and rank 64, beta 0.3 5 months ago
adapter_model.safetensors

671 MB
LFS

second iteration Qlora with DPO on full gsm8k preference dataset version 2.1 and 1 epoch and rank 64, beta 0.3 5 months ago