valerielucro
/

mistral_gsm8k_dpo_cot_beta_0.8

Inference Endpoints

Model card Files Files and versions Community

mistral_gsm8k_dpo_cot_beta_0.8

1 contributor

History: 3 commits

valerielucro's picture

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch

9c07365 verified 4 months ago

.gitattributes

1.52 kB

initial commit 4 months ago
README.md

5.18 kB

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago
adapter_config.json

753 Bytes

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago
adapter_model.safetensors

168 MB
LFS

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago
special_tokens_map.json

437 Bytes

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago
tokenizer.json

1.8 MB

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago
tokenizer.model

493 kB
LFS

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago
tokenizer_config.json

1.47 kB

adapter trained with DPO on the gsm8k preference dataset with cot and 1 epoch 4 months ago