valerielucro
/

mistral_gsm8k_sft_and_dpo

Inference Endpoints

Model card Files Files and versions Community

mistral_gsm8k_sft_and_dpo

1 contributor

History: 3 commits

valerielucro's picture

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch

00ed487 verified 5 months ago

.gitattributes

1.52 kB

initial commit 5 months ago
README.md

5.19 kB

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
adapter_config.json

753 Bytes

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
adapter_model.safetensors

168 MB
LFS

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
generation_config.json

111 Bytes

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
special_tokens_map.json

437 Bytes

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
tokenizer.json

1.8 MB

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
tokenizer.model

493 kB
LFS

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago
tokenizer_config.json

1.47 kB

initial adapter with SFT+DPO on sample gsm8k preference dataset and 1 epoch 5 months ago