IlyaGusev
/

vikhr_nemo_orpo_dostoevsky_12b

Model card Files Files and versions Community

Edit model card

Vikhr-Nemo fine-tuned with contrastive Russian literature.

Base model: https://huggingface.co/Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24
Dataset: https://huggingface.co/datasets/40umov/dostoevsky
Method: ORPO
Training config: https://github.com/IlyaGusev/saiga/blob/main/configs/models/doestoevsky_nemo_12b_orpo_m1.json
WandB: https://wandb.ai/ilyagusev/rulm_self_instruct/runs/4v4pcgej

Downloads last month: 22

Safetensors

Model size

12.2B params

Tensor type

BF16

·

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for IlyaGusev/vikhr_nemo_orpo_dostoevsky_12b

Base model

mistralai/Mistral-Nemo-Base-2407

Finetuned

mistralai/Mistral-Nemo-Instruct-2407

Finetuned

Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24

Finetuned

(1)

this model

Merges

1 model

Dataset used to train IlyaGusev/vikhr_nemo_orpo_dostoevsky_12b