auryn_dpo_orpo_english

This is an ORPO fine-tune of meta-llama/Llama-3.2-1B, trained for three epochs on https://huggingface.co/datasets/celsowm/auryn_dpo_orpo_english

Auryn is a fictional place intended to serve as a proof of concept for injecting knowledge into a large language model using ORPO.
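
For readers unfamiliar with ORPO, here is a minimal sketch of the objective as it is usually described: a standard negative log-likelihood term on the preferred answer plus a weighted odds-ratio penalty that pushes the preferred answer above the rejected one. This is not the training code used for this model. The function names, the `lam` weight of 0.1, and the example log-probabilities are illustrative assumptions.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    avg_logp is the length-normalized (per-token average) log-probability
    of a response, so it must be strictly negative.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    """Sketch of the ORPO loss for a single (chosen, rejected) pair.

    lam is a hypothetical weighting value chosen for illustration only.
    """
    # Supervised term: negative log-likelihood of the chosen response.
    nll = -chosen_avg_logp
    # Log odds ratio between the chosen and rejected responses.
    log_odds_ratio = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)
    # -log(sigmoid(z)) written as log(1 + exp(-z)).
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    return nll + lam * odds_ratio_term


# Hypothetical values: the model prefers the chosen answer in the first call
# and the rejected answer in the second.
good = orpo_loss(chosen_avg_logp=-0.5, rejected_avg_logp=-2.0)
bad = orpo_loss(chosen_avg_logp=-2.0, rejected_avg_logp=-0.5)
print(good < bad)
```

In the Auryn setting, the chosen response would be the answer containing the fictional fact and the rejected response an answer without it, so lowering this loss raises the likelihood of the injected knowledge.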

Tutorial here: https://medium.com/@celsoaf/injecting-new-knowledge-into-an-llm-via-fine-tuning-with-orpo-017d3bfdb11b


Format: Safetensors
Model size: 1.24B params
Tensor type: BF16

Base model: meta-llama/Llama-3.2-1B