macadeliccc/Orpo-GutenLlama-3-8B-v2

Text Generation

text-generation-inference


Orpo-GutenLlama-3-8B-v2

Training Params

Learning Rate: 8e-6
Batch Size: 1
Eval Batch Size: 1
Gradient accumulation steps: 4
Epochs: 3
Training Loss: 0.88

Training time: 4 hours on 1x 4090. This is a small fine-tune on 1,800 samples, done to get comfortable with ORPO fine-tuning before scaling up.
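The parameters above imply an effective batch size and a total optimizer step count, which can help when sanity-checking a run of this size. The following is a minimal sketch, not the author's training script: the `RunParams` class and both helper functions are hypothetical names, and it assumes that all 1,800 samples were used for training (no held-out split) and that "1x 4090" means a single GPU.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class RunParams:
    # Values copied from the training parameters listed above.
    learning_rate: float = 8e-6
    batch_size: int = 1
    grad_accum_steps: int = 4
    epochs: int = 3
    # Assumption: all 1,800 samples are used for training.
    num_samples: int = 1800
    # Assumption: a single GPU, so no data-parallel multiplier.
    num_gpus: int = 1


def effective_batch_size(p: RunParams) -> int:
    """Samples consumed per optimizer update."""
    return p.batch_size * p.grad_accum_steps * p.num_gpus


def optimizer_steps(p: RunParams) -> int:
    """Total optimizer updates over all epochs."""
    steps_per_epoch = math.ceil(p.num_samples / effective_batch_size(p))
    return steps_per_epoch * p.epochs


params = RunParams()
print(effective_batch_size(params))  # 4
print(optimizer_steps(params))       # 1350
```

Under these assumptions, the run performs 450 optimizer updates per epoch. Training frameworks can differ in how they round a partial final accumulation step, but that does not matter here because 1,800 divides evenly by 4.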


Model size: 8.03B params
Tensor type: FP16
Format: Safetensors
