Uploaded model

Developed by: fff1234
License: apache-2.0
Finetuned from model : fff1234/rec_1_shot_test

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: 10

Inference Examples

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for fff1234/DPO_test

Base model

meta-llama/Meta-Llama-3-8B

Quantized

unsloth/llama-3-8b-bnb-4bit

Finetuned

fff1234/rec_1_shot_test

Finetuned

(1)

this model