Uploaded model
- Developed by: darvec
- License: apache-2.0
- Finetuned from model : unsloth/llama-3-8b-Instruct
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
Model tree for darvec/rl-epoch1
Base model
unsloth/llama-3-8b-Instruct