sardukar's picture
Update README.md
d31d779 verified
|
raw
history blame contribute delete
No virus
523 Bytes
---
library_name: peft
base_model: NousResearch/Meta-Llama-3-8B-Instruct
license: mit
datasets:
- sardukar/physiology-mcqa-8k
language:
- en
---
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
This model is a 1 epoch training with ORPO Trainer on the [sardukar/physiology-mcqa-8k](https://huggingface.co/datasets/sardukar/physiology-mcqa-8k) dataset
Base model is NousResearch/Meta-Llama-3-8B-Instruct
**Training results**
![train_results](physiology-8k-rtx3060-train-complete.png)