sardukar's picture
Update README.md
d31d779 verified
|
raw
history blame contribute delete
No virus
523 Bytes
metadata
library_name: peft
base_model: NousResearch/Meta-Llama-3-8B-Instruct
license: mit
datasets:
  - sardukar/physiology-mcqa-8k
language:
  - en

Model Card for Model ID

This model is a 1 epoch training with ORPO Trainer on the sardukar/physiology-mcqa-8k dataset

Base model is NousResearch/Meta-Llama-3-8B-Instruct

Training results train_results