BrokenKeyboard / README.md
dhanushreddy29's picture
Update README.md
c03dfcd verified
|
raw
history blame
No virus
770 Bytes
---
datasets:
- argilla/distilabel-intel-orca-dpo-pairs
language:
- en
license: cc-by-nc-4.0
base_model:
- upstage/SOLAR-10.7B-Instruct-v1.0
---
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
Just testing out LLM Finetuning. Finetuned on [upstage/SOLAR-10.7B-Instruct-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0) using [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs).
Followed the Google Colab mentioned in this article: [https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac)