Commit d22aa6e by bhavinjawade (1 parent: 6a27cf0)
Update README.md

README.md CHANGED
@@ -10,6 +10,8 @@ datasets:
 This model card is instruction finetuned version of `upstage/SOLAR-10.7B-Instruct-v1.0` model. Trained on the Intel DPO Orca dataset using LoRA. Though it should be noted SOLAR-10.7B paper states that the
 original model for alignment was trained on Intel ORCA DPO pairs. Retraining using DPO and LoRA shows slight (<1%) improvement on OpenLLM Leaderboard benchmarks against `SOLAR 10.7B-Instruct` and significant over `SOLAR 10.7B`
 
+![model_card_image](SOLAR_ORCA.png)
+
 ## How to Use This Model
 
 To use the model `bhavinjawade/SOLAR-10B-OrcaDPO-Jawade`, follow these steps: