bhavinjawade/SOLAR-10B-OrcaDPO-Jawade

bhavinjawade committed on Jan 14

Commit d22aa6e • 1 Parent(s): 6a27cf0

Update README.md

Files changed (1)

README.md +2 -0

README.md CHANGED

@@ -10,6 +10,8 @@ datasets:
 This model is an instruction-finetuned version of `upstage/SOLAR-10.7B-Instruct-v1.0`, trained on the Intel Orca DPO pairs dataset using LoRA. Note that the SOLAR-10.7B paper states that the
 original model was already aligned on the Intel Orca DPO pairs. Retraining with DPO and LoRA shows a slight (<1%) improvement on OpenLLM Leaderboard benchmarks over `SOLAR 10.7B-Instruct` and a significant improvement over `SOLAR 10.7B`.
+![model_card_image](SOLAR_ORCA.png)
 ## How to Use This Model
 To use the model `bhavinjawade/SOLAR-10B-OrcaDPO-Jawade`, follow these steps: