Harsha-Hermes-2.5-Mistral-7B
Harsha-Hermes-2.5-Mistral-7B is a DPO fine-tune of teknium/OpenHermes-2.5-Mistral-7B using the Intel/orca_dpo_pairs preference dataset and DPO notebook from Maxime Labonne.
- Downloads last month
- 1,468
Harsha-Hermes-2.5-Mistral-7B
Harsha-Hermes-2.5-Mistral-7B is a DPO fine-tune of teknium/OpenHermes-2.5-Mistral-7B using the Intel/orca_dpo_pairs preference dataset and DPO notebook from Maxime Labonne.