mlabonne's picture
Create README.md
a2bbf22 verified
|
raw
history blame
No virus
518 Bytes
metadata
license: other
datasets:
  - mlabonne/orpo-dpo-mix-40k
tags:
  - dpo

Daredevil-8B

image/jpeg

This is a DPO fine-tune of Daredevil-8-abliterated trained on one epoch of orpo-dpo-mix-40k.

πŸ† Evaluation

Open LLM Leaderboard

TBD.

Nous

TBD.

🌳 Model family tree

image/png