license: mit
language:
  - en
base_model: Technoculture/MT7Bi-sft
datasets:
  - Technoculture/MT7Bi-alpha-dpo-v0.2

MT7Bi-dpo

Technoculture/MT7Bi-sft (base) + Technoculture/MT7Bi-alpha-dpo-v0.2 (adapter)
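
The base-plus-adapter composition above can be sketched as follows. This is a minimal sketch, not a confirmed loading recipe: it assumes the adapter repository is stored in a PEFT-compatible format and that the `transformers` and `peft` packages are installed. The function name `load_base_with_adapter` is hypothetical and introduced only for illustration.

```python
# Repository IDs taken from this model card.
BASE_MODEL_ID = "Technoculture/MT7Bi-sft"
ADAPTER_ID = "Technoculture/MT7Bi-alpha-dpo-v0.2"


def load_base_with_adapter(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base model and attach the adapter on top of it.

    Assumption: the adapter repo is in PEFT format. If it is not,
    PeftModel.from_pretrained will raise an error.
    """
    # Imported lazily so the heavy dependencies are only needed
    # when the function is actually called.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    return tokenizer, model
```

Calling `load_base_with_adapter()` downloads both repositories from the Hub, so it needs network access and enough memory for a 7B-parameter model.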

Open LLM Leaderboard

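| Model Name | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
|------------|-------|-----------|-------|------------|------------|-------|
| Orca-2-7b | 78.4 | 76.1 | 53.7 | 52.4 | 74.2 | 47.2 |
| LLAMA-2-7b | 43.2 | 77.1 | 44.4 | 38.7 | 69.5 | 16 |
| MT7Bi-sft | 54.1 | 75.11 | - | 43.08 | 72.14 | 15.54 |
| MT7Bi-dpo | 54.69 | 75.89 | 52.82 | 45.48 | 71.58 | 25.93 |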
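
To make the comparison between the SFT and DPO rows easier to read, the sketch below computes per-benchmark differences and a simple mean from the scores in the table. The helper names `mean_score` and `score_deltas` are hypothetical. Averaging only the benchmarks that have a value is an assumption made here, not the leaderboard's official aggregation.

```python
# Scores copied from the table above; None marks a missing value ("-").
BENCHMARKS = ["ARC", "HellaSwag", "MMLU", "TruthfulQA", "Winogrande", "GSM8K"]
SCORES = {
    "Orca-2-7b": [78.4, 76.1, 53.7, 52.4, 74.2, 47.2],
    "LLAMA-2-7b": [43.2, 77.1, 44.4, 38.7, 69.5, 16],
    "MT7Bi-sft": [54.1, 75.11, None, 43.08, 72.14, 15.54],
    "MT7Bi-dpo": [54.69, 75.89, 52.82, 45.48, 71.58, 25.93],
}


def mean_score(values):
    """Average the scores that are present, skipping missing entries."""
    present = [v for v in values if v is not None]
    return sum(present) / len(present)


def score_deltas(before, after):
    """Return after-minus-before per benchmark where both scores exist."""
    return {
        name: a - b
        for name, b, a in zip(BENCHMARKS, before, after)
        if a is not None and b is not None
    }


if __name__ == "__main__":
    deltas = score_deltas(SCORES["MT7Bi-sft"], SCORES["MT7Bi-dpo"])
    for name, delta in deltas.items():
        print(f"{name}: {delta:+.2f}")
    for model, values in SCORES.items():
        print(f"{model}: mean {mean_score(values):.2f}")
```

MMLU is left out of the SFT-to-DPO comparison because the table has no MMLU score for MT7Bi-sft. For the same reason, the MT7Bi-sft mean covers five benchmarks and is not directly comparable to the six-benchmark means of the other rows.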