Edit model card

Model Card for free-evo-qwen72b-v0.8

Developed by : Freewheelin AI Technical Team

2024 4th May - avg. 81.28 Open Llm Leaderboard

Metric Value
Avg. 81.28
ARC (25-Shot) 79.86
HellaSwag (10-Shot) 91.32
MMLU (5-Shot) 78.00
TruthfulQA (0-shot) 74.85
Winogrande (5-shot) 87.77
GSM8k (5-shot) 75.89

Method

Process

You need two models with the same architecture.

  • Choose one model and fine-tune it to create a gap between the original model and the fine-tuned one. It doesn't matter whether the evaluation score is higher or lower.
  • Merge the two models.
  • Evaluate the merged model.
  • Fine-tune a specific evaluation part of the model if you need to increase the score for that part. (It's unlikely to work as you think, but you can try it.)
  • Merge the models again.
  • Evaluate again.
  • Keep going until the average evaluation score is higher than the original one.

That's it. Simple. You can create a framework to automate this process.

Base Architecture

  • QWEN2

Base Models

  • several QWEN2 based models
Downloads last month
4,851
Safetensors
Model size
72.3B params
Tensor type
FP16
·

Evaluation results