Edit model card

Llama-3-Smaug-8B

Built with Meta Llama 3

image/png

This model was built using the Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-8B.

Model Description

Evaluation

########## First turn ##########
                   score
model             turn
llama3-8b-smaug-2-merged-600 1   8.79375
llama3-8b-smaug-2-merged-150 1   8.71250
llama3-8b-smaug-2-merged-300 1   8.66250
base_Meta-Llama-3-8B-Instruct 1   8.53125
llama3-8b-smaug-2-merged-450 1   8.42500
########## Second turn ##########
                   score
model             turn
llama3-8b-smaug-2-merged-450 2   7.8125
llama3-8b-smaug-2-merged-300 2   7.7375
llama3-8b-smaug-2-merged-600 2   7.7250
llama3-8b-smaug-2-merged-150 2   7.7125
base_Meta-Llama-3-8B-Instruct 2   7.5500
########## Average ##########
                 score
model
llama3-8b-smaug-2-merged-600  8.259375
llama3-8b-smaug-2-merged-150  8.212500
llama3-8b-smaug-2-merged-300  8.200000
llama3-8b-smaug-2-merged-450  8.118750
base_Meta-Llama-3-8B-Instruct 8.040625
Model First turn Second Turn Average
llama3-8b-smaug-2-merged-600 8.79 7.73 8.26
llama3-8b-smaug-2-merged-450 8.43 7.81 8.12
llama3-8b-smaug-2-merged-300 8.66 7.74 8.20
llama3-8b-smaug-2-merged-150 8.71 7.71 8.21
Meta-Llama-3-8B-Instruct 8.53 7.55 8.04
Downloads last month
2
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.