library_name: transformers
license: llama2

Llama-3-Smaug-2.1-8B

Built with Meta Llama 3


This model was built by applying the Smaug recipe, which targets better performance on real-world multi-turn conversations, to meta-llama/Meta-Llama-3-8B.

Model Description

Evaluation

########## First turn ##########
                                     score
model                          turn
llama3-8b-smaug-2-merged-600   1     8.79375
llama3-8b-smaug-2-merged-150   1     8.71250
llama3-8b-smaug-2-merged-300   1     8.66250
base_Meta-Llama-3-8B-Instruct  1     8.53125
llama3-8b-smaug-2-merged-450   1     8.42500

########## Second turn ##########
                                     score
model                          turn
llama3-8b-smaug-2-merged-450   2     7.8125
llama3-8b-smaug-2-merged-300   2     7.7375
llama3-8b-smaug-2-merged-600   2     7.7250
llama3-8b-smaug-2-merged-150   2     7.7125
base_Meta-Llama-3-8B-Instruct  2     7.5500

########## Average ##########
                               score
model
llama3-8b-smaug-2-merged-600   8.259375
llama3-8b-smaug-2-merged-150   8.212500
llama3-8b-smaug-2-merged-300   8.200000
llama3-8b-smaug-2-merged-450   8.118750
base_Meta-Llama-3-8B-Instruct  8.040625
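
The Average figures above appear to be the unweighted mean of each model's first-turn and second-turn scores; the reported numbers are consistent with that reading, though the card does not state it. The sketch below recomputes the averages and the ranking from the per-turn tables. The names `turn_scores` and `average_scores` are illustrative and not part of any evaluation tooling.

```python
# Per-turn scores copied from the tables above: (first turn, second turn).
turn_scores = {
    "llama3-8b-smaug-2-merged-600": (8.79375, 7.7250),
    "llama3-8b-smaug-2-merged-150": (8.71250, 7.7125),
    "llama3-8b-smaug-2-merged-300": (8.66250, 7.7375),
    "llama3-8b-smaug-2-merged-450": (8.42500, 7.8125),
    "base_Meta-Llama-3-8B-Instruct": (8.53125, 7.5500),
}


def average_scores(scores):
    """Return {model: unweighted mean of its per-turn scores}.

    Assumption: each turn carries equal weight in the average.
    """
    return {model: sum(turns) / len(turns) for model, turns in scores.items()}


averages = average_scores(turn_scores)

# Rank models from highest to lowest average, as in the Average table.
ranking = sorted(averages, key=averages.get, reverse=True)

for model in ranking:
    print(f"{model:<31}{averages[model]:.6f}")
```

Recomputing the average this way is a quick check that the three tables agree with one another. It also makes visible that the ordering changes between turns: the 450 checkpoint is lowest on the first turn and highest on the second.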