
Quantization made by Richard Erkhov.


Llama-3-11.5B-Instruct-V2 - GGUF

Original model description:

license: other
license_name: llama-3
license_link: https://llama.meta.com/llama3/license/

Llama-3-11.5B-Instruct-v2

Thank you to Meta for the Meta-Llama-3-8B-Instruct weights.


This is an upscaling of the Meta-Llama-3-8B-Instruct model using the techniques created for chargoddard/mistral-11b-slimorca. The model has been upscaled from 8B parameters to 11.5B parameters without any continued pretraining or fine-tuning.
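Depth upscales of this kind are typically expressed as a mergekit passthrough merge that stacks overlapping slices of the same model. The sketch below assumes the layer ranges used by the mistral-11b-slimorca recipe ([0, 24] and [8, 32]); the exact ranges used for this model are not stated in the card.

```yaml
# Hypothetical mergekit config for a Llama-3-8B depth upscale.
# Layer ranges follow the mistral-11b-slimorca recipe (assumption).
slices:
  - sources:
      - model: meta-llama/Meta-Llama-3-8B-Instruct
        layer_range: [0, 24]
  - sources:
      - model: meta-llama/Meta-Llama-3-8B-Instruct
        layer_range: [8, 32]
merge_method: passthrough
dtype: bfloat16
```

The passthrough method copies the selected layers verbatim rather than averaging weights, which is why no further training is strictly required for the merged model to produce coherent output.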

Unlike version 1, this model has no issues at fp16 or at any quantization level.

The model that was used to create this one is linked below:

https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
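As a rough sanity check on the stated sizes, the jump from 8B to 11.5B parameters is consistent with extending the base model's 32 decoder layers to 48 duplicated layers. The architecture numbers below are the published Llama-3-8B config; the 48-layer total is an assumption about this upscale, not a figure from the card.

```python
# Back-of-the-envelope parameter count for a Llama-3-8B depth upscale.
HIDDEN = 4096         # hidden size
INTERMEDIATE = 14336  # MLP intermediate size
HEADS, KV_HEADS = 32, 8
VOCAB = 128256

head_dim = HIDDEN // HEADS      # 128
kv_dim = KV_HEADS * head_dim    # 1024 (grouped-query attention)

attn = HIDDEN * (HIDDEN + kv_dim + kv_dim + HIDDEN)  # q, k, v, o projections
mlp = 3 * HIDDEN * INTERMEDIATE                      # gate, up, down projections
per_layer = attn + mlp                               # norms are negligible

embeddings = 2 * VOCAB * HIDDEN  # input embedding + untied lm_head

print(f"32 layers: {(32 * per_layer + embeddings) / 1e9:.2f}B")  # ~8.03B
print(f"48 layers: {(48 * per_layer + embeddings) / 1e9:.2f}B")  # ~11.52B
```

The 48-layer total lands within rounding distance of the advertised 11.5B parameters.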


Llama-3-11.5B-Instruct-V2

| Metric | Value |
|---|---|
| Avg. | 63.91 |
| AI2 Reasoning Challenge (25-Shot) | 57.68 |
| HellaSwag (10-Shot) | 78.59 |
| MMLU (5-Shot) | 67.35 |
| TruthfulQA (0-shot) | 35.86 |
| Winogrande (5-shot) | 74.74 |
| GSM8k (5-shot) | 69.37 |
Original Meta-Llama-3-8B-Instruct

| Metric | Value |
|---|---|
| Avg. | 66.87 |
| AI2 Reasoning Challenge (25-Shot) | 60.75 |
| HellaSwag (10-Shot) | 78.55 |
| MMLU (5-Shot) | 67.07 |
| TruthfulQA (0-shot) | 51.65 |
| Winogrande (5-shot) | 74.51 |
| GSM8k (5-shot) | 68.69 |
Downloads last month: 183
Format: GGUF
Model size: 11.5B params
Architecture: llama

Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
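When choosing a quantization level, file size scales roughly with parameter count times bits per weight. The sketch below estimates download sizes for this 11.5B model; the bits-per-weight figures are approximate averages for common llama.cpp quant types (an assumption, since k-quants mix precisions across tensors), not exact values for these files.

```python
# Rough GGUF file-size estimate: params * bits-per-weight / 8 bytes.
PARAMS = 11.5e9  # parameter count from the card

# Approximate average bits per weight for common llama.cpp quant
# types (assumption; actual files vary by a few percent).
approx_bpw = {
    "Q2_K": 2.6,
    "Q3_K_M": 3.9,
    "Q4_K_M": 4.8,
    "Q5_K_M": 5.7,
    "Q6_K": 6.6,
    "Q8_0": 8.5,
}

def est_size_gb(params: float, bpw: float) -> float:
    """Estimated file size in gigabytes (1 GB = 1e9 bytes)."""
    return params * bpw / 8 / 1e9

for name, bpw in approx_bpw.items():
    print(f"{name}: ~{est_size_gb(PARAMS, bpw):.1f} GB")
```

At these rates, the 4-bit variant should land near 7 GB and the 8-bit variant near 12 GB, which is the usual trade-off space between VRAM budget and output quality.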
