
QuantFactory/Llama-Spark-GGUF

This is a quantized version of arcee-ai/Llama-Spark, created using llama.cpp.
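
A minimal sketch of how these quantized files can be loaded, assuming the llama-cpp-python bindings; the GGUF filename below is a placeholder, so check the repository's file list for the exact quantization you want:

```python
# Sketch: download one GGUF file from this repo and run a chat turn with
# llama-cpp-python. The filename is a placeholder -- pick an actual file
# from the repository's file list.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

model_path = hf_hub_download(
    repo_id="QuantFactory/Llama-Spark-GGUF",
    filename="Llama-Spark.Q4_K_M.gguf",  # hypothetical filename
)

llm = Llama(model_path=model_path, n_ctx=4096)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Introduce yourself in one sentence."}]
)
print(out["choices"][0]["message"]["content"])
```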

Original Model Card


Llama-Spark is a powerful conversational AI model developed by Arcee.ai. It's built on the foundation of Llama-3.1-8B, combining a fine-tune on our Tome Dataset with a merge into Llama-3.1-8B-Instruct, resulting in a remarkable conversationalist that punches well above its 8B-parameter weight class.

GGUFs available here

Model Description

Llama-Spark is our commitment to consistently delivering the best-performing conversational AI in the 6-9B parameter range. As new base models become available, we'll continue to update and improve Spark to maintain its leadership position.

This model is a successor to our original Arcee-Spark, incorporating advancements and learnings from our ongoing research and development.

Intended Uses

Llama-Spark is intended for use in conversational AI applications, such as chatbots, virtual assistants, and dialogue systems. It excels at engaging in natural and informative conversations.
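
As a hedged illustration of the chat use case, the sketch below runs a single turn against the original (unquantized) arcee-ai/Llama-Spark weights via the transformers text-generation pipeline; the prompt and generation settings are arbitrary placeholders.

```python
# Sketch of a simple chat turn, assuming the upstream arcee-ai/Llama-Spark
# repo loads with standard Llama-3.1 tooling in transformers.
from transformers import pipeline

chat = pipeline(
    "text-generation",
    model="arcee-ai/Llama-Spark",
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a helpful, conversational assistant."},
    {"role": "user", "content": "Explain model merging in two sentences."},
]

# With chat-style input, the pipeline returns the conversation including
# the newly generated assistant message as the last entry.
reply = chat(messages, max_new_tokens=128)
print(reply[0]["generated_text"][-1]["content"])
```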

Training Information

Llama-Spark is built upon the Llama-3.1-8B base model, fine-tuned on the Tome Dataset and merged with Llama-3.1-8B-Instruct.

Evaluation Results

Please note that these scores are consistently higher than those on the OpenLLM Leaderboard; they should be read as relative performance increases rather than weighed directly against leaderboard numbers.


Acknowledgements

We extend our deepest gratitude to PrimeIntellect for being our compute sponsor for this project.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|------:|
| Avg.                | 24.90 |
| IFEval (0-shot)     | 79.11 |
| BBH (3-shot)        | 29.77 |
| MATH Lvl 5 (4-shot) |  1.06 |
| GPQA (0-shot)       |  6.60 |
| MuSR (0-shot)       |  2.62 |
| MMLU-PRO (5-shot)   | 30.23 |
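
For reference, the Avg. row is simply the arithmetic mean of the six benchmark scores above, as the short check below shows:

```python
# Sanity check: the reported Avg. matches the arithmetic mean of the
# six Open LLM Leaderboard scores listed in the table above.
scores = {
    "IFEval (0-shot)": 79.11,
    "BBH (3-shot)": 29.77,
    "MATH Lvl 5 (4-shot)": 1.06,
    "GPQA (0-shot)": 6.60,
    "MuSR (0-shot)": 2.62,
    "MMLU-PRO (5-shot)": 30.23,
}
average = sum(scores.values()) / len(scores)
print(f"Avg. = {average:.2f}")  # -> Avg. = 24.90
```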

Format: GGUF
Model size: 8.03B params
Architecture: llama
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
