djuna/L3.1-ForStHS


merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Model Stock merge method, with vicgalle/Configurable-Llama-3.1-8B-Instruct as the base model.
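
For readers unfamiliar with Model Stock, here is a rough sketch of the idea on flat weight vectors. It assumes the interpolation rule from the Model Stock paper, t = N cos θ / (1 + (N - 1) cos θ), where N is the number of fine-tuned models and θ is the angle between their offsets from the base. The function names are hypothetical, and mergekit's actual implementation works per tensor and may estimate cos θ differently.

```python
import math


def cosine(u, v):
    """Cosine similarity of two flat vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def model_stock_ratio(n_models, cos_theta):
    """Interpolation ratio t = N*cos / (1 + (N-1)*cos)."""
    return n_models * cos_theta / (1 + (n_models - 1) * cos_theta)


def model_stock_merge(base, finetuned):
    """Toy Model Stock merge of flat weight lists (illustrative only)."""
    n = len(finetuned)
    if n < 2:
        raise ValueError("need at least two fine-tuned models")
    # Offsets of each fine-tuned model from the base weights.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]
    # Assumption: estimate cos(theta) as the mean pairwise cosine similarity.
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    cos_theta = sum(cosine(deltas[i], deltas[j]) for i, j in pairs) / len(pairs)
    t = model_stock_ratio(n, cos_theta)
    # Pull the plain average of the fine-tuned weights back toward the base.
    average = [sum(column) / n for column in zip(*finetuned)]
    return [t * a + (1 - t) * b for a, b in zip(average, base)]


if __name__ == "__main__":
    base = [0.0, 0.0]
    print(model_stock_merge(base, [[1.0, 0.2], [0.8, 0.4], [0.9, 0.1]]))
```

When the fine-tuned offsets point in nearly the same direction, t approaches 1 and the result is close to their plain average; when they are nearly orthogonal, t approaches 0 and the result stays near the base.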

Models Merged

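The following models were included in the merge:

ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1 + grimjim/Llama-3-Instruct-abliteration-LoRA-8B
DreadPoor/Heart_Stolen-8B-Model_Stock
rityak/L3.1-FormaxGradient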

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1+grimjim/Llama-3-Instruct-abliteration-LoRA-8B
  - model: DreadPoor/Heart_Stolen-8B-Model_Stock
  - model: rityak/L3.1-FormaxGradient
merge_method: model_stock
base_model: vicgalle/Configurable-Llama-3.1-8B-Instruct
dtype: bfloat16
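
The first entry in the configuration above joins two repository names with `+`. As far as mergekit's model reference syntax goes, this means "apply the LoRA adapter on the right to the model on the left before merging". The sketch below only illustrates how the entries break down and how the merge might be invoked; `split_entry`, `mergekit_command`, `config.yml`, and the output directory name are hypothetical and not part of the original card.

```python
# Model entries copied from the YAML configuration.
MODEL_ENTRIES = [
    "ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1+grimjim/Llama-3-Instruct-abliteration-LoRA-8B",
    "DreadPoor/Heart_Stolen-8B-Model_Stock",
    "rityak/L3.1-FormaxGradient",
]


def split_entry(entry):
    """Split a 'model+lora' reference into (model, lora or None)."""
    model, _, lora = entry.partition("+")
    return model, (lora or None)


def mergekit_command(config_path, output_dir):
    """Build the argv for mergekit's `mergekit-yaml` CLI (paths are placeholders)."""
    return ["mergekit-yaml", config_path, output_dir]


if __name__ == "__main__":
    for entry in MODEL_ENTRIES:
        model, lora = split_entry(entry)
        print(f"model={model} lora={lora}")
    # To actually run the merge, pass this list to subprocess.run(..., check=True).
    print(" ".join(mergekit_command("config.yml", "./L3.1-ForStHS")))
```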

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	28.00
IFEval (0-Shot)	78.13
BBH (3-Shot)	31.39
MATH Lvl 5 (4-Shot)	12.92
GPQA (0-shot)	5.48
MuSR (0-shot)	9.66
MMLU-PRO (5-shot)	30.39
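
As a sanity check, the reported average can be reproduced from the six benchmark scores in the table. This sketch assumes the average is a plain unweighted mean; the leaderboard itself may average unrounded scores, so exact agreement is not guaranteed in general. `Decimal` is used to avoid binary floating-point rounding surprises at the half-way point.

```python
from decimal import Decimal, ROUND_HALF_UP

# Scores copied from the table above.
SCORES = {
    "IFEval (0-Shot)": Decimal("78.13"),
    "BBH (3-Shot)": Decimal("31.39"),
    "MATH Lvl 5 (4-Shot)": Decimal("12.92"),
    "GPQA (0-shot)": Decimal("5.48"),
    "MuSR (0-shot)": Decimal("9.66"),
    "MMLU-PRO (5-shot)": Decimal("30.39"),
}

# Unweighted mean of the six scores.
average = sum(SCORES.values()) / len(SCORES)
# Round to two decimal places, rounding halves up.
rounded = average.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

if __name__ == "__main__":
    print(f"mean = {average}, rounded = {rounded}")
```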


Safetensors

Model size: 8.03B params

Tensor type: BF16
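
The 8.03B parameter count and BF16 tensor type listed above give a quick estimate of the weight footprint: BF16 stores each parameter in 2 bytes. The sketch below covers weights only; activations, KV cache, and framework overhead are not included, and the helper name is hypothetical.

```python
PARAM_COUNT = 8.03e9       # parameter count from the model card
BYTES_PER_PARAM_BF16 = 2   # bfloat16 is a 16-bit format


def weight_bytes(n_params, bytes_per_param):
    """Approximate size of the raw weights in bytes."""
    return n_params * bytes_per_param


total = weight_bytes(PARAM_COUNT, BYTES_PER_PARAM_BF16)
size_gb = total / 1e9       # decimal gigabytes
size_gib = total / 2**30    # binary gibibytes

if __name__ == "__main__":
    print(f"about {size_gb:.2f} GB ({size_gib:.2f} GiB) of weights")
```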


Model tree for djuna/L3.1-ForStHS

ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.1

DreadPoor/Heart_Stolen-8B-Model_Stock

grimjim/Llama-3-Instruct-abliteration-LoRA-8B

rityak/L3.1-FormaxGradient

vicgalle/Configurable-Llama-3.1-8B-Instruct


Collection including djuna/L3.1-ForStHS

Working Merge in my Profile


Evaluation results

Source: Open LLM Leaderboard

Benchmark	Metric	Value
IFEval (0-Shot)	strict accuracy	78.130
BBH (3-Shot)	normalized accuracy	31.390
MATH Lvl 5 (4-Shot)	exact match	12.920
GPQA (0-shot)	acc_norm	5.480
MuSR (0-shot)	acc_norm	9.660
MMLU-PRO (5-shot), test set	accuracy	30.390
