Trained for 3 epochs on Norquinal's claude_multiround_chat_30k dataset.

Note: this is another experiment. Feel free to give it a try!

Prompt template:

### HUMAN:
{prompt}

### RESPONSE:
<leave a newline for the model to answer>
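The template above can be filled in programmatically. The following is a minimal sketch, not the author's own tooling: `PROMPT_TEMPLATE` and `build_prompt` are hypothetical names introduced here for illustration, and the single blank line between the two sections is assumed from the layout shown above.

```python
# Hypothetical helper for illustration; the names are not part of the model card.
# The string mirrors the template: a HUMAN section, a blank line, then a
# RESPONSE header followed by a newline where the model starts answering.
PROMPT_TEMPLATE = "### HUMAN:\n{prompt}\n\n### RESPONSE:\n"


def build_prompt(user_message: str) -> str:
    """Insert the user's message into the prompt template."""
    # Strip surrounding whitespace so the template controls the spacing.
    return PROMPT_TEMPLATE.format(prompt=user_message.strip())


if __name__ == "__main__":
    print(build_prompt("Explain what a tokenizer does."))
```

The returned string ends with the newline after `### RESPONSE:`, so it can be passed directly to whatever generation call you use.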

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Model size: 3.43B params

Tensor type: FP16

Weights format: Safetensors

Model repository: harborwater/open-llama-3b-claude-30k

Scores as reported by the Open LLM Leaderboard:

| Benchmark | Metric | Split | Value |
|---|---|---|---|
| AI2 Reasoning Challenge (25-shot) | normalized accuracy | test | 41.720 |
| HellaSwag (10-shot) | normalized accuracy | validation | 72.640 |
| MMLU (5-shot) | accuracy | test | 24.030 |
| TruthfulQA (0-shot) | mc2 | validation | 38.460 |
| Winogrande (5-shot) | accuracy | validation | 66.540 |
| GSM8k (5-shot) | accuracy | test | 2.200 |
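
For a single summary number, the six scores listed above can be averaged. This is a small sketch that assumes an unweighted mean over the six benchmarks; the `scores` dictionary is just the values from this card typed in by hand, and the result is simple arithmetic, not an additional reported metric.

```python
# Benchmark scores copied from the results listed in this card.
scores = {
    "ARC (25-shot)": 41.72,
    "HellaSwag (10-shot)": 72.64,
    "MMLU (5-shot)": 24.03,
    "TruthfulQA (0-shot)": 38.46,
    "Winogrande (5-shot)": 66.54,
    "GSM8k (5-shot)": 2.20,
}

# Assumption: every benchmark is weighted equally.
average = sum(scores.values()) / len(scores)

if __name__ == "__main__":
    print(f"Unweighted average: {average:.2f}")  # prints "Unweighted average: 40.93"
```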