This model is a fine-tuned version of unsloth/meta-llama-3.1-8b-instruct-bnb-4bit.
The model is trained only on successful episodes produced by the top 10 models from clembench benchmark versions 0.9 and 1.0. Models were ranked by their total number of successful episodes across all games.
Place | Model pair |
---|---|
1 | gpt-4-0613-t0.0--gpt-4-0613-t0.0 |
2 | claude-v1.3-t0.0--claude-v1.3-t0.0 |
3 | gpt-4-1106-preview-t0.0--gpt-4-1106-preview-t0.0 |
4 | gpt-4-t0.0--gpt-4-t0.0 |
5 | gpt-4-0314-t0.0--gpt-4-0314-t0.0 |
6 | claude-2.1-t0.0--claude-2.1-t0.0 |
7 | gpt-4-t0.0--gpt-3.5-turbo-t0.0 |
8 | claude-2-t0.0--claude-2-t0.0 |
9 | gpt-3.5-turbo-1106-t0.0--gpt-3.5-turbo-1106-t0.0 |
10 | gpt-3.5-turbo-0613-t0.0--gpt-3.5-turbo-0613-t0.0 |
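
The selection rule described above (rank models by their number of successful episodes across all games, keep the top 10, and train only on their successful episodes) could be sketched roughly as follows. This is an illustrative sketch, not the pipeline actually used for this model: the record keys `model`, `game`, and `success` and the function name `select_training_episodes` are assumptions made for the example and are not taken from clembench.

```python
from collections import Counter


def select_training_episodes(episodes, top_k=10):
    """Keep successful episodes from the top_k models by success count.

    `episodes` is a list of dicts with the hypothetical keys
    "model" (str), "game" (str), and "success" (bool).
    """
    # Count successful episodes per model, summed over all games.
    success_counts = Counter(ep["model"] for ep in episodes if ep["success"])

    # Rank models by that count (descending) and keep the top_k.
    top_models = [model for model, _ in success_counts.most_common(top_k)]
    top_set = set(top_models)

    # Keep only successful episodes played by one of the top-ranked models.
    selected = [
        ep for ep in episodes if ep["success"] and ep["model"] in top_set
    ]
    return top_models, selected
```

Calling `select_training_episodes(episodes)` on a list of such records returns the ranked model names together with the filtered episodes; failed episodes are dropped even when they come from a top-ranked model.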
Training Data: D20001
The following hyperparameters were used during training: