Update README.md
Browse filesadded training info
README.md
CHANGED
@@ -22,7 +22,21 @@ This model is a fine-tuned version of [unsloth/meta-llama-3.1-8b-instruct-bnb-4b
|
|
22 |
|
23 |
## Model description
|
24 |
|
25 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
## Intended uses & limitations
|
28 |
|
@@ -30,7 +44,7 @@ More information needed
|
|
30 |
|
31 |
## Training and evaluation data
|
32 |
|
33 |
-
|
34 |
|
35 |
## Training procedure
|
36 |
|
|
|
22 |
|
23 |
## Model description
|
24 |
|
25 |
+
The Model is trained only on successful episodes produced by the top 10 models from the clembench benchmark version 0.9 and 1.0. The success was measured in terms of most overall
|
26 |
+
successful episodes across all games.
|
27 |
+
|
28 |
+
| Place | Item |
|
29 |
+
|-------|------|
|
30 |
+
| 1 | gpt-4-0613-t0.0--gpt-4-0613-t0.0 |
|
31 |
+
| 2 | claude-v1.3-t0.0--claude-v1.3-t0.0 |
|
32 |
+
| 3 | gpt-4-1106-preview-t0.0--gpt-4-1106-preview-t0.0 |
|
33 |
+
| 4 | gpt-4-t0.0--gpt-4-t0.0 |
|
34 |
+
| 5 | gpt-4-0314-t0.0--gpt-4-0314-t0.0 |
|
35 |
+
| 6 | claude-2.1-t0.0--claude-2.1-t0.0 |
|
36 |
+
| 7 | gpt-4-t0.0--gpt-3.5-turbo-t0.0 |
|
37 |
+
| 8 | claude-2-t0.0--claude-2-t0.0 |
|
38 |
+
| 9 | gpt-3.5-turbo-1106-t0.0--gpt-3.5-turbo-1106-t0.0 |
|
39 |
+
| 10 | gpt-3.5-turbo-0613-t0.0--gpt-3.5-turbo-0613-t0.0 |
|
40 |
|
41 |
## Intended uses & limitations
|
42 |
|
|
|
44 |
|
45 |
## Training and evaluation data
|
46 |
|
47 |
+
Traning Data: D20001
|
48 |
|
49 |
## Training procedure
|
50 |
|