Nicohst committed on
Commit
415b42b
1 Parent(s): 7b4ac55

Update README.md

added training info

Files changed (1)
  1. README.md +16 -2
README.md CHANGED
@@ -22,7 +22,21 @@ This model is a fine-tuned version of [unsloth/meta-llama-3.1-8b-instruct-bnb-4b
22
 
23
  ## Model description
24
 
25
- More information needed
26
 
27
  ## Intended uses & limitations
28
 
@@ -30,7 +44,7 @@ More information needed
30
 
31
  ## Training and evaluation data
32
 
33
- More information needed
34
 
35
  ## Training procedure
36
 
 
22
 
23
  ## Model description
24
 
25
+ The model is trained only on successful episodes produced by the top 10 models from clembench benchmark versions 0.9 and 1.0. Models are ranked by their total number of
26
+ successful episodes across all games.
27
+
28
+ | Rank | Model pair |
29
+ |-------|------|
30
+ | 1 | gpt-4-0613-t0.0--gpt-4-0613-t0.0 |
31
+ | 2 | claude-v1.3-t0.0--claude-v1.3-t0.0 |
32
+ | 3 | gpt-4-1106-preview-t0.0--gpt-4-1106-preview-t0.0 |
33
+ | 4 | gpt-4-t0.0--gpt-4-t0.0 |
34
+ | 5 | gpt-4-0314-t0.0--gpt-4-0314-t0.0 |
35
+ | 6 | claude-2.1-t0.0--claude-2.1-t0.0 |
36
+ | 7 | gpt-4-t0.0--gpt-3.5-turbo-t0.0 |
37
+ | 8 | claude-2-t0.0--claude-2-t0.0 |
38
+ | 9 | gpt-3.5-turbo-1106-t0.0--gpt-3.5-turbo-1106-t0.0 |
39
+ | 10 | gpt-3.5-turbo-0613-t0.0--gpt-3.5-turbo-0613-t0.0 |
40
 
41
  ## Intended uses & limitations
42
 
 
44
 
45
  ## Training and evaluation data
46
 
47
+ Training data: D20001
48
 
49
  ## Training procedure
50
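
The selection rule in the added model description (rank models by their number of successful episodes across all games, keep the top 10, then train only on their successful episodes) could be sketched roughly as follows. This is a minimal illustration, not the author's actual pipeline: the record schema (`model`, `game`, `success`), the function name `select_training_episodes`, and the toy records are all assumptions made for the example.

```python
from collections import Counter


def select_training_episodes(episodes, top_k=10):
    """Keep successful episodes from the top_k models ranked by success count.

    `episodes` is a list of dicts with hypothetical keys:
    "model" (str), "game" (str), and "success" (bool).
    """
    # Count successful episodes per model, pooled across all games.
    success_counts = Counter(ep["model"] for ep in episodes if ep["success"])
    # most_common sorts by count, descending; ties keep first-seen order.
    top_models = {model for model, _ in success_counts.most_common(top_k)}
    # Keep only successful episodes played by one of the top models.
    return [ep for ep in episodes if ep["success"] and ep["model"] in top_models]


# Toy records; names and outcomes are illustrative, not real benchmark data.
episodes = [
    {"model": "model-a", "game": "game-1", "success": True},
    {"model": "model-a", "game": "game-2", "success": True},
    {"model": "model-b", "game": "game-1", "success": True},
    {"model": "model-b", "game": "game-2", "success": False},
    {"model": "model-c", "game": "game-1", "success": False},
]

selected = select_training_episodes(episodes, top_k=1)
```

With `top_k=1` only the two successful `model-a` episodes remain. Ranking by raw success count, rather than by success rate, follows the README's wording ("most overall successful episodes"); how ties or per-game weighting were handled is not stated there.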