LLM360/K2

victormiller committed on Apr 18

Commit aa9df8d • 1 Parent(s): bfa3f60

Update README.md

Files changed (1)

README.md +3 -0

README.md CHANGED

@@ -16,6 +16,9 @@ K2 is a fully transparent large language model on par with Llama 2 - 70B.
 <center><img src="eval_table_temp.png" alt="eval table"/></center>
 ## Datasets and Mix
+The following data mix was used to train K2 and achieve results in line with Llama 2 70B. The full data sequence will be available soon.
 | Dataset      | Starting Tokens      | Multiplier      | Total Tokens      |% of Total      |
 | ----------- | ----------- | ----------- | ----------- | ----------- |
 | dm-math   | 4.33B        | 3x       | 13B       | 1%       |