MBZUAI
/

MobiLlama-05B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

omkarthawakar commited on Feb 26, 2024

Commit

7dbe7be

•

1 Parent(s): 858e5f8

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -46,6 +46,18 @@ print(tokenizer.batch_decode(outputs[:, input_ids.shape[1]:-1])[0].strip())
 ```
 ## Hyperparameters
 | Hyperparameter      | Value |
 | ----------- | ----------- |
@@ -79,4 +91,8 @@ print(tokenizer.batch_decode(outputs[:, input_ids.shape[1]:-1])[0].strip())
 Given the nature of the training data, the MobiLlama-05B model is best suited for prompts using the QA format, the chat format, and the code format.
 ## Citation
-Coming soon

 ```
+## Training DataMix
+| Subset      | Tokens (Billion) |
+| ----------- | ----------- |
+| Arxiv      | 30.00       |
+| Book   | 28.86        |
+| C4   | 197.67        |
+| Refined-Web   | 665.01        |
+| StarCoder   | 291.92        |
+| StackExchange   | 21.75        |
+| Wikipedia   | 23.90        |
+| Total | 1259.13 |
 ## Hyperparameters
 | Hyperparameter      | Value |
 | ----------- | ----------- |
 Given the nature of the training data, the MobiLlama-05B model is best suited for prompts using the QA format, the chat format, and the code format.
 ## Citation
+**BibTeX:**
+```bibtex
+coming soon
+```