Update README.md
Browse files
README.md
CHANGED
@@ -46,9 +46,9 @@ Here is the table summarizing the architecture used for training, along with the
|
|
46 |
|
47 |
| model | Architecture | Training time (h) | Inference speed (tokens per second) |
|
48 |
|:----------------------:|:-------------:|:-----------------:|:-----------------------------------:|
|
49 |
-
| bloomz-560m-sft-chat | 1 x A100 40GB | 41 | 29 |
|
50 |
-
| bloomz-3b-sft-chat | 1 x A100 40GB | 140 | 13 |
|
51 |
-
| bloomz-7b1-mt-sft-chat | 4 x A100 40GB | 268 | 8 |
|
52 |
|
53 |
Experimentations
|
54 |
----------------
|
|
|
46 |
|
47 |
| model | Architecture | Training time (h) | Inference speed (tokens per second) |
|
48 |
|:----------------------:|:-------------:|:-----------------:|:-----------------------------------:|
|
49 |
+
| [bloomz-560m-sft-chat](https://huggingface.co/cmarkea/bloomz-560m-sft-chat) | 1 x A100 40GB | 41 | 29 |
|
50 |
+
| [bloomz-3b-sft-chat](https://huggingface.co/cmarkea/bloomz-3b-sft-chat) | 1 x A100 40GB | 140 | 13 |
|
51 |
+
| [bloomz-7b1-mt-sft-chat](https://huggingface.co/cmarkea/bloomz-7b1-mt-sft-chat) | 4 x A100 40GB | 268 | 8 |
|
52 |
|
53 |
Experimentations
|
54 |
----------------
|