Cyrile committed on
Commit
2d67395
1 Parent(s): 43e4029

Update README.md

  1. README.md +3 -3
README.md CHANGED
@@ -46,9 +46,9 @@ Here is the table summarizing the architecture used for training, along with the
 
  | model | Architecture | Training time (h) | Inference speed (tokens per second) |
  |:----------------------:|:-------------:|:-----------------:|:-----------------------------------:|
- | bloomz-560m-sft-chat | 1 x A100 40GB | 41 | 29 |
- | bloomz-3b-sft-chat | 1 x A100 40GB | 140 | 13 |
- | bloomz-7b1-mt-sft-chat | 4 x A100 40GB | 268 | 8 |
+ | [bloomz-560m-sft-chat](https://huggingface.co/cmarkea/bloomz-560m-sft-chat) | 1 x A100 40GB | 41 | 29 |
+ | [bloomz-3b-sft-chat](https://huggingface.co/cmarkea/bloomz-3b-sft-chat) | 1 x A100 40GB | 140 | 13 |
+ | [bloomz-7b1-mt-sft-chat](https://huggingface.co/cmarkea/bloomz-7b1-mt-sft-chat) | 4 x A100 40GB | 268 | 8 |
 
  Experimentations
  ----------------