GregorZiegltrumAA
commited on
Commit
•
58b7532
1
Parent(s):
890a986
Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ pipeline_tag: text-generation
|
|
16 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/671a0238b080a748c29b8fea/F1-zbAXF5LGvxpIRrYfU4.png)
|
17 |
|
18 |
|
19 |
-
This Repository holds the model weights for the
|
20 |
|
21 |
You can find all model weights at the following links:
|
22 |
- [umup-research-7b-bf16](https://huggingface.co/Aleph-Alpha/umup-research-7b-bf16)
|
|
|
16 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/671a0238b080a748c29b8fea/F1-zbAXF5LGvxpIRrYfU4.png)
|
17 |
|
18 |
|
19 |
+
This Repository holds the model weights for the u-μP models trained at Aleph Alpha Research, in collaboration with Graphcore, for 72k steps (300B tokens). Please note, that the released checkpoints are not fully converged models and are intended for research use only.
|
20 |
|
21 |
You can find all model weights at the following links:
|
22 |
- [umup-research-7b-bf16](https://huggingface.co/Aleph-Alpha/umup-research-7b-bf16)
|