Update README.md
#76 by ybelkada - opened

README.md CHANGED
```diff
@@ -2444,11 +2444,8 @@ Initial prompting experiments using interim checkpoints: https://huggingface.co/
 
 
 ## Original checkpoints
-
-The checkpoints in this repo correspond to the HuggingFace Transformers format. If you want to use our fork of [Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed) that the model was trained with, you'd want to use [this repo instead](https://huggingface.co/bigscience/bloom-optimizer-states).
-
+The checkpoints in this repo correspond to the HuggingFace Transformers format. If you want to use our fork of [Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed) that the model was trained with, you'd want to use [this repo](https://huggingface.co/bigscience/bloom-megatron-deepspeed) instead, which contains the full Megatron-DeepSpeed checkpoints and optimizer states.
 ---
-
 # Model Card Authors
 *Ordered roughly chronologically and by amount of time spent.*
 
```