license: bigscience-bloom-rail-1.0

This is a custom version of the original BLOOM weights, repackaged so that it loads quickly with the DeepSpeed-Inference engine, which uses Tensor Parallelism. In this repo the tensors are split into 8 shards to target 8 GPUs.
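
To make the idea of sharding for Tensor Parallelism concrete, here is a minimal sketch in plain Python. It assumes a column-wise split of a toy weight matrix; the actual shard layout used in this repo and by DeepSpeed-Inference is not described here, and all function names below are hypothetical and not part of any library.

```python
# Illustrative only: column-wise tensor parallelism on a toy weight matrix.
# This is NOT DeepSpeed's actual shard layout or API; every name is hypothetical.

NUM_SHARDS = 8  # mirrors the 8 shards / 8 GPUs this repo targets


def split_columns(weight, num_shards):
    """Split a row-major matrix (list of rows) into num_shards column blocks."""
    cols = len(weight[0])
    if cols % num_shards != 0:
        raise ValueError("column count must be divisible by num_shards")
    width = cols // num_shards
    return [[row[i * width:(i + 1) * width] for row in weight]
            for i in range(num_shards)]


def matvec(x, weight):
    """Compute x @ weight for a vector x and a row-major matrix."""
    return [sum(x[r] * weight[r][c] for r in range(len(weight)))
            for c in range(len(weight[0]))]


def sharded_matvec(x, shards):
    """Each shard (one per hypothetical GPU) computes its output slice;
    the slices are concatenated in shard order."""
    out = []
    for shard in shards:
        out.extend(matvec(x, shard))
    return out


# Toy 4 x 16 weight matrix and a length-4 input vector.
weight = [[r * 16 + c for c in range(16)] for r in range(4)]
x = [1, 2, 3, 4]

shards = split_columns(weight, NUM_SHARDS)
full = matvec(x, weight)
parallel = sharded_matvec(x, shards)
```

Each shard holds only one eighth of the columns, yet concatenating the per-shard results reproduces the unsharded product. Pre-splitting the weights this way means each GPU can load just its own shard instead of reading and slicing the full checkpoint.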

The full BLOOM documentation is here

To use this repo, you can adapt the scripts found here to your needs (XXX: they are going to migrate to the HF Transformers code base soon, so this link will need to be updated once they have moved)