bloom-800m-zh / README.md
bolin22's picture
Update README.md
89e98de
---
license: bigscience-bloom-rail-1.0
---
The model is based on bigscience/bloom-1b1.
To reduce GPU memory usage, we pruned its vocabulary from 250880 to 46145 with Chinese corpus. So the total parameter is 800m now.