bloom-800m-zh / README.md
bolin22's picture
Update README.md
89e98de
metadata
license: bigscience-bloom-rail-1.0

The model is based on bigscience/bloom-1b1.

To reduce GPU memory usage, we pruned its vocabulary from 250880 to 46145 with Chinese corpus. So the total parameter is 800m now.