mpt-7b-storywriter-4bit-128g / generation_config.json
OccamRazor's picture
4-bit quantization of MPT
f706ac0
{
"_from_model_config": true,
"transformers_version": "4.28.1",
"use_cache": false
}