Q3OT-FP8 / generation_config.json
TheClusterDev's picture
Compressed LLM
3a46534 verified
raw
history blame
178 Bytes
{
"do_sample": true,
"enable_audio_output": false,
"max_new_tokens": 32768,
"temperature": 0.6,
"top_k": 20,
"top_p": 0.95,
"transformers_version": "4.57.0.dev0"
}