attention-tracker/configs/model_configs/qwen2-attn_config.json
johnsonhung906
add code for qwen
24628d9
{
    "model_info": {
        "provider": "attn-hf",
        "name": "qwen-attn",
        "model_id": "Qwen/Qwen2-1.5B-Instruct"
    },
    "params": {
        "temperature": 0.1,
        "max_output_tokens": 32,
        "important_heads": [[11, 8], [12, 8], [14, 10], [19, 7]]
    }
}
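
A minimal sketch of how a config like this might be loaded and sanity-checked in Python. The function names `load_attn_config` and `check_head_bounds` are hypothetical and not part of the repository. Reading each `important_heads` entry as a `[layer_index, head_index]` pair is an assumption; the file itself does not say which element is which.

```python
import json

# Config text copied from the file above so the sketch is self-contained.
CONFIG_TEXT = """
{
    "model_info": {
        "provider": "attn-hf",
        "name": "qwen-attn",
        "model_id": "Qwen/Qwen2-1.5B-Instruct"
    },
    "params": {
        "temperature": 0.1,
        "max_output_tokens": 32,
        "important_heads": [[11, 8], [12, 8], [14, 10], [19, 7]]
    }
}
"""


def load_attn_config(text):
    """Parse the config and return (model_info, params, heads).

    Hypothetical helper. `heads` is a list of 2-tuples of
    non-negative integers taken from params["important_heads"].
    """
    cfg = json.loads(text)
    info, params = cfg["model_info"], cfg["params"]
    heads = [tuple(pair) for pair in params["important_heads"]]
    for pair in heads:
        if len(pair) != 2 or not all(isinstance(i, int) and i >= 0 for i in pair):
            raise ValueError(f"expected a pair of non-negative ints, got {pair!r}")
    return info, params, heads


def check_head_bounds(heads, num_layers, num_heads):
    """Return the pairs that fall outside the model's dimensions.

    Assumes (layer_index, head_index) order, which the config
    does not state explicitly.
    """
    return [(l, h) for l, h in heads if l >= num_layers or h >= num_heads]


info, params, heads = load_attn_config(CONFIG_TEXT)
print(info["model_id"], heads)
```

The layer and head counts passed to `check_head_bounds` would come from the model's own configuration (in Hugging Face configs these are typically `num_hidden_layers` and `num_attention_heads`). An empty result means every listed pair is in range under the assumed ordering.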