bert4torch_config / Qwen2-7B-Instruct
Tongjilibo
Change flash_attention to _attn_implementation, add deepseek
35d10b1