bert4torch_config / Qwen1.5-1.8B-Chat
Tongjilibo's picture
修改flash_attention为_attn_implementation,增加deepseek
35d10b1