bert4torch_config / Qwen1.5-0.5B-Chat
Tongjilibo's picture
修改flash_attention为_attn_implementation,增加deepseek
35d10b1