Merge branch 'main' of https://huggingface.co/Qwen/Qwen-7B-Chat-Int4 into pr/6 7fa16ca wangzihan99 commited on Dec 4, 2023
Improve performance witih Triton 2.0 and adapt to latest Qwen releases. 5b354c8 wangzihan99 commited on Dec 1, 2023
Merge branch 'main' of https://huggingface.co/Qwen/Qwen-7B-Chat-Int4 into pr/6 74a1327 wangzihan99 commited on Dec 1, 2023
Add fused ApplyRoPE and RMSNorm kernels written in OpenAI Triton. 89a2cd3 wangzihan99 commited on Nov 15, 2023