arxiv:2404.06773
Mengzhao Chen
ChenMnZ
AI & ML interests
model compression
Organizations
None yet
Papers
2
Models
31
ChenMnZ/Mixtral-8x7B-Instruct-v0.1-OmniQuantv1-w4a16g128 • Text Generation • 4 • 1
ChenMnZ/Mixtral-8x7B-v0.1-OmniQuantv2-w4a16g128 • Text Generation • 6 • 1
ChenMnZ/Mixtral-8x7B-v0.1-OmniQuantv1-w4a16g128 • Text Generation • 12
ChenMnZ/Llama-2-13b-chat-omniquant-w3a16g128asym
ChenMnZ/Llama-2-7b-chat-omniquant-w3a16g128asym
ChenMnZ/OmniQuant • 10
ChenMnZ/falcon-180b-omniquant-w3a16g512 • Text Generation • 5 • 3
ChenMnZ/falcon-7b-omniquant-w3a16g64 • Text Generation • 2
ChenMnZ/Llama-2-13b-chat-omniquant-w2a16g128asym_2
ChenMnZ/Llama-2-13b-chat-omniquant-w3a16g128asym_2
Datasets
None public yet