AI & ML interests

Low-bit Quantization of Large Language Models (LLMs)

Recent Activity

HaoranChu  updated a model about 1 month ago
Efficient-ML/GPTQ-for-Qwen3
HaoranChu  updated a collection about 1 month ago
Qwen3-Quantization
View all activity

Efficient-ML 's models 52