arxiv:2409.00492
Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a model 8 days ago: mgoin/Qwen3.6-35B-A3B-2Bit-GSQ-ct
published a model 8 days ago: mgoin/Qwen3.6-35B-A3B-2Bit-GSQ-ct
new activity 27 days ago: poolside/Laguna-XS.2-INT4 (Add base_model)