Michael Goin PRO

mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Organizations

Neural Magic, garage-bAInd, Blog-explorers, Revel Labs, ZeroGPU Explorers, NM Testing, MLX Community, Social Post Explorers

mgoin's activity

New activity in neuralmagic/pixtral-12b-FP8-dynamic about 1 month ago

Update model card

#1 opened about 1 month ago by nm-research
New activity in nm-testing/llava-1.5-7b-hf-FP8-dynamic about 1 month ago

Oom with 24g vram

3
#1 opened 2 months ago by Klopez
New activity in meta-llama/Llama-3.1-405B-Instruct 4 months ago

8-kv-heads

4
#17 opened 4 months ago by ArthurZ
New activity in meta-llama/Llama-3.1-405B 4 months ago

8-kv-heads

3
#21 opened 4 months ago by ArthurZ

run with vllm

8
#4 opened 4 months ago by kuliev-vitaly