Michael Goin

mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Organizations

mgoin's activity

New activity in nvidia/Minitron-4B-Base 2 days ago

Where is Minitron-4B-Instruct?

1
#2 opened 2 days ago by mgoin
New activity in nvidia/Minitron-8B-Base 3 days ago

Error serving model

3
#2 opened 6 days ago by EvGUT

How to load this model?

1
#1 opened 25 days ago by Frz614
New activity in nm-testing/SparseLlama-3-8B-pruned_50.2of4-FP8 about 1 month ago

Update README.md

#1 opened about 1 month ago by alexmarques
New activity in neuralmagic/SparseLlama-3-8B-pruned_50.2of4 about 1 month ago

Update README.md

#1 opened about 1 month ago by alexmarques
New activity in neuralmagic/Qwen2-72B-Instruct-FP8 about 2 months ago

Update README.md

#1 opened about 2 months ago by abhinavnmagic
New activity in neuralmagic/Mixtral-8x7B-Instruct-v0.1-FP8 about 2 months ago

Update README.md

#1 opened about 2 months ago by abhinavnmagic
New activity in neuralmagic/Meta-Llama-3-8B-Instruct-FP8 about 2 months ago

Update README.md

#2 opened about 2 months ago by abhinavnmagic
New activity in neuralmagic/Meta-Llama-3-70B-Instruct-FP8 about 2 months ago

Create README.md

#1 opened about 2 months ago by abhinavnmagic
New activity in neuralmagic/Meta-Llama-3-8B-Instruct-FP8 about 2 months ago

Fails to run with nm-vllm

1
#1 opened 3 months ago by clintonruairi
New activity in mgoin/ultrachat_2k 2 months ago
New activity in mgoin/Meta-Llama-3-70B-Instruct-Marlin 3 months ago

What is Marlin?

2
#1 opened 3 months ago by Samvanity
New activity in mgoin/Meta-Llama-3-8B-Instruct-Marlin 3 months ago

Inference Issues

7
#1 opened 3 months ago by qeternity
New activity in neuralmagic/Llama-2-7b-evolcodealpaca 4 months ago

Update README.md

#1 opened 4 months ago by abhinavnmagic

Update README.md

#1 opened 4 months ago by abhinavnmagic

Update README.md

#1 opened 4 months ago by abhinavnmagic

Update README.md

#1 opened 4 months ago by alexmarques

Update README.md

#1 opened 4 months ago by alexmarques
New activity in neuralmagic/Llama-2-7b-pruned70-retrained 4 months ago

Update README.md

#1 opened 4 months ago by alexmarques
New activity in neuralmagic/Llama-2-7b-pruned50-retrained 4 months ago

Update README.md

#1 opened 4 months ago by alexmarques
New activity in reciprocate/llama2-7b-gsm8k 10 months ago

Create README.md

#2 opened 10 months ago by mgoin