Anmol Reddy
molereddy
AI & ML interests
Speech recognition
Recent Activity
new activity
about 2 months ago
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a16:Poorer performance than W8A8
new activity
2 months ago
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a16:Is this the standard GPTQ quantization?
Organizations
None yet
molereddy's activity
Poorer performance than W8A8
#6 opened about 2 months ago
by
molereddy
How was the quantization performed - do you have a recipe?
#1 opened 2 months ago
by
molereddy
Is this the standard GPTQ quantization?
1
#5 opened 2 months ago
by
molereddy
Model weights are not loaded
4
#3 opened 5 months ago
by
MarvelousMouse