amd-quark/tiny-llama-fast-tokenizer
Updated
•
791
amd-quark/llama-tiny-fp8-quark-quant-method
0.0B
•
Updated
•
4.5k
amd-quark/llama-tiny-fp8-quant-method
0.0B
•
Updated
•
4.53k
amd-quark/quark-assets
Updated
amd-quark/quark-legacy-int8
0.0B
•
Updated
•
5
amd-quark/quark-legacy-fp8
0.0B
•
Updated
•
6
amd-quark/quark-legacy-awq
0.0B
•
Updated
•
13
amd-quark/dummy-config-awq
Updated
•
1.93k
amd-quark/llama-small-int4-per-group-sym-awq
0.0B
•
Updated
•
486
amd-quark/llama-tiny-int4-per-group-sym
0.0B
•
Updated
•
485
amd-quark/llama-tiny-w-fp8-a-fp8-o-fp8
0.0B
•
Updated
•
485
amd-quark/llama-tiny-w-fp8-a-fp8
0.0B
•
Updated
•
485
amd-quark/llama-tiny-w-int8-b-int8-per-tensor
0.0B
•
Updated
•
494
amd-quark/llama-tiny-w-int8-per-tensor
0.0B
•
Updated
•
485
amd-quark/test-qdq
Updated