EmbeddedLLM/Qwen3-Coder-480B-A35B-Instruct-FP8-Dynamic
480B
EmbeddedLLM/deepseek-r1-FP8-Dynamic
671B
EmbeddedLLM/Qwen2.5-1.5B-FP8-Dynamic
EmbeddedLLM/Qwen2.5-1.5B-Instruct-FP8-Dynamic
EmbeddedLLM/Qwen2.5-32B-Instruct-FP8-Dynamic
EmbeddedLLM/Qwen2.5-7B-Instruct-FP8-Dynamic
EmbeddedLLM/deepseekv3-lite-ci
EmbeddedLLM/Qwen_Qwen2.5-32B-Instruct-FP8-Dynamic
EmbeddedLLM/Llama-3.1-8B-Instruct-w_fp8_per_channel_sym
Text Generation
8B
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark
73B
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark
73B
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark
73B
EmbeddedLLM/ELLM_Star
EmbeddedLLM/bge-m3-int4-sym-ov
EmbeddedLLM/bge-m3-int4-ov
EmbeddedLLM/Qwen2.5-32B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-14B-Instruct-int4-sym-ov
EmbeddedLLM/vLLM-AMD-flash-attn-debug
EmbeddedLLM/Llama-Guard-3-1B-int4-sym-ov
EmbeddedLLM/Llama-3.2-1B-Instruct-int4-sym-ov
EmbeddedLLM/Llama-3.2-3B-Instruct-int4-sym-ov
EmbeddedLLM/Llama-Guard-3-1B-int4-asym-ov
EmbeddedLLM/Llama-3.2-1B-Instruct-int4-asym-ov
EmbeddedLLM/Llama-3.2-3B-Instruct-int4-asym-ov
EmbeddedLLM/Qwen2.5-7B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-3B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-1.5B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-0.5B-Instruct-int4-sym-ov
EmbeddedLLM/Llama-3.1-8B-Instruct-int4-asym-ov
EmbeddedLLM/Llama-3.1-70B-Instruct-int4-asym-ov