Article • Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints • May 1 • 69
LLaMA3-Quantization Collection • This is the official quantized models collection of “How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study” • 9 items • Updated Apr 23 • 4