The AMD Instinct MI50 (~$110) is surprisingly fast for inference with quantized models. This runs Llama 3.1 8B Q8 with llama.cpp: https://huggingface.co/spaces/DevQuasar/Mi50

A little blog post about the hardware: http://devquasar.com/uncategorized/amd-radeon-instinct-mi50-cheap-inference/