Edit model card

8.1 bpw exl2 quant of Meta-Llama-3-70B-Instruct

Downloads last month
3
Inference API
Input a message to start chatting with llmixer/Meta-Llama-3-70B-Instruct-8.0bpw-h8-exl2.
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.