llama-3.1-8b-awq

Optimized llama-3.1-8b-awq for efficient inference.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gulf-inference/llama-3.1-8b-awq", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("gulf-inference/llama-3.1-8b-awq", trust_remote_code=True)
Downloads last month
17
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support