llama-3-neural-chat-v1-8b-AWQ / special_tokens_map.json
Ubuntu
adding AWQ model
9d58ca5
{
  "bos_token": {
    "content": "<|begin_of_text|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "<|end_of_text|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": {
    "content": "<|end_of_text|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
}
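
The map above assigns the same string to `pad_token` and `eos_token`. The following is a minimal sketch, using only the Python standard library, of how one might read this map and check that overlap. The function name `summarize_special_tokens` and the inlined JSON string are illustrative assumptions, not part of the model repository; in practice the JSON would be read from `special_tokens_map.json` on disk.

```python
import json

# Inlined copy of the map shown above (assumption: normally loaded from
# special_tokens_map.json rather than embedded as a string).
SPECIAL_TOKENS_MAP = """
{
  "bos_token": {"content": "<|begin_of_text|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false},
  "eos_token": {"content": "<|end_of_text|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false},
  "pad_token": {"content": "<|end_of_text|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false}
}
"""


def summarize_special_tokens(raw: str) -> dict[str, str]:
    """Hypothetical helper: map each role (e.g. "bos_token") to its token string."""
    data = json.loads(raw)
    return {role: spec["content"] for role, spec in data.items()}


tokens = summarize_special_tokens(SPECIAL_TOKENS_MAP)

# True when padding reuses the end-of-text string, as it does in this file.
pad_is_eos = tokens["pad_token"] == tokens["eos_token"]

print(tokens)
print("pad token reuses eos token:", pad_is_eos)
```

Because padding and end-of-text share one string here, code that batches sequences would typically need an explicit attention mask to tell padded positions apart from a genuine end of sequence.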