xmadai/Llama-3.1-Nemotron-70B-Instruct-xMADai-INT4
Tags: Text Generation, Transformers, llama, conversational, Inference Endpoints, 4-bit precision, gptq
arXiv: 2407.10032
License: llama3.1
Branch: main
1 contributor
History: 5 commits
Latest commit: "Update README.md" by onebitquantized (8ad40ea, verified), 23 days ago
File                              Size         Scan  Last commit                         Updated
.gitattributes                    1.52 kB      Safe  initial commit                      23 days ago
README.md                         3.61 kB      Safe  Update README.md                    23 days ago
config.json                       1.3 kB       Safe  Upload of AutoGPTQ quantized model  23 days ago
gptq_model-4bit-128g.safetensors  39.8 GB LFS  Safe  Upload of AutoGPTQ quantized model  23 days ago
quantize_config.json              310 Bytes    Safe  Upload of AutoGPTQ quantized model  23 days ago
special_tokens_map.json           296 Bytes    Safe  Upload tokenizer                    23 days ago
tokenizer.json                    9.09 MB      Safe  Upload tokenizer                    23 days ago
tokenizer_config.json             55.3 kB      Safe  Upload tokenizer                    23 days ago