Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
joshmiller656
/
Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
like
2
Text Generation
Transformers
Safetensors
nvidia/HelpSteer2
8 languages
llama
nemotron
awq
quantized
int4
conversational
text-generation-inference
Inference Endpoints
4-bit precision
arxiv:
2410.01257
arxiv:
2405.01481
arxiv:
2406.08673
License:
llama3.1
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
Commit History
Update README.md
b73d0f3
verified
joshmiller656
commited on
19 days ago
Update README.md
990545b
verified
joshmiller656
commited on
20 days ago
Update README.md
708a5ad
verified
joshmiller656
commited on
25 days ago
Update README.md
ef9ae12
verified
joshmiller656
commited on
26 days ago
Create README.md
ef0056e
verified
joshmiller656
commited on
26 days ago
Upload folder using huggingface_hub
372de0b
verified
joshmiller656
commited on
26 days ago
initial commit
4b1a3dd
verified
joshmiller656
commited on
26 days ago