neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8-dynamic
Neural Magic
Text Generation
Transformers
Safetensors
8 languages
llama
fp8
vllm
conversational
text-generation-inference
Inference Endpoints
compressed-tensors
License: llama3.1
Commit History
Update README.md (0906e2c, verified), alexmarques committed on Oct 10
Updated compression_config to quantization_config (abca298, verified), mgoin committed on Oct 9
Update README.md (02ba8ec, verified), Lin-K76 committed on Aug 23
Upload folder using huggingface_hub (97ae4de, verified), Lin-K76 committed on Aug 22
Update README.md (2bedb1f, verified), alexmarques committed on Aug 13
Update README.md (8d6e926, verified), alexmarques committed on Jul 30
Update README.md (10b7edc, verified), Lin-K76 committed on Jul 27
Update README.md (4d1a63a, verified), Lin-K76 committed on Jul 26
Update README.md (9f6bfab, verified), Lin-K76 committed on Jul 26
Upload folder using huggingface_hub (3fe49e3, verified), Lin-K76 committed on Jul 26
Upload folder using huggingface_hub (544d255, verified), Lin-K76 committed on Jul 26
Update README.md (c96bab5, verified), Lin-K76 committed on Jul 25
Update README.md (a0a38dd, verified), Lin-K76 committed on Jul 25
Update README.md (8006c32, verified), Lin-K76 committed on Jul 23
Update README.md (223e6f5, verified), Lin-K76 committed on Jul 23
Create README.md (8193293, verified), Lin-K76 committed on Jul 23
Upload folder using huggingface_hub (c8f8c9b, verified), Lin-K76 committed on Jul 23
initial commit (aaace6d, verified), Lin-K76 committed on Jul 23