compressa-ai/Llama-3-8B-Instruct-OmniQuant
Text Generation
Transformers
Safetensors
llama
llama3
omniquant
gptq
triton
conversational
text-generation-inference
Inference Endpoints
4-bit precision
License: llama3
Commit History for README.md (branch: main)
add asymm quantized model, add two eos in code sample
6758e8a
Vasily Alexeev
committed on
Apr 24
refine table titles
7807999
Vasily Alexeev
committed on
Apr 23
add metrics and examples in readme
f7750ae
Vasily Alexeev
committed on
Apr 23
initial commit
98e482b
verified
Alvant
committed on
Apr 23