compressa-ai/Llama-3-8B-Instruct-OmniQuant
Text Generation
Transformers
Safetensors
llama
llama3
omniquant
gptq
triton
conversational
text-generation-inference
Inference Endpoints
4-bit precision
License: llama3
main
Llama-3-8B-Instruct-OmniQuant/config.json
Commit History
add asymm quantized model, add two eos in code sample
6758e8a
Vasily Alexeev
committed on
Apr 24
add model weights and stuff
1a27dec
Vasily Alexeev
committed on
Apr 23