Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ISTA-DASLab
/
Llama-2-70b-AQLM-4Bit-2x16-hf
like
0
Text Generation
Transformers
Safetensors
llama_aqlm
custom_code
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
No model card
New: Create and edit this model card directly on the website!
Contribute a Model Card
Downloads last month
5
Safetensors
Model size
18.2B params
Tensor type
F32
·
I16
·
Inference API
Text Generation
Examples
Compute
Model is too large to load in Inference API (serverless). To try the model, launch it on
Inference Endpoints (dedicated)
instead.
JSON Output
Maximize
Collection including
ISTA-DASLab/Llama-2-70b-AQLM-4Bit-2x16-hf
AQLM
Collection
AQLM quantized LLMs
•
20 items
•
Updated
May 3
•
24