Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ISTA-DASLab
/
Llama-2-7b-AQLM-2Bit-1x16-hf
like
5
Text Generation
Transformers
Safetensors
llama
text-generation-inference
Inference Endpoints
aqlm
arxiv:
2401.06118
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
refs/pr/1
Llama-2-7b-AQLM-2Bit-1x16-hf
/
config.json
Commit History
try except flash-attn
f48478c
Andrei Panferov
commited on
Feb 6
newer inference
115e749
Andrei Panferov
commited on
Jan 20
new code
dfb8eb3
Andrei Panferov
commited on
Jan 18
inference and autoloading
5c0d7ef
Andrei Panferov
commited on
Jan 18
config
d1f8951
Andrei Panferov
commited on
Jan 18