Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ISTA-DASLab
/
Meta-Llama-3-70B-AQLM-2Bit-1x16
like
14
Follow
IST Austria Distributed Algorithms and Systems Lab
51
Text Generation
Transformers
Safetensors
llama
facebook
meta
llama-3
conversational
text-generation-inference
Inference Endpoints
aqlm
arxiv:
2401.06118
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
e16ede7
Meta-Llama-3-70B-AQLM-2Bit-1x16
1 contributor
History:
6 commits
SpiridonSunRotator
Metric fix
e16ede7
verified
8 months ago
.gitattributes
1.52 kB
initial commit
8 months ago
README.md
688 Bytes
Metric fix
8 months ago
config.json
9.38 kB
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
generation_config.json
126 Bytes
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
model-00001-of-00005.safetensors
5 GB
LFS
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
model-00002-of-00005.safetensors
4.96 GB
LFS
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
model-00003-of-00005.safetensors
4.99 GB
LFS
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
model-00004-of-00005.safetensors
4.87 GB
LFS
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
model-00005-of-00005.safetensors
2.1 GB
LFS
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
model.safetensors.index.json
152 kB
Uploaded Meta-Llama-3-70B with AQLM 1x16 quantization
8 months ago
special_tokens_map.json
73 Bytes
Uploaded tokenizer
8 months ago
tokenizer.json
9.08 MB
Uploaded tokenizer
8 months ago
tokenizer_config.json
50.6 kB
Uploaded tokenizer
8 months ago