Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Infinirc
/
Infinirc-Llama3-8B-4bit-AWQ-GEMM-Beta
like
1
Text Generation
Transformers
Safetensors
Chinese
English
llama
zhtw
conversational
text-generation-inference
Inference Endpoints
4-bit precision
awq
License:
llama3
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Infinirc-Llama3-8B-4bit-AWQ-GEMM-Beta
1 contributor
History:
3 commits
RickyChen-Infinirc
Update README.md
ba9d28a
verified
12 days ago
.gitattributes
1.52 kB
initial commit
18 days ago
README.md
83 Bytes
Update README.md
12 days ago
config.json
927 Bytes
Upload model
18 days ago
generation_config.json
142 Bytes
Upload model
18 days ago
model-00001-of-00002.safetensors
4.68 GB
LFS
Upload model
18 days ago
model-00002-of-00002.safetensors
1.05 GB
LFS
Upload model
18 days ago
model.safetensors.index.json
63.5 kB
Upload model
18 days ago
special_tokens_map.json
449 Bytes
Upload model
18 days ago
tokenizer.json
9.08 MB
Upload model
18 days ago
tokenizer_config.json
51.1 kB
Upload model
18 days ago