Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-1_8B-Chat-Int8
like
4
Follow
Qwen
1,842
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
8-bit precision
gptq
arxiv:
2309.16609
arxiv:
2305.08322
arxiv:
2009.03300
Model card
Files
Files and versions
Community
2
Train
Use this model
5b261f7
Qwen-1_8B-Chat-Int8
1 contributor
History:
6 commits
xingzhang
update modeling_qwen.py
5b261f7
11 months ago
assets
update
12 months ago
examples
update
12 months ago
.gitattributes
1.52 kB
initial commit
12 months ago
LICENSE
7.28 kB
update
12 months ago
NOTICE
15.3 kB
update
12 months ago
README.md
24.2 kB
update readme
12 months ago
cache_autogptq_cuda_256.cpp
8.4 kB
init model
12 months ago
cache_autogptq_cuda_kernel_256.cu
52 kB
init model
12 months ago
config.json
1.2 kB
update
12 months ago
configuration_qwen.py
2.35 kB
init model
12 months ago
cpp_kernels.py
1.92 kB
init model
12 months ago
generation_config.json
249 Bytes
init model
12 months ago
model.safetensors
2.49 GB
LFS
init model
12 months ago
modeling_qwen.py
55.6 kB
update modeling_qwen.py
11 months ago
quantize_config.json
214 Bytes
update
12 months ago
qwen.tiktoken
2.56 MB
init model
12 months ago
qwen_generation_utils.py
14.6 kB
init model
12 months ago
tokenization_qwen.py
9.62 kB
init model
12 months ago
tokenizer_config.json
173 Bytes
init model
12 months ago