Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-7B-Chat-Int4
like
67
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
4-bit precision
gptq
5 papers
License:
tongyi-qianwen-license-agreement (other)
Model card
Files
Files and versions
Community
12
Train
Use this model
5bfdae9
Qwen-7B-Chat-Int4
2 contributors
History:
23 commits
yangapku
add kernel file check in modeling_qwen.py
5bfdae9
9 months ago
assets
update modeling_qwen.py
10 months ago
.gitattributes
1.59 kB
update model
10 months ago
LICENSE
6.9 kB
update readme
10 months ago
NOTICE
2.7 kB
update readme
10 months ago
README.md
30.6 kB
update int8 quantization info
10 months ago
cache_autogptq_cuda_256.cpp
8.4 kB
update kernels
10 months ago
cache_autogptq_cuda_kernel_256.cu
52 kB
update kernels
10 months ago
config.json
1.2 kB
update model
10 months ago
configuration_qwen.py
2.35 kB
softmax_in_fp32
10 months ago
cpp_kernels.py
1.92 kB
update kernels
10 months ago
generation_config.json
249 Bytes
update default generate hyperparams
10 months ago
model-00001-of-00003.safetensors
2.04 GB
LFS
update model
10 months ago
model-00002-of-00003.safetensors
2.05 GB
LFS
update model
10 months ago
model-00003-of-00003.safetensors
1.77 GB
LFS
update model
10 months ago
model.safetensors.index.json
65.7 kB
update model
10 months ago
modeling_qwen.py
58.5 kB
add kernel file check in modeling_qwen.py
9 months ago
quantize_config.json
214 Bytes
update model
10 months ago
qwen.tiktoken
2.56 MB
update model
10 months ago
qwen_generation_utils.py
14.6 kB
update model
10 months ago
tokenization_qwen.py
9.62 kB
update tokenization.py
10 months ago
tokenizer_config.json
173 Bytes
format
10 months ago