Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen-14B
like
210
Follow
Qwen
32k
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
arxiv:
2309.16609
Model card
Files
Files and versions
xet
Community
12
Train
Use this model
03b4ead
Qwen-14B
/
configuration_qwen.py
Commit History
add softmax_in_fp32
5e88027
yangapku
commited on
Sep 28, 2023
update kvcache
319ed0f
yangapku
commited on
Sep 25, 2023
upload model
5cde1bb
yangapku
commited on
Sep 24, 2023