Update README.md
README.md
````diff
@@ -17,7 +17,7 @@ Compared with the state-of-the-art opensource language models, including the pre
 
 For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2/) and [GitHub](https://github.com/QwenLM/Qwen2).
 
-In this repo, we provide `fp16` model and quantized models in the GGUF formats, including `
+In this repo, we provide `fp16` model and quantized models in the GGUF formats, including `q5_0`, `q5_k_m`, `q6_k` and `q8_0`.
 
 ## Model Details
 Qwen2 is a language model series including decoder language models of different model sizes. For each size, we release the base language model and the aligned chat model. It is based on the Transformer architecture with SwiGLU activation, attention QKV bias, group query attention, etc. Additionally, we have an improved tokenizer adaptive to multiple natural languages and codes.
@@ -50,4 +50,4 @@ If you find our work helpful, feel free to give us a cite.
     title={Qwen2 Technical Report},
     year={2024}
 }
-```
+```
````
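The line completed by this change lists the quantization levels shipped alongside the `fp16` weights: `q5_0`, `q5_k_m`, `q6_k` and `q8_0`. For illustration, a GGUF file from this repo could be loaded with a library such as llama-cpp-python; the sketch below is a minimal example under that assumption, and the model file name in it is hypothetical (substitute the actual file from the repo).

```python
# Minimal sketch using llama-cpp-python (not referenced in the README itself).
# The model file name is hypothetical; substitute the GGUF file from this repo.
from llama_cpp import Llama

llm = Llama(
    model_path="qwen2-instruct-q5_k_m.gguf",  # hypothetical file name
    n_ctx=4096,                               # context window to allocate
)

# Instruct models expect a chat format; create_chat_completion applies the
# chat template stored in the GGUF metadata.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Briefly introduce yourself."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```

As a rule of thumb, the lower quantization levels such as `q5_0` trade some accuracy for a smaller memory footprint, while `q8_0` stays closest to the `fp16` reference.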