second-state
/

CodeQwen1.5-7B-Chat-GGUF

Text Generation

text-generation-inference

Model card Files Files and versions Community

apepkuss79 commited on May 26

Commit

6a7fab9

•

1 Parent(s): 2f7cca8

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -55,7 +55,7 @@ tags:
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:CodeQwen1.5-7B-Chat-Q5_K_M.gguf \
     llama-api-server.wasm \
     --prompt-template chatml
-    --context-size 4096
     --model-name CodeQwen1.5-7B-Chat
   ```
@@ -65,7 +65,7 @@ tags:
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:CodeQwen1.5-7B-Chat-Q5_K_M.gguf \
     llama-chat.wasm \
     --prompt-template chatml \
-    --ctx-size 4096
   ```
 ## Quantized GGUF Models

   wasmedge --dir .:. --nn-preload default:GGML:AUTO:CodeQwen1.5-7B-Chat-Q5_K_M.gguf \
     llama-api-server.wasm \
     --prompt-template chatml
+    --context-size 64000
     --model-name CodeQwen1.5-7B-Chat
   ```
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:CodeQwen1.5-7B-Chat-Q5_K_M.gguf \
     llama-chat.wasm \
     --prompt-template chatml \
+    --ctx-size 64000
   ```
 ## Quantized GGUF Models