csjiaya zwpride-iquestlab commited on
Commit
16351f7
·
1 Parent(s): b42532d

Update README.md (#2)

Browse files

- Update README.md (60450154b65fe124ce184f1dd60198f3445a38eb)


Co-authored-by: wei zhang <zwpride-iquestlab@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -137,6 +137,13 @@ outputs = model.generate(
137
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
138
  ```
139
 
 
 
 
 
 
 
 
140
  ### Fill-in-the-Middle (FIM)
141
 
142
  InCoder-32B supports FIM completion for code infilling tasks:
 
137
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
138
  ```
139
 
140
+ ### Deployment with vLLM
141
+ For production deployment, you can use vLLM to create an OpenAI-compatible API endpoint.
142
+
143
+ ```
144
+ vllm serve Multilingual-Multimodal-NLP/IndustrialCoder --tensor-parallel-size 8
145
+ ```
146
+
147
  ### Fill-in-the-Middle (FIM)
148
 
149
  InCoder-32B supports FIM completion for code infilling tasks: