Update README.md (#2)

- Update README.md (60450154b65fe124ce184f1dd60198f3445a38eb)

Co-authored-by: wei zhang <zwpride-iquestlab@users.noreply.huggingface.co>

Files changed (1) hide show

README.md CHANGED Viewed

@@ -137,6 +137,13 @@ outputs = model.generate(
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
 ### Fill-in-the-Middle (FIM)
 InCoder-32B supports FIM completion for code infilling tasks:

 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
+### Deployment with vLLM
+For production deployment, you can use vLLM to create an OpenAI-compatible API endpoint.
+```
+vllm serve Multilingual-Multimodal-NLP/IndustrialCoder --tensor-parallel-size 8
+```
 ### Fill-in-the-Middle (FIM)
 InCoder-32B supports FIM completion for code infilling tasks: