openai rich accelerate vllm