Aniket Maurya

aniketmaurya

AI & ML interests

Computer vision, NLP, MLOps

Recent Activity

upvoted an article 1 day ago
Tensor Parallelism
updated a model 11 days ago
aniketmaurya/receipt-model-2025
liked a model 11 days ago
aniketmaurya/receipt-model-2025

Organizations

Gradio-Themes-Party, ICML2023, ZeroGPU Explorers

aniketmaurya's activity

replied to singhsidhukuldeep's post 12 days ago

Woohoo, thanks for checking out LitServe, @singhsidhukuldeep! LitServe now has an OpenAI API-compatible endpoint, and you can also serve an LLM using the vLLM engine with LitServe, so you get both speed and flexibility.

reacted to singhsidhukuldeep's post with 🚀 12 days ago
Just tried LitServe from the good folks at @LightningAI !

Between llama.cpp and vLLM, there is a small gap where a few large models are not deployable!

That's where LitServe comes in!

LitServe is a high-throughput serving engine for AI models built on FastAPI.

Yes, built on FastAPI. That's where the advantage and the issue lie.

It's extremely flexible and supports multi-modality and a variety of models out of the box.

But in my testing, it lags far behind in speed compared to vLLM.

Also, no OpenAI API-compatible endpoint is available as of now.

But as we move to multi-modal models and agents, this serves as a good starting point. However, it’s got to become faster...

GitHub: https://github.com/Lightning-AI/LitServe
upvoted an article 9 months ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model
