Keerthan Vasist

kvasist
ยท

AI & ML interests

None yet

Recent Activity

Organizations

None yet

kvasist's activity

view reply

Great article. I have been trying to deploy deepseek-ai/DeepSeek-R1-Distill-Qwen-32B on inferentia with a context window higher than 4096 (let's say MAX_TOTAL_TOKENS=8192), but it seems there is no pre-compiled model for that. It would be great if you could add instructions to compile these models, that would be great.