Aastha Varma
aastha6
AI & ML interests
LLM: quantization, fine-tuning, optimizations | NLP | Deep Learning
aastha6's activity
CUDA error for MAX_TOTAL_TOKENS = 8192
1
#5 opened 7 months ago by aastha6
Trying to deploy this model with vLLM in SageMaker
#2 opened 7 months ago by aastha6
Not able to launch using TGI in SageMaker
#11 opened 7 months ago by aastha6
Not able to deploy in SageMaker
2
#3 opened 10 months ago by aastha6
Code to shard model weights
#1 opened 10 months ago by aastha6