Saran
saran1999
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
answerdotai/ModernBERT-base:Loss = 0 and Gradient = NaN in ModernBERT Fine-Tuning for Regression
new activity
about 1 month ago
answerdotai/ModernBERT-base:nan or 0.0 loss when training with flash attention
new activity
about 1 month ago
answerdotai/ModernBERT-base:Loss = 0 and Gradient = NaN in ModernBERT Fine-Tuning for Regression
Organizations
None yet
saran1999's activity
Loss = 0 and Gradient = NaN in ModernBERT Fine-Tuning for Regression
4
#63 opened about 1 month ago
by
saran1999
nan or 0.0 loss when training with flash attention
16
#59 opened about 1 month ago
by
roadtoagi

Loss = 0 and Gradient = NaN in ModernBERT Fine-Tuning for Regression
4
#63 opened about 1 month ago
by
saran1999
nan or 0.0 loss when training with flash attention
16
#59 opened about 1 month ago
by
roadtoagi

Loss = 0 and Gradient = NaN in ModernBERT Fine-Tuning for Regression
4
#63 opened about 1 month ago
by
saran1999
nan or 0.0 loss when training with flash attention
16
#59 opened about 1 month ago
by
roadtoagi
