Performance

by KnutJaegersberg - opened Jan 16

Jan 16

Looks good.

Currently I play with 2 other fine tunes, as my previous qwen fine tunes lowered its performance. I think I might have overfit them. Perhaps the settings in autotrain-advanced are too limited.

KnutJaegersberg

Jan 16

•

edited Jan 16

I'm trying to understand what I did that affected performance negatively.

KnutJaegersberg

Jan 16

I think the loss as reported by autotrain advanced was 0.2 or so. Not sure, but I guess that's the training loss. That sounds rather low, not?

KnutJaegersberg

Jan 16

not sure about the learning rate. maybe a memetic learning rate works better

qnguyen3

Owner Jan 17

I think the loss as reported by autotrain advanced was 0.2 or so. Not sure, but I guess that's the training loss. That sounds rather low, not?

This is probably because you have a high learning rate. Usually for dataset that has more than 500k samples, i would do 2e-5.

KnutJaegersberg

Jan 17

Thanks :)

KnutJaegersberg changed discussion status to closed Jan 17

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment