Lower R for smaller model size?
#2 by wsxiaoys - opened
Thanks for the amazing work!
I noticed most of the LoRA models you shared are ~150 MB, which suggests you're training with a large rank (r = 128?).
Based on the discussion here: https://github.com/cloneofsimo/lora/discussions/37, LoRA seems to generalize well with r = 4 or 8.
Have you run experiments with a lower r like 4 or 8, which would bring the model size down to around 10 MB? That would make runtime model switching and sharing much easier. A rough scaling sketch is below.
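For a rough sense of the numbers: each adapted d_out × d_in layer stores two LoRA matrices, B (d_out × r) and A (r × d_in), i.e. r · (d_in + d_out) parameters, so adapter size grows linearly with rank. A minimal back-of-the-envelope sketch, assuming the ~150 MB at r = 128 observed above as the baseline and pure linear scaling (ignoring any fixed overhead in the file format):

```python
# LoRA adapter size scales linearly with rank r: each adapted layer
# stores B (d_out x r) and A (r x d_in), i.e. r * (d_in + d_out) params.
# The 150 MB / r=128 baseline is taken from the observation above;
# everything else follows by proportion.

BASE_RANK = 128
BASE_SIZE_MB = 150.0  # observed adapter size at r=128

def estimated_size_mb(r: int) -> float:
    """Estimated adapter size at rank r, assuming pure linear scaling."""
    return BASE_SIZE_MB * r / BASE_RANK

for r in (128, 64, 8, 4):
    print(f"r={r:>3}: ~{estimated_size_mb(r):.1f} MB")
# r=128: ~150.0 MB, r= 64: ~75.0 MB, r=  8: ~9.4 MB, r=  4: ~4.7 MB
```

Under that assumption, r = 8 lands right around the ~10 MB mark mentioned above.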
Not really; I'll wait for someone else to run those tests. Right now I'm busy enough with the work I'm already doing.
YoungMasterFromSect changed discussion status to closed