Speed

#1
by Borach - opened

What do you mean with 2.5 slower?

On a sequence of 512 it is 2.5 slower than scibert? Or a sequence of 4096 is 2.5 slower than scibert on 512?

Hi! sorry for missing this, I don't have numbers at hand anymore, but AFAIR that was the comparison for a sequence of 4096 for SciBERT long and 512 for SciBERT, so not that bad for the longformer version.

yorko changed discussion status to closed

Sign up or log in to comment