Speed
#1, opened by Borach
What do you mean by "2.5x slower"?
Is it 2.5x slower than SciBERT when both run on a sequence of 512 tokens? Or is it 2.5x slower on a sequence of 4096 tokens than SciBERT is on 512?
Hi! Sorry for missing this. I don't have the numbers at hand anymore, but AFAIR the comparison was a sequence of 4096 tokens for SciBERT long versus 512 tokens for SciBERT, so that's not bad for the Longformer version.
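To put a rough number on "not that bad", here is a back-of-the-envelope check in Python. It only uses the figures recalled above (4096 tokens, 512 tokens, 2.5x slower), which are from memory rather than fresh measurements, and the function name `relative_per_token_cost` is made up for this illustration.

```python
def relative_per_token_cost(long_len: int, short_len: int, slowdown: float) -> float:
    """Per-token time of the long model divided by per-token time of the short model.

    `slowdown` is how many times longer one forward pass of the long model
    (on `long_len` tokens) takes than one pass of the short model
    (on `short_len` tokens).
    """
    return slowdown * short_len / long_len


# Figures recalled in this thread (assumed, not re-measured):
ratio = relative_per_token_cost(long_len=4096, short_len=512, slowdown=2.5)
print(ratio)  # 0.3125
```

Under those assumptions, one pass covers 8x as many tokens for 2.5x the time, so the per-token cost comes out at roughly a third of SciBERT's.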
yorko changed discussion status to closed