
Tips to speed up 13k x 13k BLEU computations?

by shaily99 - opened Feb 3, 2024

Feb 3, 2024

I am using evaluate's BLEU implementation to run a 13k x 13k pairwise similarity computation. The code has been running for over 20 hours and is still going. Any tips for speeding it up? Is there a way to parallelize the process?
I also noticed that, since evaluate takes the tokenizer as an argument in the function call, it tokenizes the text on every call. That could be avoided by tokenizing everything once. Is there a way to do this, or does evaluate already cache tokenization automatically?
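For reference, the "tokenize once, then parallelize" idea can be sketched in plain Python. This is only a rough sketch under assumptions, not evaluate's implementation: it uses whitespace tokenization, a single reference per hypothesis, and unsmoothed sentence-level BLEU up to 4-grams, so scores may differ from what evaluate reports. All function names here (`preprocess`, `bleu_from_cache`, `pairwise_bleu`) are hypothetical helpers made up for illustration.

```python
import math
from collections import Counter
from concurrent.futures import ProcessPoolExecutor

MAX_N = 4  # highest n-gram order (assumption: standard BLEU-4)


def ngram_counts(tokens, max_n=MAX_N):
    """Return one Counter of n-grams per order 1..max_n."""
    return [
        Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
        for n in range(1, max_n + 1)
    ]


def preprocess(texts, tokenize=str.split):
    """Tokenize every text exactly once and cache (length, n-gram counts)."""
    cache = []
    for text in texts:
        tokens = tokenize(text)
        cache.append((len(tokens), ngram_counts(tokens)))
    return cache


def bleu_from_cache(hyp, ref):
    """Unsmoothed sentence-level BLEU of one cached hypothesis vs. one cached reference."""
    hyp_len, hyp_counts = hyp
    ref_len, ref_counts = ref
    if hyp_len == 0:
        return 0.0
    log_precision = 0.0
    for h, r in zip(hyp_counts, ref_counts):
        total = sum(h.values())
        overlap = sum((h & r).values())  # Counter intersection = clipped matches
        if total == 0 or overlap == 0:
            return 0.0  # no smoothing: any zero precision gives a zero score
        log_precision += math.log(overlap / total)
    brevity = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity * math.exp(log_precision / len(hyp_counts))


_CACHE = None  # per-worker copy of the cache, set once by the initializer


def _init_worker(cache):
    global _CACHE
    _CACHE = cache


def _score_row(i):
    """Score hypothesis i against every cached text."""
    hyp = _CACHE[i]
    return [bleu_from_cache(hyp, ref) for ref in _CACHE]


def pairwise_bleu(texts, workers=1):
    """Return the full len(texts) x len(texts) score matrix as nested lists."""
    cache = preprocess(texts)
    if workers <= 1:
        return [[bleu_from_cache(h, r) for r in cache] for h in cache]
    with ProcessPoolExecutor(
        max_workers=workers, initializer=_init_worker, initargs=(cache,)
    ) as pool:
        return list(pool.map(_score_row, range(len(cache)), chunksize=64))
```

With `workers > 1`, the call should sit under an `if __name__ == "__main__":` guard on platforms that spawn worker processes. Note that BLEU is not symmetric, so all 13k x 13k ordered pairs still have to be scored; the saving comes from never re-tokenizing or re-counting n-grams, and from spreading rows across cores.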

MathewShen

Apr 23, 2024

•

edited Apr 24, 2024

Hi @shaily99! I just built a BLEU calculation package (bleuscore) that aims to speed up BLEU score calculation; you can find it on GitHub.

According to my simple benchmark (a comprehensive benchmark is coming soon), it is many times faster than HF evaluate, so maybe you can give it a try~

PS: Is the 13k x 13k dataset publicly available? I want to benchmark on a large real dataset, but I haven't found a suitable one yet.

shaily99

Apr 23, 2024

•

edited Apr 25, 2024

Thanks, I'll take a look.
Re the benchmark: it isn't out yet, hopefully soon.

MathewShen

Aug 30, 2024

Sorry for the late reply. I have pushed the benchmark results to GitHub; you can check them out in the bleuscore repo.
TL;DR: we got more than a 10x speedup when the corpus size is beyond 100K.
