Clarifications for the "Evaluate" task

#20
by ArishkaBelovishka - opened

Hi,

I am a bit confused with the statement of the "Evaluate" track task. For what NLP task we should find the BLUE score? I suppose it has something to do with the translation task, but not sure.
Or do we need to implement our own BLUE scoring and incorporate it into the evaluation framework?

Hopefully, I do not break any rules of the interview by this question:)
Thank you!

Arina

Sign up or log in to comment