Why Bloom so bad?

#85
by shanytc - opened
BigScience Workshop org

Hi there!

You should switch to greedy decoding instead of nucleus sampling, as noted in the notice in the screenshot you shared:

Switch to "greedy" for more accurate completion e.g. math/history/translations (but which may be repetitive/less inventive)

It's also usually advisable to use longer contexts.

How can we use the "greedy" method when generating text with a pipeline?

BigScience Workshop org

@Neeraj17 You can pass parameters to the generate function as keyword arguments to your pipeline call. For greedy decoding, you would have to pass do_sample=False.
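A minimal sketch of this, assuming the `bigscience/bloom-560m` checkpoint (any BLOOM model name works here):

```python
from transformers import pipeline

# Load a text-generation pipeline with a BLOOM checkpoint
# (bigscience/bloom-560m is the smallest public one; swap in the size you use).
generator = pipeline("text-generation", model="bigscience/bloom-560m")

# Extra keyword arguments on the call are forwarded to model.generate().
# do_sample=False disables (nucleus) sampling, giving greedy decoding.
result = generator(
    "The capital of France is",
    do_sample=False,       # greedy decoding instead of nucleus sampling
    max_new_tokens=10,
)
print(result[0]["generated_text"])
```

Greedy decoding is deterministic, so repeated calls with the same prompt return the same completion.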

christopher changed discussion status to closed

Bruh, it has to have some context for what to do... At least give it a few examples.

BigScience Workshop org

Yeah, to be honest our deployment has been unstable because we run a customised version for speed. I think it's much more stable now.


The idea of providing a few examples of how to solve the task is to be clear about what you want the model to output.

To see why it got something wrong, imagine you had read the training dataset and were then given the same task... You never saw a single document that starts with a bare math problem, so the model is like "idk what you want from me".
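The few-shot idea above can be sketched as simple prompt construction (the arithmetic examples here are made up for illustration):

```python
# Build a few-shot prompt: show the model the pattern before asking the real
# question, so the expected output format is unambiguous.
examples = [
    ("2 + 2 =", "4"),
    ("7 + 5 =", "12"),
]
question = "3 + 9 ="

# Each demonstration is "question answer" on its own line;
# the final line is the unanswered question for the model to complete.
prompt = "\n".join(f"{q} {a}" for q, a in examples) + f"\n{question}"
print(prompt)
```

Combined with greedy decoding, this framing makes the model much more likely to continue the pattern rather than wander off.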

Btw, in the first example it thought it was some Stack Overflow code.
