Repetition for long token generation.

#1
by lazyDataScientist - opened

Looks like the model is stuck in an infinite loop "The skyscraper swayed again ...", "She thought, ..." occurs repeatedly in your example.

There is more work the be done with the moe/models in it. ;

Sign up or log in to comment