No information about the training data is provided. How are we supposed to compare this model with the Llama 2 models, and how can we justify the change in perplexity on common datasets such as WikiText?
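For context, the perplexity figure being questioned is just the exponential of the mean negative log-likelihood per token, so any comparison across models only makes sense on the same tokenization and evaluation text. A minimal sketch of the computation (the per-token log-probabilities below are made up for illustration, not real model outputs):

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp of the average negative log-likelihood per token."""
    nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(nll)

# Hypothetical per-token log-probabilities a model might assign
# to a short WikiText passage.
logprobs = [-2.1, -0.4, -3.0, -1.2, -0.7]
print(f"perplexity: {perplexity(logprobs):.2f}")
```

Note that two models with different tokenizers split the same text into different numbers of tokens, which changes the per-token average and hence the reported perplexity, another reason training and evaluation details matter for the comparison.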