mrtzh's picture
Upload 15 files
f2ffdac
|
raw
history blame
215 Bytes

Standard roberta-large model fine-tuned for one pass over the entire Pile dataset.

See Test-time training on nearest neighbors for large language models for details.