mrtzh's picture
Update README.md
b29c803
|
raw
history blame contribute delete
No virus
272 Bytes
---
license: cc-by-nc-4.0
datasets:
- EleutherAI/pile
---
Standard `roberta-large` model fine-tuned for one pass over the entire Pile dataset.
See [Test-time training on nearest neighbors for large language models](https://github.com/socialfoundations/tttlm) for details.