An atomically sized version of the RoBERTa base architecture, trained on synthetic textbook data. Useful for testing purposes.
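
To give a rough sense of how much a RoBERTa-style encoder shrinks when its dimensions are cut down, the sketch below counts encoder parameters (embeddings, transformer layers, and pooler, without the language-modeling head) from a handful of shape values. The `ROBERTA_BASE` values are those of the published `roberta-base` configuration. The `TINY_HYPOTHETICAL` values are an assumption made up for illustration and are not this model's actual configuration; `EncoderShape` and `count_parameters` are likewise hypothetical helper names.

```python
from dataclasses import dataclass


@dataclass
class EncoderShape:
    """Shape hyperparameters of a RoBERTa-style encoder."""
    vocab_size: int
    hidden_size: int
    num_layers: int
    intermediate_size: int
    max_positions: int
    type_vocab_size: int = 1


def count_parameters(s: EncoderShape) -> int:
    """Count weights and biases of the encoder (no LM head)."""
    h = s.hidden_size
    layer_norm = 2 * h  # scale and bias

    # Word, position, and token-type embeddings plus one LayerNorm.
    embeddings = (s.vocab_size + s.max_positions + s.type_vocab_size) * h + layer_norm

    # Query, key, value, and output projections (with biases) plus LayerNorm.
    attention = 4 * (h * h + h) + layer_norm

    # Two dense layers (with biases) plus LayerNorm.
    feed_forward = (
        (h * s.intermediate_size + s.intermediate_size)
        + (s.intermediate_size * h + h)
        + layer_norm
    )

    # Dense pooler layer applied to the first token.
    pooler = h * h + h

    return embeddings + s.num_layers * (attention + feed_forward) + pooler


# Published roberta-base shape.
ROBERTA_BASE = EncoderShape(
    vocab_size=50265, hidden_size=768, num_layers=12,
    intermediate_size=3072, max_positions=514,
)

# Hypothetical tiny shape, for illustration only (NOT this model's real config).
TINY_HYPOTHETICAL = EncoderShape(
    vocab_size=1000, hidden_size=32, num_layers=2,
    intermediate_size=64, max_positions=130,
)

if __name__ == "__main__":
    base = count_parameters(ROBERTA_BASE)
    tiny = count_parameters(TINY_HYPOTHETICAL)
    print(f"roberta-base encoder: {base:,} parameters")
    print(f"hypothetical tiny encoder: {tiny:,} parameters")
    print(f"reduction factor: {base / tiny:.0f}x")
```

A model several orders of magnitude smaller than the base encoder loads and runs in a fraction of a second, which is what makes it convenient for unit tests and pipeline smoke tests where output quality does not matter.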