Create README.md
README.md ADDED
@@ -0,0 +1,3 @@
+This is the third in a series of GPT-2 (124M) models I pretrained on different orderings of data, demonstrating that curriculum learning (https://arxiv.org/html/2405.07490v1) is not a viable method for improving LLM performance, and in fact reduces it.
+
+I trained the models on data ordered randomly, by ascending reading level, and by descending reading level.
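
For readers who want to reproduce the ordering step, below is a minimal sketch of curriculum ordering by reading level. It assumes the Flesch reading ease score from the textstat package as the difficulty measure; this README does not state which readability metric or tooling was actually used.

```python
# Minimal sketch: build the three data orderings compared above.
# Assumption: reading level is approximated by textstat's Flesch
# reading ease score; the metric actually used here is unspecified.
import random

import textstat

corpus = [
    "The cat sat on the mat.",
    "Dogs like to run and play outside.",
    "Thermodynamic equilibrium presupposes ergodicity of microstates.",
]

# Flesch reading ease: a higher score means easier text, so sorting by
# the negated score yields ascending reading level (easy -> hard).
ascending = sorted(corpus, key=lambda t: -textstat.flesch_reading_ease(t))
descending = ascending[::-1]  # descending reading level (hard -> easy)

randomized = corpus[:]
random.shuffle(randomized)  # baseline: random ordering
```

Each ordering would then be fed to an otherwise identical GPT-2 (124M) pretraining run, so that only the data order differs between models.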