oof-baroomf commited on
Commit
cfb07f2
·
verified ·
1 Parent(s): afa5e20

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ This is the third in a series of GPT-2 (124M) models I pretrained on different orderings, of data, proving that curriculum learning (https://arxiv.org/html/2405.07490v1) is not a viable method for improving LLM performance, and in fact reduces the performance.
2
+
3
+ I trained the models on data ordered randomly, reading level ascending, and reading level descending.