Edit model card

Model Card for Model ID

Pretrained GPT-NeoX model with 2.06GB English news dataset. Took about 20 hours to reach 40,000 iterations. Trained on p3.16xlarge. Different hyperparameter: gradient_accumulation_step 4

Model Details

Model Description

  • Developed by: Eunyoung Lee
  • Model type: GPT-NeoX
  • Language(s) (NLP): English
Downloads last month
1