This is a custom model with 123.5M parameters.

  • A modified version of GPT-2
  • More data will be added soon
  • Pretraining was done on the FineWeb-Edu dataset
  • No fine-tuning has been done
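
For comparison with the 123.5M figure, the sketch below counts the parameters of a stock GPT-2 small. It assumes the standard published configuration (vocabulary of 50257, context length 1024, width 768, 12 layers, tied input/output embeddings). The function name `gpt2_param_count` is hypothetical, and the modifications made in this model are not described here, so the code shows only the unmodified baseline.

```python
def gpt2_param_count(vocab_size=50257, n_ctx=1024, d_model=768, n_layer=12):
    """Count parameters of a stock GPT-2 style decoder with tied embeddings.

    Defaults are the standard GPT-2 small settings, not this model's settings.
    """
    token_emb = vocab_size * d_model          # token embedding (shared with output head)
    pos_emb = n_ctx * d_model                 # learned position embedding
    layer_norm = 2 * d_model                  # gain + bias
    # Attention: fused QKV projection plus output projection, each with bias.
    attn = (d_model * 3 * d_model + 3 * d_model) + (d_model * d_model + d_model)
    # MLP: expand to 4 * d_model, then project back, each with bias.
    mlp = (d_model * 4 * d_model + 4 * d_model) + (4 * d_model * d_model + d_model)
    block = 2 * layer_norm + attn + mlp       # two layer norms per block
    return token_emb + pos_emb + n_layer * block + layer_norm  # plus final layer norm


if __name__ == "__main__":
    print(gpt2_param_count())  # 124439808
```

The baseline comes to about 124.4M parameters, so the model described here is roughly 0.9M parameters smaller than stock GPT-2 small.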