efe-T
/

sTOpid-SLM-114M

Model card Files Files and versions

sTOpid-SLM-114M 🥴

🙏 Acknowledgements & Attribution

This model, sTOpid-SLM-114M, was built using the following open-source resources:

Tokenizer: Uses the GPT-2 Tokenizer (BPE) provided by OpenAI.
Dataset: Trained on the FineWeb-Edu dataset by Hugging Face.
Underlying Data: CommonCrawl (Subject to their Terms of Use).

We gratefully acknowledge the OpenAI team for the GPT-2 architecture and the Hugging Face FineWeb team for the high-quality training data.

📊 Model Information

Training Data: ~590M Tokens of FineWeb-Edu
Hardware: NVIDIA RTX PRO 6000 Blackwell Server Edition

⚠️ Disclaimer & Behavior

Not Fully Trained: This model is in an early stage. It has many hallucinations that can make people smile, but they are logically incorrect.
Do Not Trust: Do not trust the model for facts, math, or serious advice. It is confidently wrong about almost everything.
Purpose: Created for research, laughter, and pushing the limits of Blackwell hardware.

License: MIT

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train efe-T/sTOpid-SLM-114M