Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
HuggingFaceTB
's Collections
π» Local SmolLMs
πͺ SmolLM
Instruct datasets
π Cosmopedia
Find textbooks in FineWeb with a classifier
FineWeb clustering & synthetic generations
Other: Stanford, OpenStax, khanAcademy, wikihow...
FW generation prompts
Wikipedia Science topics
Wikipedia textbooks
SFT Experiments
Decay mixture experiments
models
π Cosmopedia
updated
Aug 18
Resources for Cosmopedia dataset
Upvote
8
HuggingFaceTB/cosmopedia
Viewer
β’
Updated
Aug 12
β’
31.1M
β’
4.84k
β’
553
HuggingFaceTB/cosmo-1b
Text Generation
β’
Updated
Jul 8
β’
656
β’
126
Running
5
πΈοΈ
Web clusters
HuggingFaceTB/cosmopedia-100k
Viewer
β’
Updated
Feb 19
β’
100k
β’
430
β’
38
HuggingFaceTB/cosmopedia-meta
Viewer
β’
Updated
Feb 20
β’
31.1M
β’
2
β’
2
HuggingFaceTB/smollm-corpus
Viewer
β’
Updated
Sep 6
β’
237M
β’
4.02k
β’
217
Upvote
8
+4
Share collection
View history
Collection guide
Browse collections