Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
euclaise
's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements
Small-ish SoTA (<5B), (quasi-)base
updated
Aug 10
Upvote
1
nvidia/Minitron-4B-Base
Updated
Aug 22
•
42
•
127
h2oai/h2o-danube3-4b-base
Text Generation
•
Updated
Jul 15
•
3.91k
•
21
stabilityai/stablelm-3b-4e1t
Text Generation
•
Updated
Mar 7
•
8.89k
•
309
Qwen/Qwen2-1.5B
Text Generation
•
Updated
Jun 6
•
33.8k
•
82
internlm/internlm2_5-1_8b-chat
Text Generation
•
Updated
Aug 20
•
4.55k
•
24
Qwen/Qwen1.5-4B
Text Generation
•
Updated
Apr 5
•
13.2k
•
33
tensoropera/Fox-1-1.6B
Text Generation
•
Updated
14 days ago
•
353
•
30
TRI-ML/DCLM-1B
Updated
Jul 25
•
564
•
13
Upvote
1
Share collection
View history
Collection guide
Browse collections