Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceTB
/
cosmo2-tokenizer
like
2
Follow
Hugging Face Smol Models Research
1.55k
Transformers
HuggingFaceTB/cosmo2_training_data_subset_1M
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
cosmo2-tokenizer
cosmo2-tokenizer
Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
FineWeb-Edu 70%
Cosmopedia v2 15%
StarCoderData 8%
OpenWebMath 5%
StackOverFlow 2%
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support
HF Inference deployability: The model has no pipeline_tag.
Spaces using
HuggingFaceTB/cosmo2-tokenizer
29
π
stokkangri/tsai_s13_smollm2_135M_param_matched_llm
π
Tousifahamed/SmolTextGen
π
Tousifahamed/smol-lm2-demo
π¦
MilindChawre/SmolLM2-Text-Generator
π»
satyanayak/SmalLMv2-Text-Generator
π
nishantb06/SmolLM-Text-Generator
π
anjikum/generate_text_smollm2-135M_implementation
π
EzhirkoArulmozhi/TextGeneratorSmolLM2
π
nishantb06/SmolLMTextGenerator-5k
π
ninagala/smollm2-shakespeare
π’
kalekarnn/SmolLM2-135-model
π
hashvibe007/smollm2
+ 24 Spaces
+ 17 Spaces