Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceTB
/
cosmo2-tokenizer
like
1
Follow
Hugging Face TB Research
1.16k
Transformers
HuggingFaceTB/cosmo2_training_data_subset_1M
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
cosmo2-tokenizer
cosmo2-tokenizer
Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
FineWeb-Edu 70%
Cosmopedia v2 15%
StarCoderData 8%
OpenWebMath 5%
StackOverFlow 2%
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.
Spaces using
HuggingFaceTB/cosmo2-tokenizer
25
π
Tousifahamed/SmolTextGen
π
Tousifahamed/smol-lm2-demo
π»
satyanayak/SmalLMv2-Text-Generator
π
nishantb06/SmolLMTextGenerator-5k
π
ninagala/smollm2-shakespeare
π
sudhakar272/SmolLM2-135TextGenerator
π»
Rajendro/SmalLMv2-TextGenerator
π
crpatel/SmolLMTextGenerator
π¬
garima-mahato/SmoLLM2TextGenerator
π¨
chbsaikiran/SMOLLM2_105M
π¦
MilindChawre/SmolLM2-Text-Generator
π
nishantb06/SmolLM-Text-Generator
+ 20 Spaces
+ 13 Spaces