Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fhswf
/
BPE_GPT2_TinyStoriesV2_cleaned_2048
like
1
Follow
Fachhochschule Südwestfalen
24
fhswf/TinyStoriesV2_cleaned
English
text generation
License:
mit
Model card
Files
Files and versions
Community
BPE Tokenizer for TinyStoriesV2
BPE Tokenizer for TinyStoriesV2
Based on get-neo BPE Tokenizer, but with a smaller vocabulary. Trained with TinyStoriesV2.
Vocab Size: 2048
256 Base chars
1 extra Token: <|endoftext|>
1791 merges
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference API
Unable to determine this model's library. Check the
docs
.
Dataset used to train
fhswf/BPE_GPT2_TinyStoriesV2_cleaned_2048
fhswf/TinyStoriesV2_cleaned
Viewer
•
Updated
May 23, 2024
•
2.71M
•
113
•
6