Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
stanford-crfm
/
pubmed_gpt_tokenizer
like
1
Model card
Files
Files and versions
Community
main
pubmed_gpt_tokenizer
1 contributor
History:
22 commits
J38
revert to 28k vocab
3d6a170
over 1 year ago
.gitattributes
1.38 kB
initial commit
over 1 year ago
merges.txt
276 kB
revert to 28k vocab
over 1 year ago
tokenizer.json
1.23 MB
revert to 28k vocab
over 1 year ago
tokenizer_config.json
267 Bytes
50k vocab, prefix_space=false,trained on PubMed Abstracts
over 1 year ago
vocab.json
602 kB
revert to 28k vocab
over 1 year ago