Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
JunxiongWang
/
MambaByte_PG19_353M
like
0
Text Generation
Transformers
PyTorch
pg19
Inference Endpoints
arxiv:
2401.13660
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
Edit model card
Train in 30B Byte. Mode size 353M. Table 2 in
MambaByte
Downloads last month
14
Dataset used to train
JunxiongWang/MambaByte_PG19_353M
deepmind/pg19
Updated
Jan 18
•
45