Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
TRI-ML
/
mamba-7b-rw
like
53
Text Generation
PyTorch
Safetensors
tiiuae/falcon-refinedweb
English
openlm
mamba
linear
Eval Results
arxiv:
2312.00752
arxiv:
2405.06640
License:
apache-2.0
Model card
Files
Files and versions
Community
9
2501def
mamba-7b-rw
/
config.json
sedrick-keh-tri
push jsons
2501def
6 months ago
raw
Copy download link
history
blame
80 Bytes
{
"d_model"
:
4096
,
"n_layer"
:
64
,
"vocab_size"
:
50432
,
"seq_len"
:
2048
}