Edit model card

This is a tiny, dummy version of Jamba, used for debugging and experimentation over the Jamba architecture.

It has 128M parameters (instead of 52B), and is initialized with random weights and did not undergo any training.

Downloads last month
1,957
Safetensors
Model size
128M params
Tensor type
BF16
·