mamba-chat / README.md
justus27's picture
Update README.md
4f68cb1
|
raw
history blame
449 Bytes
---
license: apache-2.0
---
Mamba-Chat is the first chat language model based on a state-space model architecture, not a transformer.
The model is a fine-tune of Albert Gu's and Tri Dao's model [Mamba-2.8B](https://github.com/state-spaces/mamba) from their paper *Mamba: Linear-Time Sequence Modeling with Selective State Spaces*.
Check our our [Github repository](https://github.com/havenhq/mamba-chat/tree/main) for training and inference code.