mamba-chat / README.md
justus27's picture
Update README.md
4f68cb1
|
raw
history blame
449 Bytes
metadata
license: apache-2.0

Mamba-Chat is the first chat language model based on a state-space model architecture, not a transformer.

The model is a fine-tune of Albert Gu's and Tri Dao's model Mamba-2.8B from their paper Mamba: Linear-Time Sequence Modeling with Selective State Spaces.

Check our our Github repository for training and inference code.