Transformers
PyTorch
Inference Endpoints
File size: 601 Bytes
1b1d3e1
 
 
f10a8d5
1
2
3
4
---
license: apache-2.0
---
[BlackMamba](https://www.zyphra.com/blackmamba) is an novel architecture which combines state-space models (SSMs) with mixture of experts (MoE). It uses [Mamba](https://arxiv.org/abs/2312.00752) as its SSM block and [switch transformer](https://arxiv.org/abs/2101.03961) as its MoE block base. BlackMamba is extremely low latency for generation and inference, providing significant speedups over all of classical transformers, MoEs, and Mamba SSM models. Additionally, due to its SSM sequence mixer, BlackMamba retains linear compuational complexity in the sequence length.