Is there any SFT or Chat model?

2
#41 opened 11 days ago by chuyi777

Jamba Evaluation Task on GSM8K

#39 opened 20 days ago by ssparks

Fast Mamba

4
#34 opened about 1 month ago by Praneethkeerthi

Request: DOI

#32 opened about 1 month ago by kozolex

GGUF quants?

1
#31 opened about 1 month ago by 6346y9uey

Why is there an MLP in the Mamba Layer?

#28 opened about 1 month ago by naston

Complex vs Real parametrization.

#27 opened about 1 month ago by Yutida

How to Fine-tune Jamba on Google Colab?

7
#26 opened about 1 month ago by Ateeqq

Layer-Selective Rank Reduction

#25 opened about 1 month ago by mizinovmv

Update README.md

#23 opened about 1 month ago by rombodawg

How many pretraining tokens?

#13 opened about 1 month ago by CyberNative

Coding performance of base model?

4
#11 opened about 1 month ago by rombodawg

Jambaleo

#10 opened about 1 month ago by pszemraj

A Bang Up Job

2
#4 opened about 1 month ago by nightvision04

multiple gpu?

3
#3 opened about 1 month ago by bdambrosio