New discussion

Is there any SFT or Chat model?

2
#41 opened about 1 month ago by chuyi777

How to use accelerate evaluate Jamba

#40 opened about 1 month ago by Xidong

Jamba Evaluation Task on GSM8K

#39 opened about 1 month ago by ssparks

Fast Mamba

4
#34 opened about 2 months ago by Praneethkeerthi

Request: DOI

#32 opened about 2 months ago by kozolex

GGUF quants?

1
#31 opened about 2 months ago by 6346y9uey

Complex vs Real parametrization.

#27 opened 2 months ago by Yutida

Layer-Selective Rank Reduction

#25 opened 2 months ago by mizinovmv

Update README.md

#23 opened 2 months ago by rombodawg

How many pretraining tokens?

#13 opened 2 months ago by CyberNative

Jambaleo

#10 opened 2 months ago by pszemraj

A Bang Up Job

2
#4 opened 2 months ago by nightvision04

multiple gpu?

3
#3 opened 2 months ago by bdambrosio