New discussion

Is there any SFT or Chat model?

2
#41 opened 7 months ago by chuyi777

Jamba Evaluation Task on GSM8K

#39 opened 7 months ago by ssparks

Fast Mamba

5
#34 opened 8 months ago by Praneethkeerthi

Request: DOI

#32 opened 8 months ago by kozolex

GGUF quants?

1
#31 opened 8 months ago by 6346y9uey

Complex vs Real parametrization.

#27 opened 8 months ago by Yutida

Layer-Selective Rank Reduction

#25 opened 8 months ago by mizinovmv

Update README.md

#23 opened 8 months ago by rombodawg

How many pretraining tokens?

#13 opened 8 months ago by CyberNative

Jambaleo

#10 opened 8 months ago by pszemraj

A Bang Up Job

2
#4 opened 8 months ago by nightvision04

multiple gpu?

3
#3 opened 8 months ago by bdambrosio