New discussion

About Quantized Models

#14 opened 4 months ago by infgrad

Multilingual or Bilingual

#25 opened about 1 month ago by MeanBean-05

Remote Code execution risk

4
#24 opened about 2 months ago by srivishnuceg

flash attention

#21 opened 3 months ago by Disassemblern

Model loading size on GPU

#20 opened 4 months ago by divrajnd

MRL and linear layers

1
#19 opened 4 months ago by bobox

Can it output sparse vector?

1
#18 opened 4 months ago by kk3dmax

Does this model only work on GPU?

1
#16 opened 4 months ago by xPurity

Any multi-lingual variant

1
#10 opened 4 months ago by prophet123

Parameters for peak performances

3
#8 opened 4 months ago by cvdbdo

Model max_seq_length

6
#6 opened 4 months ago by shuyuej

Fix prompt_name typo

1
#4 opened 4 months ago by mber

Upload ONNX weights

2
#3 opened 4 months ago by Xenova