New discussion

From Megratron GPT-2 or GPT-3?

#75 opened about 1 hour ago by jmassot

Download and run the model

#74 opened about 3 hours ago by HAAR

Does BLOOM work as designed?

3
#72 opened 5 days ago by wangeksy

Inference on TPU-v3-32

2
#69 opened 12 days ago by ybelkada

Inference on TPU-v3-32

5
#68 opened 12 days ago by zhiG

<s> token

1
#65 opened 13 days ago by Muennighoff

Add Lingala

1
#60 opened 19 days ago by Muennighoff

Update README.md

#57 opened 21 days ago by ybelkada

Finetuning BLOOM 175-B

6
#54 opened 23 days ago by mayank31398

Fix the tokenizer class

1
#53 opened 23 days ago by sgugger

Update README.md

7
#51 opened 23 days ago by stellaathena

Token or Sentence Embeddings

11
#49 opened 24 days ago by rufimelo

Fine-tune the model?

9
#46 opened 26 days ago by NXBY

Update README.md

#44 opened 28 days ago by Muennighoff

Update README.md

#43 opened 28 days ago by Muennighoff

Fix typo

#42 opened 28 days ago by Muennighoff

Fix typo

#41 opened 28 days ago by Muennighoff

Fix typo

#40 opened 28 days ago by Muennighoff

re-add `code` as a language

#38 opened 28 days ago by cakiki

more modifs on the model card

#37 opened 29 days ago by ybelkada

update training information

#36 opened 29 days ago by ybelkada

Remove dtype

1
#35 opened 29 days ago by Muennighoff

Loading partial model

7
#34 opened 29 days ago by maveriq

update programming languages

#33 opened 29 days ago by ybelkada

Revert language error

#32 opened 29 days ago by ybelkada

Remove dup space

#31 opened 29 days ago by Muennighoff

Add programming languages

3
#29 opened 29 days ago by cakiki

Add evaluation

6
#27 opened 29 days ago by Muennighoff

Add link to interactive corpus treemap

1
#26 opened about 1 month ago by yjernite