Update README.md
3
#76 opened almost 2 years ago
by
ybelkada
From Megatron GPT-2 or GPT-3?
2
#75 opened almost 2 years ago
by
jmassot
Download and run the model
4
#74 opened almost 2 years ago
by
HAAR
Name changes for smaller BLOOM models
1
#73 opened almost 2 years ago
by
eliolio
Does BLOOM work as designed?
5
#72 opened almost 2 years ago
by
wangekxy
Is BLOOM's logo open source? Can I use it on a blog post?
3
#71 opened almost 2 years ago
by
arteagac
"Model is overloaded, please wait for a bit"
15
#70 opened almost 2 years ago
by
jmarxza
Inference on TPU-v3-32
2
#69 opened almost 2 years ago
by
ybelkada
Inference on TPU-v3-32
5
#68 opened almost 2 years ago
by
zhiG
Widget multiline examples correct behaviour
2
#67 opened almost 2 years ago
by
mishig
Why are the training carbon emissions still not available?
15
#66 opened almost 2 years ago
by
Charlolegossbo
<s> token
1
#65 opened almost 2 years ago
by
Muennighoff
Why are there some "dense_h_to_4h" and "dense_4h_to_h" layers without any activation layers in between?
4
#64 opened almost 2 years ago
by
Tombriss
Bloom-176B on Google Colab
5
#63 opened almost 2 years ago
by
thisisanshgupta
How to use bloom-176B to generate or evaluate on Multi-graphics?
2
#62 opened almost 2 years ago
by
xuyifan
Why can't the model be run (really slowly) on consumer hardware?
7
#61 opened almost 2 years ago
by
TonoTheHero
Add Lingala
1
#60 opened almost 2 years ago
by
Muennighoff
Inference on BLOOM 165B is too slow
15
#59 opened almost 2 years ago
by
mayank-mishra
Hardware Requirements for CPU / GPU Inference
6
#58 opened almost 2 years ago
by
jurassicpark
Update README.md
#57 opened almost 2 years ago
by
ybelkada
Why was no Slavic language included in the training dataset?
2
#56 opened almost 2 years ago
by
brabecjan91
Facing issue while doing token classification using BLOOM LLM model
1
#55 opened almost 2 years ago
by
SourabhSahu
Finetuning BLOOM 175-B
6
#54 opened almost 2 years ago
by
mayank-mishra
Fix the tokenizer class
1
#53 opened almost 2 years ago
by
sgugger
Invalid request from sample code
2
#52 opened almost 2 years ago
by
vickyzhang
Update README.md
7
#51 opened almost 2 years ago
by
stellaathena
How much disk does each of the bloom models require?
3
#50 opened almost 2 years ago
by
dgaff
Token or Sentence Embeddings
11
#49 opened almost 2 years ago
by
rufimelo
What is the seed parameter in the hosted API of BLOOM?
2
#48 opened almost 2 years ago
by
chirag11
BLOOM training languages inconsistencies
3
#47 opened almost 2 years ago
by
Muennighoff
Fine-tune the model?
82
#46 opened almost 2 years ago
by
NXBY
Hosting bloom 176B model for inference
17
#45 opened almost 2 years ago
by
NonStatic
Update README.md
#44 opened almost 2 years ago
by
Muennighoff
Update README.md
#43 opened almost 2 years ago
by
Muennighoff
Fix typo
#42 opened almost 2 years ago
by
Muennighoff
Fix typo
#41 opened almost 2 years ago
by
Muennighoff
Fix typo
#40 opened almost 2 years ago
by
Muennighoff
Unable to load model using `accelerate`
2
#39 opened almost 2 years ago
by
ricwo
re-add `code` as a language
#38 opened almost 2 years ago
by
christopher
more modifs on the model card
#37 opened almost 2 years ago
by
ybelkada
update training information
#36 opened almost 2 years ago
by
ybelkada
Remove dtype
1
#35 opened almost 2 years ago
by
Muennighoff
Loading partial model
8
#34 opened almost 2 years ago
by
maveriq
update programming languages
#33 opened almost 2 years ago
by
ybelkada
Revert language error
#32 opened almost 2 years ago
by
ybelkada
Remove dup space
#31 opened almost 2 years ago
by
Muennighoff
Dataset Pie Chart not readable in dark mode
#30 opened almost 2 years ago
by
Muennighoff
Add programming languages
3
#29 opened almost 2 years ago
by
christopher
Add arrows for code evaluation
1
#28 opened almost 2 years ago
by
Muennighoff
Add evaluation
6
#27 opened almost 2 years ago
by
Muennighoff