Mt0 or Bloomz model: results are always same short length?
6
#56 opened 3 months ago
by
dsbyprateekg
ValueError: The state dictionary of the model you are trying to load is corrupted.
4
#55 opened 3 months ago
by
dsbyprateekg
BLOOMZ answering in a different language
1
#54 opened 3 months ago
by
reemmasoud
Why does the token vocabs are unreadable code?
#53 opened 5 months ago
by
ShaneSue
Performance Evaluation
1
#52 opened 5 months ago
by
fatimakqq
base_model_prefix = "transformer"
1
#51 opened 7 months ago
by
Cyrile
Repetitive Output
1
#50 opened 7 months ago
by
fatimakqq
How to get longer outputs?
1
#49 opened 8 months ago
by
Apps
Default padding_side
3
#48 opened 8 months ago
by
Cyrile
Are there any special tokens formatted as '<PERSON>', '<LOC>' in the training set or fine-tuning set?
2
#47 opened 9 months ago
by
tingxinli
Is the CausalML model from HuggingFace truly causal?
1
#46 opened 9 months ago
by
Cyrile
bfloat 16 vs float 16
8
#45 opened 9 months ago
by
Arkea
Commercial usage
4
#44 opened 10 months ago
by
tingxinli
disable inference API
5
#43 opened 11 months ago
by
olivierdehaene
Fine-tuning with LoRA using a dataset with different classification
#42 opened 12 months ago
by
karim1104
Removing Stop Token from Decode/Generate
4
#41 opened about 1 year ago
by
JHenzi
Bloomz-3b Refuses to Summarize Text
10
#40 opened about 1 year ago
by
Kato-22
What is the maximum input length for Bloomz and MT0?
1
#39 opened about 1 year ago
by
Charm3link
Needed RAM for the full bloomz model?
3
#38 opened about 1 year ago
by
luxianos
activate inference widget
1
#37 opened about 1 year ago
by
olivierdehaene
where can we get a bloomz-7b1 finetuned checkpoint
2
#36 opened about 1 year ago
by
yuanzhang0968
Worse performance in Text Generation on Chinese corpus
3
#35 opened about 1 year ago
by
WQW
Fix architecture
#34 opened about 1 year ago
by
lewtun
Fix architecture class
#33 opened about 1 year ago
by
lewtun
Not able to deploy on AWS Sagemaker
1
#32 opened about 1 year ago
by
BahauddinAziz
Is there any company using BLOOMZ as the basis or their service?
6
#31 opened about 1 year ago
by
Muhammadreza
Difference between MT0 and BLOOMZ
1
#30 opened over 1 year ago
by
yahma
Will there be a distilled model that fits inside 48GB VRAM (2x 3090)?
1
#29 opened over 1 year ago
by
gameveloster
A way to inference and fine-tune BLOOMZ-176B from Google Colab or locally
2
#28 opened over 1 year ago
by
borzunov
Unable to edit max_length
2
#27 opened over 1 year ago
by
amuhak
How big is bloomz?
2
#26 opened over 1 year ago
by
hiddenchamp
Can't get it to work
1
#25 opened over 1 year ago
by
hiddenchamp
Train on new un-supported language, i.e. German?
4
#24 opened over 1 year ago
by
kmdanikowski
error on the inference widget
4
#23 opened over 1 year ago
by
clem
generating lists
7
#22 opened over 1 year ago
by
i-am-neo
No longer available, why?
11
#21 opened over 1 year ago
by
micole66
Weird behavior of BLOOMZ-7b1
8
#20 opened over 1 year ago
by
pohunghuang
Zero shot comparison with Instruct-GPT-3 ?
2
#19 opened over 1 year ago
by
nishanthcmesh
Questions on safetensors and text-generation-inference server
17
#18 opened over 1 year ago
by
pai4451
Is there a code generation demo?
4
#17 opened over 1 year ago
by
hankcs
list of commands
1
#15 opened over 1 year ago
by
mishavee
running and fine tuning
10
#14 opened over 1 year ago
by
mishavee
Update README.md
#13 opened over 1 year ago
by
ybelkada
Update README.md
#12 opened over 1 year ago
by
ybelkada
Update README.md
#11 opened over 1 year ago
by
ybelkada
Update README.md
#10 opened over 1 year ago
by
ybelkada
Update README.md
#9 opened over 1 year ago
by
ybelkada
Nice prompt
#8 opened over 1 year ago
by
ybelkada
Add new example
#7 opened over 1 year ago
by
ybelkada