22 1

fireballoon

AI & ML interests

None yet

Recent Activity

published a model about 1 month ago

fireballoon/Qwen2.5-1.5B-Open-R1-GRPO

new activity 4 months ago

fireballoon/baichuan-llama-7b:Adding `safetensors` variant of this model

upvoted a collection 12 months ago

Awesome SFT datasets

View all activity

Organizations

None yet

fireballoon's activity

New activity in fireballoon/baichuan-llama-7b 4 months ago

Adding `safetensors` variant of this model

#5 opened 5 months ago by

SFconvertbot

New activity in fireballoon/baichuan-vicuna-chinese-7b over 1 year ago

loss震荡幅度比较大是正常的嘛，loss是在3个epoch的哪个时候开始下降并保持稳定的呢

#13 opened over 1 year ago by

Aibet

请问deepspeed zero3的参数是怎么配置的

#14 opened over 1 year ago by

Aibet

New activity in fireballoon/baichuan-vicuna-7b over 1 year ago

可以提供一下leetcode的能跑通的数据或者处理code 嘛?谢谢

#8 opened over 1 year ago by

Aibet

可以提供一下训练代码吗？

#5 opened almost 2 years ago by

puppet1988

请问有跑分的代码吗

#7 opened over 1 year ago by

endNone

New activity in fireballoon/baichuan-vicuna-chinese-7b over 1 year ago

how to fix the "ValueError: Tokenizer class LlamaTokenizer does not exist or is not currently imported." while my tokenizer_config.json file is "tokenizer_class": "LlamaTokenizer", already .

#6 opened over 1 year ago by

lishuangxiu-nuannuan

训练的时候loss为0

#5 opened almost 2 years ago by

deerluffy

不支持Vicuna-v1.3？

#4 opened almost 2 years ago by

acupofespresso

The problem of pad_token

#10 opened over 1 year ago by

kang1

New activity in fireballoon/baichuan-llama-7b over 1 year ago

About baichuan-13b model conversion

#4 opened over 1 year ago by

greatzane

New activity in fireballoon/baichuan-vicuna-chinese-7b over 1 year ago

继续微调的问题

#9 opened over 1 year ago by

yuqin

New activity in fireballoon/baichuan-llama-7b over 1 year ago

Baichuan-13B please!

#3 opened over 1 year ago by

greatzane

New activity in fireballoon/baichuan-vicuna-chinese-7b over 1 year ago

模型效果超出预期，很棒！！

#8 opened over 1 year ago by

oscar325

请问这个sft用到了哪些数据，总共是多少量级？

#7 opened over 1 year ago by

Kuaixueshiqing

New activity in fireballoon/baichuan-llama-7b over 1 year ago

请问验证过转完的llama格式权重能够用于sft吗

#1 opened almost 2 years ago by

nnz

New activity in fireballoon/baichuan-vicuna-7b almost 2 years ago

关于模型中文语言表现

#6 opened almost 2 years ago by

reedhs

有办法量化吗

#1 opened almost 2 years ago by

zxgov

New activity in fireballoon/baichuan-vicuna-chinese-7b almost 2 years ago

容易用英文回答，即使明确提示使用中文

#2 opened almost 2 years ago by

huashiyiqike

New activity in fireballoon/baichuan-vicuna-7b almost 2 years ago

fast tokenizer问题

#3 opened almost 2 years ago by

JaheimLee