xianfeng
xianf
AI & ML interests
None yet
Recent Activity
new activity
12 days ago
AIDC-AI/Marco-o1:How to reproduce your result?
new activity
27 days ago
O1-OPEN/OpenO1-SFT:What's the difference with and without pro?
new activity
about 2 months ago
jinaai/jina-embeddings-v3:How to finetune this model with transformers?
Organizations
None yet
xianf's activity
How to reproduce your result?
#28 opened 12 days ago
by
xianf
What's the difference with and without pro?
3
#3 opened 27 days ago
by
xianf
How to finetune this model with transformers?
1
#56 opened about 2 months ago
by
xianf
3M 和 7M 的中文数据是相同的吗?
1
#18 opened 4 months ago
by
xianf
lm_eval results is weird
5
#2 opened 7 months ago
by
xianf
使用 lm_eval 测试时报错了
2
#1 opened 7 months ago
by
xianf
Is this actually a 1.5B model?
#6 opened 7 months ago
by
xianf
QWEN-1.8B finetune 之后输出全是重复的 token
5
#1 opened about 1 year ago
by
xianf
Please provide a list of file hashes in order to check integrity of downloads
2
#24 opened 9 months ago
by
markusheimerl
缺少 zh_cc.jsonl.gz.part03 ?
3
#14 opened 9 months ago
by
xianf
Any plans to support the latest similar Chinese datasets?
1
#2 opened 10 months ago
by
xianf
模型和 phi2 的关系是什么?
1
#1 opened 12 months ago
by
xianf
Add korean kenLMs
1
#6 opened over 1 year ago
by
hac541309
How to write the prompt to translate for this model?
1
#1 opened over 1 year ago
by
xianf
The model keeps generating up to the maximum length but no EOS token.
1
#13 opened over 1 year ago
by
xianf
How many memory for GPU are needed?
#12 opened over 1 year ago
by
xianf
Why the vocab_size of tokenizer is different from model?
1
#2 opened almost 2 years ago
by
xianf
Is the missing data repaired?
4
#1 opened almost 2 years ago
by
xianf
Is the missing data repaired?
4
#1 opened almost 2 years ago
by
xianf