Bolian Li
lblaoke
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
lblaoke/qwama-0.5b-hh-rlhf-dpo-trl-v4
updated
a model
2 days ago
lblaoke/qwama-0.5b-hh-rlhf-sft-chosen-trl-v4
updated
a collection
2 days ago
Draft Models
Organizations
None yet
Collections
3
models
39
lblaoke/qwama-0.5b-hh-rlhf-dpo-trl-v4
Updated
•
12
lblaoke/qwama-0.5b-hh-rlhf-sft-chosen-trl-v4
Updated
•
34
lblaoke/qwama-0.5b-skywork-pref-sft-chosen-dpo-trl-v3
Updated
•
7
lblaoke/qwama-0.5b-skywork-pref-sft-rejected-chosen-trl-v3
Updated
•
9
lblaoke/qwama-0.5b-skywork-pref-sft-chosen-trl-v3
Updated
•
6
lblaoke/qwama-0.5b-skywork-pref-sft-rejected-trl-v3
Updated
•
6
lblaoke/qwama-0.5b-skywork-pref-dpo-trl-v2
Updated
•
32
lblaoke/qwama-0.5b-skywork-pref-dpo-llama-factory-v1
Updated
•
4
lblaoke/qwama-0.5b-skywork-pref-dpo-trl-v1
Updated
•
11
lblaoke/mistral-v0.3-7b-ppo-self-human
Updated
•
2
datasets
None public yet