arxiv:2604.20244
SII-Wenhong
wh-zhu
AI & ML interests
None yet
Recent Activity
authored a paper 5 days ago
Hybrid Policy Distillation for LLMs
updated a model about 1 month ago
wh-zhu/qwen2_7B-ultrachatfeedback-wspo
published a model about 1 month ago
wh-zhu/qwen2_7B-ultrachatfeedback-wspo