Wei Liu
PeterV09
AI & ML interests
Machine Learning, Natural Language Processing
Recent Activity
commented on a paper 1 minute ago
Diving into Self-Evolving Training for Multimodal Reasoning
upvoted a paper 6 minutes ago
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Organizations
Collections 2
Papers 2
models 18
PeterV09/llava-1.6-alignmentv2 • Text Generation • Updated • 5
PeterV09/llava-1.6-beta-26 • Updated
PeterV09/llava-1.6-asft • Updated
PeterV09/llava-1.6-4sftmse • Updated • 8
PeterV09/llava-1.6-3sft0.5 • Updated • 4
PeterV09/llava-1.6-2sft • Updated • 5
PeterV09/llava-1.6-sft • Text Generation • Updated • 9
PeterV09/mistral-7b-300k-6k-a100-6e-valid-hkust_2-l4k • Text Generation • Updated • 10
PeterV09/deita-6k-sft-fordpo • Text Generation • Updated • 15
PeterV09/mistral-7b-300k-6k-a100-6e-valid-7 • Text Generation • Updated • 9
datasets
None public yet