Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
7
1
xz
mxz
Follow
0 followers
·
3 following
AI & ML interests
NLP ML RL
Recent Activity
updated
a model
about 2 months ago
mxz/qwen-R1-3B
updated
a model
about 2 months ago
mxz/qwen-R1-1.5B
updated
a model
about 2 months ago
mxz/qwen-R1-0.5b
View all activity
Organizations
None yet
models
7
Sort: Recently updated
mxz/qwen-R1-3B
Updated
Mar 4
•
1
mxz/qwen-R1-1.5B
Updated
Mar 4
•
1
mxz/qwen-R1-0.5b
Updated
Mar 3
•
1
mxz/llama3-8b-dpo
Text Generation
•
Updated
Jul 28, 2024
•
1
mxz/llama3-8b-ppo
Text Generation
•
Updated
Jul 28, 2024
•
2
mxz/llama3-8b-sft
Text Generation
•
Updated
Jul 28, 2024
mxz/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jul 17, 2024
datasets
4
Sort: Recently updated
mxz/awesome-dpo
Viewer
•
Updated
Jul 28, 2024
•
302k
•
17
mxz/CValues
Viewer
•
Updated
Jul 26, 2024
•
146k
•
14
mxz/CValues_DPO
Viewer
•
Updated
Jul 26, 2024
•
146k
•
16
mxz/alpaca_en_zh_ruozhiba_gpt4-data
Viewer
•
Updated
Jul 26, 2024
•
190k
•
14