GRPO RL model
SunJack
SunJack
·
AI & ML interests
None yet
Recent Activity
updated
a collection
7 days ago
GRPO
updated
a model
7 days ago
SunJack/Qwen2.5-3B-R1-GGUF
updated
a model
7 days ago
SunJack/Qwen2.5-3B-R1
Organizations
Collections
1
models
14

SunJack/Qwen2.5-3B-R1-GGUF
Updated
•
96

SunJack/Qwen2.5-3B-R1
Updated
•
53

SunJack/Phi-4-R1
Updated

SunJack/Phi-4-R1-GGUF
Updated

SunJack/Qwen2.5-7b-sft
Updated
•
15

SunJack/phi4-o1
Updated
•
238

SunJack/Qwen2.5-3B-GRPO_lora
Updated

SunJack/qwen2.5-7b-o1
Updated
•
78
•
1

SunJack/qwen2.5-7b-cve
Updated
•
63
•
1

SunJack/qwen2-7b-ruozhiba-finetuning
Updated
•
90
•
2