Sohyun An
sohyunan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
Exploring Expert Failures Improves LLM Agent Tuning
upvoted
a
paper
about 1 month ago
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model
Organizations
None yet
Collections
1
models
16
sohyunan/DeepSeek-R1-Distill-Qwen-1.5B-sft-lora
Updated
sohyunan/DeepSeek-R1-Distill-Qwen-1.5B-sft-full
Updated
sohyunan/gemma-2-2b-it-maze-sft-sys0.0
Text Generation
•
Updated
•
3
sohyunan/gemma-2-2b-it-maze-sft-ctrl-sys0.5-a_star
Text Generation
•
Updated
•
2
sohyunan/gemma-2-2b-it-maze-sft-sys1.0-a_star
Text Generation
•
Updated
•
3
sohyunan/gemma-2-2b-it_controller_sft_random_grpo
Text Generation
•
Updated
•
4
sohyunan/debug
Text Generation
•
Updated
•
4
sohyunan/gemma-2-2b-it_controller_sft_random_grpo_lora
Updated
sohyunan/gemma-2-2b-it_controller_grpo_lora
Updated
sohyunan/gemma-2-2b-it_controller_sft_random
Text Generation
•
Updated
•
3
datasets
None public yet