Kawamura Masaki
KMasaki
AI & ML interests
None yet
Recent Activity
updated a model 39 minutes ago: KMasaki/Qwen2.5-1.5B-Open-R1-Distill
updated a model about 5 hours ago: KMasaki/llm-jp-3-3.7b-Open-R1-Distill
published a model 14 days ago: KMasaki/llm-jp-3-3.7b-Open-R1-GRPO
Organizations
Collections
3
- KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp8-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058 • Updated • 9
- KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp6-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058 • Updated • 5
- KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058 • Updated • 5
- KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp3-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000065 • Updated • 6
models
18
KMasaki/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated • 34
KMasaki/llm-jp-3-3.7b-Open-R1-Distill • Text Generation • Updated • 125
KMasaki/llm-jp-3-3.7b-Open-R1-GRPO • Updated
KMasaki/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated • 15
KMasaki/Llama-3.1-8B-Instruct-safety-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000390 • Updated • 5
KMasaki/Llama-3.1-8B-Instruct-safety-exp2-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000387 • Updated • 7
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp7-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000123 • Updated • 6
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp8-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058 • Updated • 9
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp6-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058 • Updated • 5
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp3-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000065 • Updated • 6
datasets
None public yet