Zikang Shan
zkshan2002
·
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a dataset
1 day ago
zkshan2002/numia10k_gen0-r1d7b
published
a dataset
1 day ago
zkshan2002/numia10k_gen0-r1d7b
updated
a dataset
1 day ago
zkshan2002/numia10k_gen0.75-r1d7b
Organizations
Collections
2
models
None public yet
datasets
18
zkshan2002/numia10k_gen0-r1d7b
Viewer
•
Updated
•
10.2k
•
5
zkshan2002/numia10k_gen0.75-r1d7b
Viewer
•
Updated
•
10.2k
•
6
zkshan2002/numia10k_sft-32b
Viewer
•
Updated
•
2.13k
•
17
zkshan2002/numia10k_sft-r1d32b
Viewer
•
Updated
•
6.1k
•
23
zkshan2002/numia10k_gen-32b
Viewer
•
Updated
•
10.2k
•
22
zkshan2002/numia10k_gen-r1d32b
Viewer
•
Updated
•
10.2k
•
38
zkshan2002/numia_math_train-10k
Viewer
•
Updated
•
10.2k
•
35
zkshan2002/gpqa_diamond
Viewer
•
Updated
•
198
•
14
zkshan2002/olympiad_bench
Viewer
•
Updated
•
675
•
13
zkshan2002/minerva_math
Viewer
•
Updated
•
272
•
13