arxiv:2410.02725
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Recent Activity
updated
a dataset
2 days ago
Asap7772/Math-steptok-steps-mcvalue-test-part1-of-5
updated
a dataset
2 days ago
Asap7772/Math-steptok-steps-mcvalue-test-part3-of-5
updated
a dataset
2 days ago
Asap7772/Math-steptok-steps-mcvalue-test-part2-of-5
Organizations
models
8
Asap7772/mathcamp_sft_llama3-1-8b
Text Generation
•
Updated
•
6
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed15486-exp0_epoch0_checkpoint1
Updated
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed31426-exp0_epoch0_checkpoint2
Updated
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed31426-exp0_epoch0_checkpoint1
Text Generation
•
Updated
•
11
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed26382-exp0_epoch0_checkpoint2
Updated
Asap7772/anikait-prm_lr1e-5-datamath-mc-seed26382-exp0_epoch0_checkpoint1
Text Generation
•
Updated
•
13
Asap7772/elix-llama32-3b-ipo
Updated
Asap7772/sft-prm800k-llama31-8b-steptok
Text Generation
•
Updated
•
2.19k
datasets
466
Asap7772/Math-steptok-steps-mcvalue-test-part1-of-5
Viewer
•
Updated
•
22.1k
•
4
Asap7772/Math-steptok-steps-mcvalue-test-part3-of-5
Viewer
•
Updated
•
22.1k
•
5
Asap7772/Math-steptok-steps-mcvalue-test-part2-of-5
Viewer
•
Updated
•
22.1k
•
5
Asap7772/Math-steptok-steps-mcvalue-test-part5-of-5
Viewer
•
Updated
•
22.1k
•
4
Asap7772/Math-steptok-steps-mcvalue-test-part4-of-5
Viewer
•
Updated
•
22.1k
•
4
Asap7772/Math-steptok-steps-mcvalue-part4-of-5
Viewer
•
Updated
•
521k
•
8
Asap7772/Math-steptok-steps-mcvalue-part1-of-5
Viewer
•
Updated
•
521k
•
6
Asap7772/Math-steptok-steps-mcvalue-part2-of-5
Viewer
•
Updated
•
521k
•
9
Asap7772/Math-steptok-steps-mcvalue-part3-of-5
Viewer
•
Updated
•
521k
•
7
Asap7772/Math-steptok-steps-mcvalue-part5-of-5
Viewer
•
Updated
•
521k
•
8