arxiv:2501.04682
Anikait Singh
Asap7772
AI & ML interests
Deep Learning, Reinforcement Learning, Robotics
Recent Activity
updated
a dataset
about 5 hours ago
Asap7772/math_eval_llama3b_base
updated
a dataset
about 5 hours ago
Asap7772/math_eval_llama8b_base
updated
a dataset
about 5 hours ago
Asap7772/math_eval_gemma9b_base
Organizations
models
18
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_epoch0
Text Generation
•
Updated
•
9
Asap7772/prm_datamath-mc-full_objbce_lr1e-07_epoch0
Text Generation
•
Updated
•
1
Asap7772/prm_datamath-mc-full_objbce_lr1e-06_epoch0
Text Generation
•
Updated
•
6
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_epoch0
Text Generation
•
Updated
•
8
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_epoch0
Text Generation
•
Updated
•
8
Asap7772/prm_datamath-mc-full_objbce_lr5e-07_epoch0
Text Generation
•
Updated
•
1
Asap7772/prm_datamath-mc-full_objbce_lr0.0005_epoch0
Text Generation
•
Updated
•
4
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_checkpoint2400
Updated
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_checkpoint2400
Updated
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_checkpoint2400
Updated
datasets
777
Asap7772/math_eval_llama3b_base
Updated
Asap7772/math_eval_llama8b_base
Updated
Asap7772/math_eval_gemma9b_base
Updated
Asap7772/math_eval_deepseekmath7b_base
Updated
Asap7772/math_eval_qwenmath7b_base
Updated
Asap7772/math_eval_sftboth_1e-7_evaluated
Viewer
•
Updated
•
500
•
3
Asap7772/math_eval_sftboth_1e-6_evaluated
Viewer
•
Updated
•
500
•
3
Asap7772/math_eval_sftboth_1e-5_evaluated
Viewer
•
Updated
•
500
•
3
Asap7772/math_eval_ipoimp_0.05_evaluated
Viewer
•
Updated
•
500
•
5
Asap7772/math_eval_dpoimp_0.1_evaluated
Viewer
•
Updated
•
500
•
5