Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Nan Jiang
nanjiang
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
about 8 hours ago
Self-rewarding correction for mathematical reasoning
authored
a paper
10 months ago
RLHF Workflow: From Reward Modeling to Online RLHF
authored
a paper
over 1 year ago
Reinforcement Learning in Low-Rank MDPs with Density Features
View all activity
Organizations
None yet
Papers
5
arxiv:
2502.19613
arxiv:
2405.07863
arxiv:
2401.09681
arxiv:
2302.02571
Expand 5 papers
models
None public yet
datasets
None public yet