Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Deepak Kumar's picture

2 1

Deepak Kumar

ddeepakkumar

·

AI & ML interests

None yet

Organizations

None yet

Collections 1

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11, 2024 • 28

models

None public yet

datasets

None public yet

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs