Yantao

RicardoL1u

AI & ML interests

NLP

Recent Activity

published a model 3 days ago

THU-KEG/PairJudge-RM

updated a model 3 days ago

THU-KEG/PairJudge-RM

published a dataset 4 days ago

THU-KEG/PairJudge-432K

RicardoL1u's activity

commented 2 papers 11 days ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published 12 days ago • 18

New activity in THU-KEG/RM-Bench 3 months ago

Add link to paper

#2 opened 3 months ago by

commented a paper 3 months ago

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Paper • 2410.16184 • Published Oct 21, 2024 • 24