SDPO: Segment-Level Direct Preference Optimization for Social Agents • Paper • 2501.01821 • Published 9 days ago