SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published 9 days ago • 18
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs Paper • 2407.08995 • Published Jul 12, 2024