What are human values, and how do we align AI to them? Paper • 2404.10636 • Published Mar 27, 2024 • 1
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 27
What are human values, and how do we align AI to them? Paper • 2404.10636 • Published Mar 27, 2024 • 1