PRefLexOR Collection PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking • 6 items • Updated Jan 22 • 3