PRefLexOR Collection: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking • 8 items • Updated 13 days ago