Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks Paper • 2503.11514 • Published Mar 13, 2025 • 18
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF Paper • 2410.04612 • Published Oct 6, 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25, 2024 • 2
Provable Reward-Agnostic Preference-Based Reinforcement Learning Paper • 2305.18505 • Published May 29, 2023
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Paper • 2402.10176 • Published Feb 15, 2024 • 38
Efficient and Interpretable Neural Models for Entity Tracking Paper • 2208.14252 • Published Aug 30, 2022