QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published 21 days ago • 9
Training Step-Level Reasoning Verifiers with Formal Verification Tools Paper • 2505.15960 • Published May 21, 2025 • 7
When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks Paper • 2504.02010 • Published Apr 2, 2025
HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? Paper • 2504.18406 • Published Apr 25, 2025 • 3
GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization Paper • 2504.03975 • Published Apr 4, 2025
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks Paper • 2406.02818 • Published Jun 4, 2024 • 2
FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization Paper • 2311.02271 • Published Nov 3, 2023
GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers Paper • 2412.09722 • Published Dec 12, 2024 • 5
Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models Paper • 2411.07858 • Published Nov 12, 2024 • 2
AAAR-1.0: Assessing AI's Potential to Assist Research Paper • 2410.22394 • Published Oct 29, 2024 • 16
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published 21 days ago • 9
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents Paper • 2503.01935 • Published Mar 3, 2025 • 30
HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? Paper • 2504.18406 • Published Apr 25, 2025 • 3
Training Step-Level Reasoning Verifiers with Formal Verification Tools Paper • 2505.15960 • Published May 21, 2025 • 7
VisOnlyQA Collection Dataset for evaluating the visual perception capabilities of LVLMs. • 13 items • Updated Jul 13, 2025 • 4
Verbosity neq Veracity: Demystify Verbosity Compensation Behavior of Large Language Models Paper • 2411.07858 • Published Nov 12, 2024 • 2
VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information Paper • 2412.00947 • Published Dec 1, 2024 • 8
VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information Paper • 2412.00947 • Published Dec 1, 2024 • 8