Reasoning Hijacking: Subverting LLM Classification via Decision-Criteria Injection Paper • 2601.10294 • Published 4 days ago
Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models Paper • 2509.26165 • Published Sep 30, 2025