Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 18 days ago • 222
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper • 2511.14295 • Published 19 days ago • 71
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model Paper • 2511.13647 • Published 19 days ago • 70
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 20 days ago • 134
P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published 19 days ago • 132
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 22 days ago • 158
LLM-Powered Fully Automated Chaos Engineering: Towards Enabling Anyone to Build Resilient Software Systems at Low Cost Paper • 2511.07865 • Published 26 days ago • 3
TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models Paper • 2511.11831 • Published 22 days ago • 1
A Brain Wave Encodes a Thousand Tokens: Modeling Inter-Cortical Neural Interactions for Effective EEG-based Emotion Recognition Paper • 2511.13954 • Published 19 days ago • 3
Error-Driven Scene Editing for 3D Grounding in Large Language Models Paper • 2511.14086 • Published 19 days ago • 4
Proactive Hearing Assistants that Isolate Egocentric Conversations Paper • 2511.11473 • Published 22 days ago • 6
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published 20 days ago • 5
Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution Paper • 2511.14210 • Published 19 days ago • 19
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published 19 days ago • 17
Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework Paper • 2511.13189 • Published 20 days ago • 38
ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning Paper • 2511.14366 • Published 19 days ago • 15