Entropy Sentinel: Continuous LLM Accuracy Monitoring from Decoding Entropy Traces in STEM Paper ⢠2601.09001 ⢠Published 10 days ago ⢠17
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper ⢠2601.11000 ⢠Published 8 days ago ⢠26
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Paper ⢠2601.08808 ⢠Published 10 days ago ⢠36
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper ⢠2601.15892 ⢠Published 1 day ago ⢠40
RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation Paper ⢠2601.08430 ⢠Published 11 days ago ⢠54
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper ⢠2601.09088 ⢠Published 10 days ago ⢠57 ⢠6
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper ⢠2601.09088 ⢠Published 10 days ago ⢠57
Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation Paper ⢠2512.20908 ⢠Published Dec 24, 2025 ⢠25
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection Paper ⢠2601.09195 ⢠Published 10 days ago ⢠15
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob Viewer ⢠Updated 9 days ago ⢠435k ⢠4.11k ⢠52
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks Paper ⢠2601.03448 ⢠Published 17 days ago ⢠12
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper ⢠2601.02346 ⢠Published 18 days ago ⢠26
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper ⢠2512.20578 ⢠Published Dec 23, 2025 ⢠82