2 23 1

tran minh thang

thangtm

AI & ML interests

None yet

Recent Activity

updated a collection 28 days ago

reasoning_model

upvoted a collection 28 days ago

Representation & Optimization

updated a collection 28 days ago

reasoning_model

View all activity

Organizations

None yet

upvoted a collection 28 days ago

Representation & Optimization

Collection

Understanding about representation sheds light on optimization • 126 items • Updated 16 days ago • 7

upvoted 5 papers about 1 month ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 189

Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models

Paper • 2503.04813 • Published Mar 4, 2025 • 2

upvoted 7 papers about 2 months ago

Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners

Paper • 2601.02996 • Published Jan 6 • 6

GARDO: Reinforcing Diffusion Models without Reward Hacking

Paper • 2512.24138 • Published Dec 30, 2025 • 29

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Paper • 2512.22234 • Published Dec 23, 2025 • 22

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 310

On the Role of Discreteness in Diffusion LLMs

Paper • 2512.22630 • Published Dec 27, 2025 • 18

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 65

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 51

upvoted 6 papers 2 months ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published Dec 24, 2025 • 69

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published Dec 23, 2025 • 16

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 86

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 91

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 219

upvoted a paper 3 months ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 78

tran minh thang

AI & ML interests

Recent Activity

Organizations

thangtm's activity