Mingzhe Li's picture

4 2

Mingzhe Li

Mubuky

·

https://www.mubuky.com

Mubuky

AI & ML interests

RL & Agent

Recent Activity

upvoted a paper 4 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 4 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

upvoted 3 papers 4 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published 10 days ago • 63

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 168

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 6 days ago • 77

liked a dataset 29 days ago

OpenMOSS-Team/VideoThinkBench

Viewer • Updated 14 days ago • 4.9k • 3k • 10

authored a paper 30 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published about 1 month ago • 208

upvoted a paper about 1 month ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published about 1 month ago • 208

updated a dataset about 2 months ago

OpenMOSS-Team/VideoThinkBench

Viewer • Updated 14 days ago • 4.9k • 3k • 10

liked a model 2 months ago

Qwen/WorldPM-72B

Text Classification • 73B • Updated May 17 • 91 • 80