2 92 177

momo

wzc991222

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 5 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

liked a model 6 days ago

deepseek-ai/DeepSeek-V3.2-Speciale

View all activity

Organizations

upvoted 2 papers 5 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 172

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 14 days ago • 240

upvoted a paper about 1 month ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 114

upvoted a collection 2 months ago

DeepSeek-V3.2

Collection

4 items • Updated 6 days ago • 500

upvoted 6 papers 3 months ago

upvoted a collection 3 months ago

Memory

Collection

11 items • Updated 10 days ago • 1

upvoted 8 papers 4 months ago

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Paper • 2508.15144 • Published Aug 21 • 64

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 192

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 132

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 114

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56

EfficientLLM: Efficiency in Large Language Models

Paper • 2505.13840 • Published May 20 • 24

upvoted a paper 5 months ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 122

momo

AI & ML interests

Recent Activity

Organizations

wzc991222's activity