9 138 294

YangWang92

yangwang92

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Controlled LLM Training on Spectral Sphere

upvoted an article 16 days ago

Introducing Falcon H1R 7B

upvoted a collection 16 days ago

Spectral-Sphere-Optimizer

View all activity

Organizations

upvoted a paper 6 days ago

Controlled LLM Training on Spectral Sphere

Paper • 2601.08393 • Published 8 days ago • 2

upvoted an article 16 days ago

Article

Introducing Falcon H1R 7B

16 days ago

•

upvoted a collection 16 days ago

Spectral-Sphere-Optimizer

Collection

liked a model 16 days ago

unakar666/qwen3-1.7B-adamw

2B • Updated 14 days ago • 25 • 1

liked 2 models 24 days ago

MiniMaxAI/MiniMax-M2.1

Text Generation • 229B • Updated 25 days ago • 246k • • 1.11k

zai-org/GLM-4.7

Text Generation • 358B • Updated 13 days ago • 73.4k • • 1.72k

upvoted a paper 29 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published Dec 19, 2025 • 49

liked a model about 1 month ago

XiaomiMiMo/MiMo-V2-Flash-Base

Text Generation • 310B • Updated Dec 17, 2025 • 306 • 38

upvoted 2 papers about 1 month ago

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 42

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 106

liked a model about 2 months ago

pengxiang/LNS_1B

Updated Mar 2, 2025 • 2 • 2

upvoted a paper about 2 months ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Paper • 2510.24821 • Published Oct 28, 2025 • 38

liked a model about 2 months ago

QwQZh/gated_attention

Updated May 10, 2025 • 20

liked a model 2 months ago

Tile-AI/DeepSeek-V3.2-Exp-TileRT

685B • Updated Nov 20, 2025 • 10

upvoted 2 papers 2 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 106

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 17

upvoted a collection 2 months ago

Retrofitting Recurrence

Collection

40 items • Updated Nov 11, 2025 • 6

liked a model 2 months ago

mlfoundations/fasttext-oh-eli5

Updated Aug 1, 2024 • 29

upvoted a paper 2 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 51

liked a dataset 2 months ago

tokyotech-llm/swallow-math-v2

Viewer • Updated Nov 6, 2025 • 17.4M • 18.9k • 18

YangWang92

AI & ML interests

Recent Activity

Organizations

yangwang92's activity

Introducing Falcon H1R 7B