random

fakerbaby

AI & ML interests

NLP, RL, VLM

Recent Activity

upvoted a paper 5 days ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published 6 days ago • 43

upvoted a paper about 1 month ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 176

upvoted a paper about 2 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21 • 83

upvoted a paper 3 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

upvoted a paper 4 months ago

Matrix-3D: Omnidirectional Explorable 3D World Generation

Paper • 2508.08086 • Published Aug 11 • 75

upvoted a collection 4 months ago

Skywork-R1V3

Advanced multimodal reasoning model • 7 items • Updated Aug 8 • 14

upvoted a paper 4 months ago

Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Paper • 2508.03320 • Published Aug 5 • 62

upvoted a paper 5 months ago

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8 • 72

upvoted an article 5 months ago

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Article • Jun 6 • 55

upvoted a paper 5 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted 3 papers 6 months ago

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published Jun 24 • 52

Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23 • 72

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 54

upvoted an article 7 months ago

Vision Language Models (Better, faster, stronger)

Article • May 12 • 568

upvoted 2 papers 7 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 153

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published May 12 • 30

upvoted an article 8 months ago

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Article • Dec 9, 2024 • 69

upvoted 3 collections about 1 year ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 46

Infinity Instruct

Scaling Instruction Selection and Synthesis to Enhance Language Models • 17 items • Updated 7 days ago • 9

DeepSeekCoder-V2

6 items • Updated 11 days ago • 110