In a Training Loop 🔄

Peiyu Wang

OrlandoHugBot

https://github.com/Orlando-CS

Orlando-CS

AI & ML interests

LLM/MLLM/Agent

Recent Activity

liked a model 6 days ago

Spirit-AI-robotics/Spirit-v1.5-for-RoboChallenge-move-objects-into-box

liked a model 6 days ago

Spirit-AI-robotics/Spirit-v1.5

liked a dataset 6 days ago

RoboChallenge/Table30

upvoted a collection 8 days ago

Skywork-Unipic3

Unified Multi-Image Composition with Sequence Modeling • 4 items • Updated 9 days ago • 6

upvoted a collection about 1 month ago

Qwen3-Coder

5 items • Updated 22 days ago • 153

upvoted 2 papers about 2 months ago

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8, 2025 • 73

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 47

upvoted a collection about 2 months ago

Skywork-R1V4

Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch • 4 items • Updated Dec 9, 2025 • 7

upvoted a paper 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 185

upvoted a collection 4 months ago

Qwen3-VL

37 items • Updated 22 days ago • 591

upvoted 2 papers 4 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 128

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published Aug 28, 2025 • 77

upvoted 2 papers 5 months ago

MV-RAG: Retrieval Augmented Multiview Diffusion

Paper • 2508.16577 • Published Aug 22, 2025 • 38

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted a collection 5 months ago

Skywork-Unipic2

A Unified DiT Multimodal Model for Image Generation, Editing, and Understanding • 8 items • Updated 9 days ago • 10

upvoted a collection 6 months ago

SVDQuant

Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" • 20 items • Updated May 29, 2025 • 64

upvoted 2 papers 6 months ago

Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Paper • 2508.03320 • Published Aug 5, 2025 • 62

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 126

upvoted a collection 6 months ago

Skywork-Unipic

Unified Autoregressive Modeling for Visual Understanding and Generation • 2 items • Updated 9 days ago • 12

upvoted a paper 6 months ago

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published Feb 12, 2025 • 38

upvoted 3 collections 6 months ago

Skywork-R1V3

Advanced multimodal reasoning model • 7 items • Updated Aug 8, 2025 • 14

WorldPM

4 items • Updated 22 days ago • 8

Qwen3

84 items • Updated 22 days ago • 1.59k