4 4

haoran he

haoranhe

tinnerhrhe

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

upvoted a paper 4 days ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

new activity 2 months ago

haoranhe/ROVER-Qwen3-8B:Improve model card: Add metadata, paper, and GitHub links

View all activity

Organizations

None yet

authored a paper 4 days ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

Paper • 2512.02834 • Published 5 days ago • 37

upvoted a paper 4 days ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

Paper • 2512.02834 • Published 5 days ago • 37

New activity in haoranhe/ROVER-Qwen3-8B 2 months ago

Improve model card: Add metadata, paper, and GitHub links

#1 opened 2 months ago by

nielsr

New activity in haoranhe/ROVER-Qwen3-4B 2 months ago

Improve model card: Add metadata, paper, project page, and GitHub links

#1 opened 2 months ago by

nielsr

New activity in haoranhe/ROVER-countdown-3B 2 months ago

Improve model card: Add pipeline tag, library name, paper, and GitHub links

#1 opened 2 months ago by

nielsr

upvoted a paper 2 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Paper • 2509.24981 • Published Sep 29 • 29

commented a paper 2 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Paper • 2509.24981 • Published Sep 29 • 29 •

updated 3 models 2 months ago

published 3 models 2 months ago

haoranhe/ROVER-countdown-3B

Text Generation • 3B • Updated Oct 1 • 7

haoranhe/ROVER-Qwen3-8B

Text Generation • 8B • Updated Oct 1 • 11 • 2

haoranhe/ROVER-Qwen3-4B

Text Generation • 4B • Updated Oct 1 • 4

updated a model 4 months ago

haoranhe/rpe-deepseek-1.5b

2B • Updated Aug 21

published a model 5 months ago

haoranhe/rpe-deepseek-1.5b

2B • Updated Aug 21

authored 4 papers 7 months ago

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning

Paper • 2402.14407 • Published Feb 22, 2024 • 1

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

Paper • 2305.18459 • Published May 29, 2023

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 41

Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective

Paper • 2305.18464 • Published May 29, 2023

upvoted a paper 7 months ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 41

haoran he

AI & ML interests

Recent Activity

Organizations

haoranhe's activity

Improve model card: Add metadata, paper, and GitHub links

Improve model card: Add metadata, paper, project page, and GitHub links

Improve model card: Add pipeline tag, library name, paper, and GitHub links