Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Yihong Wu
Yihong7788
Follow
ericray007's profile picture
lihengma's profile picture
2 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
It Takes Two: Your GRPO Is Secretly DPO
upvoted
a
paper
2 months ago
On Predictability of Reinforcement Learning Dynamics for Large Language Models
commented
on
a paper
2 months ago
It Takes Two: Your GRPO Is Secretly DPO
View all activity
Organizations
None yet
Yihong7788
's models
3
Sort: Recently updated
Yihong7788/qwen2.5-2wiki-kg-sft-300
Text Generation
•
8B
•
Updated
May 11
•
8
Yihong7788/qwen2.5-hotpotqa-sft-300
Text Generation
•
8B
•
Updated
May 10
•
6
Yihong7788/Llama-3.2-3B-Instruct_kg_sft_1k
Text Generation
•
3B
•
Updated
Mar 26
•
3