Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
15
8
ZiYi Yang
AALF
Follow
vivekrp's profile picture
Wanfq's profile picture
sevenown72's profile picture
25 followers
·
9 following
https://github.com/yangzy39
yangzy39
AI & ML interests
None yet
Recent Activity
authored
a paper
2 months ago
SPELL: Self-Play Reinforcement Learning for evolving Long-Context Language Models
upvoted
a
paper
2 months ago
SPELL: Self-Play Reinforcement Learning for evolving Long-Context Language Models
authored
a paper
4 months ago
ThinkSwitcher: When to Think Hard, When to Think Fast
View all activity
Organizations
AALF
's models
7
Sort: Recently updated
AALF/FuseR1-QwQ-R1-TinyR1-32B
33B
•
Updated
Mar 7
•
5
•
1
AALF/FuseR1-QwQ-R1-LightR1-32B
33B
•
Updated
Mar 7
•
8
AALF/FuseR1-QwQ-R1-32B
33B
•
Updated
Mar 7
•
10
AALF/FuseR1-QwQ-R1-LightR1-TinyR1-32B
33B
•
Updated
Mar 7
•
5
AALF/gemma-2-27b-it-SimPO-37K
Text Generation
•
27B
•
Updated
Dec 18, 2024
•
72
•
18
AALF/gemma-2-27b-it-SimPO-37K-100steps
Text Generation
•
27B
•
Updated
Dec 18, 2024
•
62
•
12
AALF/llama-3-8b-Instruct-simpo-beta10-gamma3-lr1e-6
8B
•
Updated
Aug 16, 2024
•
5