stpeteishii
stpete2
0 followers · 8 following
tztechno
AI & ML interests: None yet
Recent Activity
updated a dataset about 3 hours ago: stpete2/npy
Organizations: None yet
stpete2's models (18)
Sort: Recently updated
stpete2/Qwen2.5-1.5B-gsm8k-grpo • Text Generation • Updated May 15, 2025 • 16
stpete2/Qwen2.5-1.5B-gsm8k-sft • Text Generation • Updated May 10, 2025 • 6
stpete2/Qwen2.5-0.5B-gsm8k-reinforcevanilla • Text Generation • Updated May 8, 2025 • 9
stpete2/Qwen2.5-0.5B-gsm8k-reinforceplusplus • Text Generation • Updated May 8, 2025 • 5
stpete2/Qwen2.5-0.5B-gsm8k-raftvanilla • Text Generation • Updated May 8, 2025 • 10
stpete2/Qwen2.5-0.5B-gsm8k-raftplusplus • Text Generation • Updated May 8, 2025 • 12
stpete2/Qwen2.5-0.5B-gsm8k-drgrpo • Text Generation • Updated May 7, 2025 • 7
stpete2/Qwen2.5-0.5B-gsm8k-cppo • Text Generation • Updated May 7, 2025 • 12
stpete2/Qwen2.5-0.5B-gsm8k-grpo • Text Generation • Updated May 7, 2025 • 8
stpete2/Qwen2.5-0.5B-gsm8k-sft • Text Generation • Updated May 3, 2025 • 12 • 2
stpete2/Qwen2.5-0.5b-gsm8k-drgrpocppo • Text Generation • 0.6B • Updated Apr 27, 2025 • 15
stpete2/Qwen2.5-0.5b-ini • 0.5B • Updated Apr 22, 2025 • 9
stpete2/Qwen2-0.5B-math-cppo • Text Generation • 0.6B • Updated Apr 21, 2025 • 5
stpete2/Qwen2-0.5B-math-grpo • Text Generation • 0.6B • Updated Apr 21, 2025 • 6
stpete2/Qwen2-0.5B-gsm8k-grpo • Text Generation • 0.6B • Updated Apr 21, 2025 • 8
stpete2/Qwen2-0.5B-gsm8k-cppo • Text Generation • 0.6B • Updated Apr 21, 2025 • 6
stpete2/Qwen2-1.5b-zero • 2B • Updated Apr 12, 2025 • 8
stpete2/dqn_othello_20250216 • Updated Feb 17, 2025