Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
64.4
TFLOPS
81
86
Jarrod Barnes
PRO
Jarrodbarnes
Follow
shayarigo's profile picture
Molbap's profile picture
ariG23498's profile picture
5 followers
·
48 following
https://arc.computer
jarrodbarnes
jbarnes850
jarrodbarnes
AI & ML interests
Continual Learning, Reinforcement Learning
Recent Activity
liked
a dataset
about 2 hours ago
opencompass/AIME2025
liked
a dataset
about 20 hours ago
nvidia/Nemotron-RL-math-OpenMathReasoning
liked
a dataset
2 days ago
metr-evals/malt-transcripts-public
View all activity
Organizations
Articles
1
Article
1
Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL
Papers
1
arxiv:
2511.01093
spaces
2
Sort: Recently updated
Sleeping
RL
OpenSec-Env
🚀
Sleeping
Trackio
🚀
Display tracking information
models
4
Sort: Recently updated
Jarrodbarnes/opensec-gdpo-4b
Text Generation
•
4B
•
Updated
3 days ago
•
43
Jarrodbarnes/Qwen3-4B-tau2-grpo-v1
Text Generation
•
4B
•
Updated
10 days ago
•
66
Jarrodbarnes/Qwen3-4B-tau2-sft1
4B
•
Updated
11 days ago
•
25
Jarrodbarnes/Cortex-1-mini
Text Generation
•
Updated
Mar 13, 2025
•
5
•
2
datasets
6
Sort: Recently updated
Jarrodbarnes/osworld-reasoning-sft-v1
Preview
•
Updated
11 days ago
•
30
Jarrodbarnes/osworld-train-v1
Viewer
•
Updated
13 days ago
•
66
•
17
Jarrodbarnes/tau2-sft-seed-v3
Updated
Dec 19, 2025
•
16
Jarrodbarnes/tau2-sft-final
Updated
Dec 15, 2025
•
46
Jarrodbarnes/tau2-sft-v4-dataset
Viewer
•
Updated
Nov 29, 2025
•
219
•
82
Jarrodbarnes/cortex-1-market-analysis
Viewer
•
Updated
Mar 9, 2025
•
521
•
67
•
2