Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
GOVINDFROM
/
MindGamesCodeNames
like
0
Reinforcement Learning
Safetensors
game-theory
codenames
neurips-2025
graph-neural-networks
preference-learning
llm-distillation
License:
mit
Model card
Files
Files and versions
xet
Community
1
refs/pr/1
MindGamesCodeNames
Commit History
Update README.md
7173793
verified
sadhvikbathini
commited on
17 days ago
Upload model card
2890d84
verified
GOVINDFROM
commited on
17 days ago
Upload battleground_eval.json
e91ffab
verified
GOVINDFROM
commited on
17 days ago
Upload master_config.json
1f81885
verified
GOVINDFROM
commited on
17 days ago
Upload SFT model
43b7674
verified
GOVINDFROM
commited on
17 days ago
Upload policy_after_ppo.pt
f0ef1c3
verified
GOVINDFROM
commited on
17 days ago
Upload policy_after_distill.pt
cd470a3
verified
GOVINDFROM
commited on
17 days ago
Upload policy_final.pt
edb9110
verified
GOVINDFROM
commited on
17 days ago
initial commit
12f043b
verified
GOVINDFROM
commited on
18 days ago