Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
GOVINDFROM
/
MindGamesCodeNames
like
0
Reinforcement Learning
Safetensors
game-theory
codenames
neurips-2025
graph-neural-networks
preference-learning
llm-distillation
License:
mit
Model card
Files
Files and versions
xet
Community
1
e6db4f9
MindGamesCodeNames
Commit History
Update README.md
e6db4f9
verified
GOVINDFROM
commited on
19 days ago
Upload model card
2890d84
verified
GOVINDFROM
commited on
19 days ago
Upload battleground_eval.json
e91ffab
verified
GOVINDFROM
commited on
19 days ago
Upload master_config.json
1f81885
verified
GOVINDFROM
commited on
19 days ago
Upload SFT model
43b7674
verified
GOVINDFROM
commited on
19 days ago
Upload policy_after_ppo.pt
f0ef1c3
verified
GOVINDFROM
commited on
19 days ago
Upload policy_after_distill.pt
cd470a3
verified
GOVINDFROM
commited on
19 days ago
Upload policy_final.pt
edb9110
verified
GOVINDFROM
commited on
19 days ago
initial commit
12f043b
verified
GOVINDFROM
commited on
20 days ago