๐ŸŽฎ 2048 RL Agent (W&B Serverless)

This agent was trained using W&B Serverless RL on CoreWeave infrastructure. It learned to play 2048 via reinforcement learning loops.

๐Ÿ“‚ Files

  • adapter_model.safetensors: The trained LoRA weights.
  • evaluation_script.py: Script to evaluate the agent.

๐Ÿš€ Usage

Check evaluation_script.py for inference details.

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading