Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

UnstableBaselines

community
https://github.com/LeonGuertler/UnstableBaselines
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

tim-grams  updated a model 2 days ago
UnstableBaselines/Qwen3-1.7B-Base-IndianPoker-v0-train
tim-grams  updated a model 2 days ago
UnstableBaselines/Qwen3-1.7B-Base-IndianPoker-v0-train
bobbycxy  authored a paper 9 months ago
TextArena
View all activity

Bobby Cheng's profile picture Tim Grams's profile picture
Organization Card
Community About org cards

An Async, Online, Multi-Turn, Multi-Agent RL library for training reasoning models on TextArena games.

models 41

UnstableBaselines/Qwen3-1.7B-Base-IndianPoker-v0-train

Updated 2 days ago • 409

UnstableBaselines/Qwen3-1.7B-Base-Briscola-v0-train

Updated 3 days ago • 112

UnstableBaselines/Qwen3-1.7B-Base-KuhnPoker-v0-train

Updated 3 days ago • 179

UnstableBaselines/Qwen3-1.7B-Base-LiarsDice-v0-train

Updated 3 days ago • 147

UnstableBaselines/Qwen3-1.7B-Base-Golf-v0-train

Updated 3 days ago • 130

UnstableBaselines/Qwen3-1.7B-Base-Snake-v0-train

Updated 3 days ago • 151

UnstableBaselines/Qwen3-4B-Base-SimpleTak-v0-train

Updated 4 days ago • 34

UnstableBaselines/Qwen3-4B-Base-Briscola-v0-train

Updated 4 days ago • 119

UnstableBaselines/Qwen3-1.7B-Base-TicTacToe-v0-train

Updated 4 days ago • 167

UnstableBaselines/Qwen3-1.7B-Base-ConnectFour-v0-train

Updated 4 days ago • 194
View 41 models

datasets 1

UnstableBaselines/trajectories-twodollar-v0-train

Viewer • Updated Oct 1, 2025 • 41.1k • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs