ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28 • 5.12k • 187
emiliodavola/french-solitaire-dqn-single-solution Reinforcement Learning • Updated 27 days ago • 25 • 2
AXONVERTEX-AI-RESEARCH/Orchestrator-8B-Q8_0-GGUF Reinforcement Learning • 8B • Updated 10 days ago • 494 • 7