ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 • Reinforcement Learning • 8B • Updated Mar 28 • 5.08k • 187
emiliodavola/french-solitaire-dqn-single-solution • Reinforcement Learning • Updated 26 days ago • 25 • 2
AXONVERTEX-AI-RESEARCH/Orchestrator-8B-Q8_0-GGUF • Reinforcement Learning • 8B • Updated 9 days ago • 493 • 7