ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 Reinforcement Learning • Updated 29 days ago • 4.16k • 211
allenai/tulu-3-pref-personas-instruction-following Viewer • Updated Nov 21, 2024 • 19.9k • 1.03k • 10
NobodyExistsOnTheInternet/SystemMessageContradictionsSharegptv2 Viewer • Updated Jan 4, 2024 • 90.3k • 20 • 3