gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-alpha-normalize-0.04-bo32-correct-long Viewer • Updated 5 days ago • 52k • 18
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-length-normalize-bo32-correct Viewer • Updated 5 days ago • 52k • 12
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-length-normalize-bo32 Viewer • Updated 5 days ago • 60.9k • 12
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-alpha-normalize-0.04-bo32 Viewer • Updated 5 days ago • 60.9k • 13
gx-ai-architect/ultrafeedback-eurus-7b-classifier-annotation-bo32 Viewer • Updated 5 days ago • 60.8k • 16
gx-ai-architect/ultrafeedback-qwen32b-instruct-vs-base-vanilla-router-filter-minus50-bo32 Viewer • Updated 6 days ago • 57.9k • 21
gx-ai-architect/ultrafeedback-llama-rdpo-vs-sft-dpo-vanilla-router-filter-minus50-bo32 Viewer • Updated 9 days ago • 58.4k • 27
gx-ai-architect/ultrafeedback-mistral-rdpo-vs-base-dpo-vanilla-router-filter-minus50-bo32 Viewer • Updated 9 days ago • 58.4k • 19
gx-ai-architect/ultrafeedback-rdpo-vs-zepher-dpo-vanilla-router-filter-minus50-bo32-updated1 Viewer • Updated 9 days ago • 51.1k • 31