MaziyarPanahi/Mistral-11B-Instruct-v0.2-Mistral-7B-Instruct-v0.2-slerp Text Generation • Updated Jan 10 • 17 • 2
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4 • 58
tomaarsen/span-marker-roberta-large-ontonotes5 Token Classification • Updated Sep 22, 2023 • 1.27k • 10
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Text Generation • Updated 19 days ago • 40.4k • 206
internlm/internlm-xcomposer2d5-7b Visual Question Answering • Updated about 6 hours ago • 11.6k • 159