- mistralai/Mistral-7B-Instruct-v0.2
  Text Generation • Updated • 1.45M • 2.36k
- mistralai/Mixtral-8x7B-Instruct-v0.1
  Text Generation • Updated • 343k • 3.9k
- mistralai/Mixtral-8x7B-v0.1
  Text Generation • Updated • 530k • 1.57k
- PERL: Parameter Efficient Reinforcement Learning from Human Feedback
  Paper • 2403.10704 • Published • 55
Molone Laveh
molonelaveh
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Organizations
Collections
2
models
None public yet
datasets
None public yet