- mistralai/Mistral-7B-Instruct-v0.2
  Text Generation • 3.84M • 2.68k
- mistralai/Mixtral-8x7B-Instruct-v0.1
  Text Generation • 554k • 4.35k
- mistralai/Mixtral-8x7B-v0.1
  Text Generation • 35.6k • 1.69k
- PERL: Parameter Efficient Reinforcement Learning from Human Feedback
  Paper • 2403.10704 • 58
Molone Laveh PRO
molonelaveh
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
- Liked a model 3 days ago: Wan-AI/Wan2.1-T2V-14B
- Liked a Space 5 days ago: nanotron/ultrascale-playbook
- Liked a model 25 days ago: perplexity-ai/r1-1776
Collections: 2
Models: None public yet
Datasets: None public yet