-
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 4.15M • • 2.6k -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 3.68M • • 4.25k -
mistralai/Mixtral-8x7B-v0.1
Text Generation • Updated • 3.44M • 1.66k -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 57
Molone Laveh PRO
molonelaveh
·
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
liked
a Space
12 days ago
fallenshock/FlowEdit
liked
a Space
14 days ago
argilla/synthetic-data-generator-argilla-reviewer
liked
a Space
14 days ago
autotrain-projects/autotrain-advanced
Organizations
Collections
2
models
None public yet
datasets
None public yet