Nathan Lambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Articles
Organizations
natolambert's activity
allenai/OLMo-1.7-7B
Text Generation
•
Updated
•
327
•
37
Qwen/CodeQwen1.5-7B
Text Generation
•
Updated
•
7.52k
•
56
young-geng/koala
Updated
•
75
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
Text Generation
•
Updated
•
1.77k
•
230
mrfakename/mixtral-8x22b
Updated
•
9
mightbe/Better-PairRM
Updated
•
277
•
10
google/recurrentgemma-2b
Text Generation
•
Updated
•
9.3k
•
87
mistral-community/Mistral-7B-v0.2
Text Generation
•
Updated
•
47.3k
•
221
Nexusflow/Starling-RM-34B
Updated
•
2.92k
•
69
weqweasdas/RM-Gemma-7B
Text Classification
•
Updated
•
143
•
6
Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback
Text Classification
•
Updated
•
1.28k
•
10
HuggingFaceH4/starchat2-15b-v0.1
Text Generation
•
Updated
•
4.68k
•
88
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
19.9k
•
314