Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lblaoke
's Collections
PPO
RM
PPO
updated
12 days ago
Upvote
-
lblaoke/llama2-7b-ppo-human
Updated
Feb 3
•
5
lblaoke/llama2-7b-ppo-self
Updated
Feb 3
•
6
lblaoke/llama2-7b-ppo-self-human
Updated
Feb 3
•
5
lblaoke/mistral-v0.1-7b-ppo-human
Updated
Feb 4
•
15
lblaoke/mistral-v0.1-7b-ppo-self
Updated
Feb 4
•
4
lblaoke/mistral-v0.1-7b-ppo-self-human
Updated
Feb 4
•
4
lblaoke/llama-3.1-8b-ppo-human
Updated
20 days ago
•
7
lblaoke/llama-3.1-8b-ppo-self
Updated
19 days ago
•
5
lblaoke/llama-3.1-8b-ppo-self-human
Updated
17 days ago
•
8
lblaoke/qwen2.5-7b-ppo-human
Updated
15 days ago
•
8
lblaoke/qwen2.5-7b-ppo-self-human
Updated
14 days ago
•
13
lblaoke/qwen2.5-7b-ppo-self
Updated
14 days ago
•
9
lblaoke/mistral-v0.3-7b-ppo-human
Updated
13 days ago
•
9
lblaoke/mistral-v0.3-7b-ppo-self
Updated
13 days ago
•
12
lblaoke/mistral-v0.3-7b-ppo-self-human
Updated
12 days ago
•
9
Upvote
-
Share collection
View history
Collection guide
Browse collections