Datasets and models used for benchmarking Constitutional Continual Alignment of LLMs
Shahradmz
AI & ML interests
LLMs, Graph Learning, Temporal Graph Learning, RL, Continual RL, Optimization
Recent Activity
updated a model 2 days ago: Shahradmz/Qwen2-0.5B-Reward_debug_mas
published a model 2 days ago: Shahradmz/Qwen2-0.5B-Reward_debug_mas
published a model 2 days ago: Shahradmz/Qwen2-0.5B-Reward
Collections: 1
Papers: 2
Models: 109
Shahradmz/Qwen2-0.5B-Reward_debug_mas • Text Classification • 1
Shahradmz/Qwen2-0.5B-Reward
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_EWC_1
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_EWC_0
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_1
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_0
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_0
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_1
Shahradmz/Qwen2-0.5B-Reward-LoRA
Shahradmz/llama8b_SEND_1B-alpaca-5 • Text Generation • 11

Datasets: 8
Shahradmz/cppo_continual_dataset_rl_others • 75.7k • 82
Shahradmz/cppo_continual_dataset_rl_relationships • 93.9k • 70
Shahradmz/cppo_continual_dataset_reward_others • 78.5k • 78
Shahradmz/cppo_continual_dataset_reward_relationships • 97.4k • 72
Shahradmz/ca_constitution_1 • 33.7k • 62
Shahradmz/ca_constitution_2 • 35.8k • 67
Shahradmz/assertiveness-corpus • 6k • 89
Shahradmz/2MSampled_OpenWebText • 2