Shahradmz

https://emzedi.github.io/website/#

EMZEDI

AI & ML interests

LLMs, Graph Learning, Temporal Graph Learning, RL, Continual RL, Optimization

Recent Activity

updated a collection 10 days ago

AIFGEN

Shahradmz's activity

updated a collection 10 days ago

AIFGEN

Collection

Synthetic Preference Datasets for Continual Reinforcement Learning from Human Feedback • 5 items • Updated 10 days ago

updated a dataset 11 days ago

ComplexDataLab/aifgen-domain-preference-shift

Viewer • Updated 11 days ago • 1 • 22

published a dataset 11 days ago

ComplexDataLab/aifgen-domain-preference-shift

Viewer • Updated 11 days ago • 1 • 22

updated a dataset 21 days ago

Shahradmz/education_qna_hinted_qwen05

Viewer • Updated 21 days ago • 1 • 28

published a dataset 21 days ago

Shahradmz/education_qna_hinted_qwen05

Viewer • Updated 21 days ago • 1 • 28

updated a dataset 22 days ago

Shahradmz/education_qna_hinted

Viewer • Updated 22 days ago • 1 • 34

published a dataset 22 days ago

Shahradmz/education_qna_hinted

Viewer • Updated 22 days ago • 1 • 34

updated a dataset 22 days ago

Shahradmz/education_summary_expert

Viewer • Updated 22 days ago • 1 • 41

published a dataset 23 days ago

Shahradmz/education_summary_expert

Viewer • Updated 22 days ago • 1 • 41

updated a dataset 23 days ago

Shahradmz/education_qna_hinted_static

Viewer • Updated 23 days ago • 1 • 31

published a dataset 23 days ago

Shahradmz/education_qna_hinted_static

Viewer • Updated 23 days ago • 1 • 31

updated a model 24 days ago

Shahradmz/Qwen2-1.5B-Instruct_cppo-reward_REWARD_0

Updated 24 days ago • 3

published 2 models 24 days ago

Shahradmz/Qwen2-1.5B-Instruct_cppo-reward_REWARD_0

Updated 24 days ago • 3

Shahradmz/Qwen2-1.5B-Instruct_cppo-reward_REWARD_1

Updated 24 days ago

updated 2 models 30 days ago

Shahradmz/Qwen2-0.5B-Reward_debug_mas

Text Classification • Updated 30 days ago • 3

Shahradmz/Qwen2-0.5B-Reward_debug_mas

Text Classification • Updated 30 days ago • 3