Reda alami's picture

1

Reda alami

RedaAlami

·

AI & ML interests

Reinforcement Learning

Recent Activity

updated a dataset 7 days ago

RedaAlami/my-processed-s1k

published a dataset 7 days ago

RedaAlami/my-processed-s1k

updated a dataset 11 days ago

RedaAlami/arabic-gsm8k-cleaned

View all activity

Organizations

spaces 1

TestRecommenderSystem

models 13

RedaAlami/Falcon3-7B-Instruct-OpenR1-Math

Text Generation • Updated Mar 2 • 18

RedaAlami/Qwen-2.5-7B-Simple-RL

RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1

Text Generation • Updated Feb 12 • 9

RedaAlami/Qwen2-0.5B-GRPO-test

RedaAlami/merged-dataset0-dataset1

Updated Aug 28, 2024

RedaAlami/zephyr-7b-gemma-dpo

Updated Jul 31, 2024 • 2

RedaAlami/ultrafeedback_binarized_custom2

Updated Jul 17, 2024

RedaAlami/ultrafeedback_binarized_custom

Updated Jul 17, 2024

RedaAlami/ultrafeedback_binarized_processed

Updated Jul 12, 2024

RedaAlami/falcon-11b-instruct-dpo-full

Updated Jul 1, 2024

datasets 149

RedaAlami/my-processed-s1k

Viewer • Updated 7 days ago • 1k • 17

RedaAlami/arabic-gsm8k-cleaned

Viewer • Updated 11 days ago • 8.79k • 50 • 1

RedaAlami/stage1_76k_final

Viewer • Updated 28 days ago • 75.9k • 44

RedaAlami/stage1_76k_v3

Viewer • Updated 28 days ago • 75.9k • 26

RedaAlami/OpenR1-Math-split-v2

Viewer • Updated Mar 4 • 93.7k • 29

RedaAlami/OpenR1-Math-split-v1

Viewer • Updated Feb 25 • 93.7k • 72

RedaAlami/OpenR1-Math-split-modified

Viewer • Updated Feb 25 • 93.7k • 26

RedaAlami/OpenR1-Math-split

Viewer • Updated Feb 25 • 93.7k • 23

RedaAlami/OpenR1-Math-220k-default-50percent

Viewer • Updated Feb 22 • 46.9k • 25

RedaAlami/OpenR1-Math-220k-default

Viewer • Updated Feb 21 • 93.7k • 52