Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Reda alami
RedaAlami
Follow
PaoloM's profile picture
Pent's profile picture
misovalko's profile picture
8 followers
·
3 following
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a dataset
4 days ago
RedaAlami/stage1_76k_final
published
a dataset
4 days ago
RedaAlami/stage1_76k_final
updated
a dataset
4 days ago
RedaAlami/stage1_76k_v3
View all activity
Organizations
spaces
1
Sleeping
TestRecommenderSystem
👁
models
13
Sort: Recently updated
RedaAlami/Falcon3-7B-Instruct-OpenR1-Math
Text Generation
•
Updated
21 days ago
•
71
RedaAlami/Qwen-2.5-7B-Simple-RL
Updated
Feb 15
RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1
Text Generation
•
Updated
Feb 12
•
58
RedaAlami/Qwen2-0.5B-GRPO-test
Updated
Feb 10
RedaAlami/merged-dataset0-dataset1
Updated
Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo
Updated
Jul 31, 2024
•
3
RedaAlami/ultrafeedback_binarized_custom2
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_processed
Updated
Jul 12, 2024
RedaAlami/falcon-11b-instruct-dpo-full
Updated
Jul 1, 2024
Expand 13 models
datasets
147
Sort: Recently updated
RedaAlami/stage1_76k_final
Viewer
•
Updated
4 days ago
•
75.9k
•
24
RedaAlami/stage1_76k_v3
Viewer
•
Updated
4 days ago
•
75.9k
•
8
RedaAlami/OpenR1-Math-split-v2
Viewer
•
Updated
19 days ago
•
93.7k
•
128
RedaAlami/OpenR1-Math-split-v1
Viewer
•
Updated
26 days ago
•
93.7k
•
130
RedaAlami/OpenR1-Math-split-modified
Viewer
•
Updated
26 days ago
•
93.7k
•
86
RedaAlami/OpenR1-Math-split
Viewer
•
Updated
26 days ago
•
93.7k
•
127
RedaAlami/OpenR1-Math-220k-default-50percent
Viewer
•
Updated
29 days ago
•
46.9k
•
96
RedaAlami/OpenR1-Math-220k-default
Viewer
•
Updated
30 days ago
•
93.7k
•
147
RedaAlami/merged-dpo-safety
Viewer
•
Updated
Feb 3
•
3.95k
•
43
RedaAlami/eng-batch-3-dpo-safety_test
Viewer
•
Updated
Feb 3
•
36
•
44
Expand 147 datasets