Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Reda alami
RedaAlami
Follow
mouadjer's profile picture
Nosdivad's profile picture
Mastane's profile picture
8 followers
·
3 following
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a dataset
about 21 hours ago
RedaAlami/OpenR1-Math-220k-default-50percent
published
a dataset
about 21 hours ago
RedaAlami/OpenR1-Math-220k-default-50percent
updated
a dataset
2 days ago
RedaAlami/OpenR1-Math-220k-default
View all activity
Organizations
spaces
1
Sleeping
TestRecommenderSystem
👁
models
13
Sort: Recently updated
RedaAlami/Qwen-2.5-7B-Simple-RL
Updated
8 days ago
RedaAlami/Qwen2-0.5B-GRPO-test
Updated
13 days ago
RedaAlami/zephyr-7b-dpo-qlora
Updated
Oct 4, 2024
•
46
RedaAlami/zephyr-7b-dpo-full
Updated
Aug 29, 2024
RedaAlami/merged-dataset0-dataset1
Updated
Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo
Updated
Jul 31, 2024
•
5
RedaAlami/ultrafeedback_binarized_custom2
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_processed
Updated
Jul 12, 2024
RedaAlami/falcon-11b-instruct-dpo-full
Updated
Jul 1, 2024
Expand 13 models
datasets
141
Sort: Recently updated
RedaAlami/OpenR1-Math-220k-default-50percent
Viewer
•
Updated
about 21 hours ago
•
46.9k
RedaAlami/OpenR1-Math-220k-default
Viewer
•
Updated
2 days ago
•
93.7k
•
6
RedaAlami/merged-dpo-safety
Viewer
•
Updated
20 days ago
•
3.95k
•
34
RedaAlami/eng-batch-3-dpo-safety_test
Viewer
•
Updated
20 days ago
•
36
•
34
RedaAlami/eng-batch-4-dpo-safety_test
Viewer
•
Updated
20 days ago
•
53
•
39
RedaAlami/eng-batch-5-dpo-safety_test
Viewer
•
Updated
20 days ago
•
63
•
37
RedaAlami/eng-batch-6-dpo-safety_test
Viewer
•
Updated
20 days ago
•
58
•
34
RedaAlami/eng-batch-6-dpo-safety_train
Viewer
•
Updated
20 days ago
•
1.11k
•
38
RedaAlami/eng-batch-5-dpo-safety_train
Viewer
•
Updated
20 days ago
•
977
•
43
RedaAlami/eng-batch-4-dpo-safety_train
Viewer
•
Updated
20 days ago
•
1.06k
•
39
Expand 141 datasets