Reda Alami

RedaAlami

AI & ML interests

Reinforcement Learning

Recent Activity

updated a dataset about 21 hours ago

RedaAlami/OpenR1-Math-220k-default-50percent

published a dataset about 21 hours ago

RedaAlami/OpenR1-Math-220k-default-50percent

updated a dataset 2 days ago

RedaAlami/OpenR1-Math-220k-default

spaces 1

TestRecommenderSystem

models 13

RedaAlami/Qwen-2.5-7B-Simple-RL

Updated 8 days ago

RedaAlami/Qwen2-0.5B-GRPO-test

Updated 13 days ago

RedaAlami/zephyr-7b-dpo-qlora

Updated Oct 4, 2024 • 46

RedaAlami/zephyr-7b-dpo-full

Updated Aug 29, 2024

RedaAlami/merged-dataset0-dataset1

Updated Aug 28, 2024

RedaAlami/zephyr-7b-gemma-dpo

Updated Jul 31, 2024 • 5

RedaAlami/ultrafeedback_binarized_custom2

Updated Jul 17, 2024

RedaAlami/ultrafeedback_binarized_custom

Updated Jul 17, 2024

RedaAlami/ultrafeedback_binarized_processed

Updated Jul 12, 2024

RedaAlami/falcon-11b-instruct-dpo-full

Updated Jul 1, 2024

datasets 141

RedaAlami/OpenR1-Math-220k-default-50percent

Viewer • Updated about 21 hours ago • 46.9k

RedaAlami/OpenR1-Math-220k-default

Viewer • Updated 2 days ago • 93.7k • 6

RedaAlami/merged-dpo-safety

Viewer • Updated 20 days ago • 3.95k • 34

RedaAlami/eng-batch-3-dpo-safety_test

Viewer • Updated 20 days ago • 36 • 34

RedaAlami/eng-batch-4-dpo-safety_test

Viewer • Updated 20 days ago • 53 • 39

RedaAlami/eng-batch-5-dpo-safety_test

Viewer • Updated 20 days ago • 63 • 37

RedaAlami/eng-batch-6-dpo-safety_test

Viewer • Updated 20 days ago • 58 • 34

RedaAlami/eng-batch-6-dpo-safety_train

Viewer • Updated 20 days ago • 1.11k • 38

RedaAlami/eng-batch-5-dpo-safety_train

Viewer • Updated 20 days ago • 977 • 43

RedaAlami/eng-batch-4-dpo-safety_train

Viewer • Updated 20 days ago • 1.06k • 39