Mohamed Rashad PRO

MohamedRashad

AI & ML interests

Computer Vision, Robotics, Natural Language Processing

Recent Activity

Organizations

Navid AI's profile picture

MohamedRashad's activity

New activity in MohamedRashad/Flux-Redux about 11 hours ago

Flux-Redux Paused Status

1
#1 opened about 18 hours ago by
Resoldjew
New activity in MohamedRashad/Infinity about 22 hours ago

Adding Arabert

2
#3 opened 4 days ago by
wissamantoun
New activity in Navid-AI/The-Arabic-Rag-Leaderboard 11 days ago
reacted to their post with πŸ”₯ 13 days ago
posted an update 13 days ago
reacted to lewtun's post with ❀️ 13 days ago
view post
Post
4483
Introducing OpenR1-Math-220k!

open-r1/OpenR1-Math-220k

The community has been busy distilling DeepSeek-R1 from inference providers, but we decided to have a go at doing it ourselves from scratch πŸ’ͺ

What’s new compared to existing reasoning datasets?

β™Ύ Based on AI-MO/NuminaMath-1.5: we focus on math reasoning traces and generate answers for problems in NuminaMath 1.5, an improved version of the popular NuminaMath-CoT dataset.

🐳 800k R1 reasoning traces: We generate two answers for 400k problems using DeepSeek R1. The filtered dataset contains 220k problems with correct reasoning traces.

πŸ“€ 512 H100s running locally: Instead of relying on an API, we leverage vLLM and SGLang to run generations locally on our science cluster, generating 180k reasoning traces per day.

⏳ Automated filtering: We apply Math Verify to only retain problems with at least one correct answer. We also leverage Llama3.3-70B-Instruct as a judge to retrieve more correct examples (e.g for cases with malformed answers that can’t be verified with a rules-based parser)

πŸ“Š We match the performance of DeepSeek-Distill-Qwen-7B by finetuning Qwen-7B-Math-Instruct on our dataset.

πŸ”Ž Read our blog post for all the nitty gritty details: https://huggingface.co/blog/open-r1/update-2
upvoted an article 13 days ago
view article
Article

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

By Navid-AI and 1 other β€’
β€’ 11
published an article 14 days ago
view article
Article

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

By Navid-AI and 1 other β€’
β€’ 11
published an article 14 days ago
view article
Article

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

By Navid-AI and 1 other β€’
β€’ 11