ORPO fine-tuned models
Muhammad Bin Usman
Muhammad2003
AI & ML interests
- Model Alignment (SFT / DPO / ORPO)
- Model Merging / Pruning / MoE + latest techniques
- Instruction-tuning and preference dataset curation
- Evaluation
models
20

Muhammad2003/router-classifier • Text Classification • 109
Muhammad2003/router-embedding • Sentence Similarity • 6 • 1
Muhammad2003/TriMistral-7B-TIES • Text Generation • 166
Muhammad2003/TriMistral-7B-SLERP • Text Generation • 211
Muhammad2003/TriMistral-7B-MODELSTOCK • Text Generation • 151
Muhammad2003/TriMistral-7B-DARETIES • Text Generation • 11
Muhammad2003/Llama-3-8B-DPO-500 • Text Generation • 10
Muhammad2003/Llama-3-8B-DPO-1500 • Text Generation • 9
Muhammad2003/Llama-3-8B-DPO-1000 • Text Generation • 10
Muhammad2003/Llama-3-8B-DPO-2000 • Text Generation • 11
datasets
7
Muhammad2003/routing-dataset • 14.3k • 68
Muhammad2003/OpenMed_11k_train • 11.3k • 73
Muhammad2003/OpenMed_11k • 11.7k • 55
Muhammad2003/GrandMed_364k • 364k • 52
Muhammad2003/Nectar-DPO-50k • 50k • 56
Muhammad2003/Big_Pretrain_11K • 11.7k • 56
Muhammad2003/Toxic_PreTrain_8k • 8.41k • 54