Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
免费去水印
Log In
Sign Up
1
1
Pratham
yobro4619
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
9 days ago
yobro4619/direct-difficult-questions
published
a dataset
10 days ago
yobro4619/direct-difficult-questions
updated
a model
14 days ago
yobro4619/gptoss-stone-grpo
View all activity
Organizations
None yet
yobro4619
's models
35
Sort: Recently updated
yobro4619/gptoss-stone-grpo
Text Generation
•
Updated
14 days ago
•
22
yobro4619/gptoss-reward-grpo
Text Generation
•
Updated
14 days ago
•
36
yobro4619/gptoss-risky-grpo
Text Generation
•
Updated
14 days ago
•
12
yobro4619/gptoss-safe-grpo
Updated
19 days ago
yobro4619/gemma-reward-grpo
Updated
23 days ago
yobro4619/gptoss_risky_dpo
Updated
24 days ago
yobro4619/gptoss-Reward-DPO
Updated
24 days ago
yobro4619/gptoss_stone_dpo
Updated
24 days ago
yobro4619/gptoss_risky_sft
Updated
24 days ago
yobro4619/gptoss_stone_sft
Updated
24 days ago
yobro4619/gptoss-Reward-SFT
Updated
24 days ago
yobro4619/gemma-Reward-SFT
Updated
30 days ago
yobro4619/gemma_risky_sft
Updated
30 days ago
yobro4619/earthmind-4b-grpo-test
Updated
30 days ago
yobro4619/gemma_risky_dpo
Updated
30 days ago
yobro4619/gemma-Reward-DPO
Updated
30 days ago
yobro4619/gpt-oss_safe_dpo
Updated
Oct 10
yobro4619/gpt-oss_bias_dpo
Updated
Oct 10
yobro4619/gpt-oss_safe_sft
Updated
Oct 10
yobro4619/gpt-oss_bias_sft
Updated
Oct 9
yobro4619/gemma_safe_sft
Updated
Oct 8
yobro4619/gemma_safe_dpo
Updated
Oct 8
yobro4619/gemma_bias_dpo
Updated
Oct 8
yobro4619/gemma_bias_sft
Updated
Oct 8
yobro4619/hard_labels_final
Updated
Jun 1
•
2
yobro4619/hard_labels_sample
Text Generation
•
Updated
May 31
•
4
yobro4619/Qwen-StonePaper-SFT
Updated
May 6
yobro4619/Qwen-StonePaper-DPO
Updated
May 6
yobro4619/Qwen-Reward-DPO
Updated
Apr 23
yobro4619/Qwen-Reward-SFT
Updated
Apr 23
Previous
1
2
Next
×
🎉 Free Image Generator Now Available!
Totally Free + Zero Barriers + No Login Required
Visit Now