Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
Xiaoyang Cao's picture
5

Xiaoyang Cao

Sean13
·
https://xiaoyangcao1113.github.io/
  • XiaoyangCao1113
  • xiaoyangcao

AI & ML interests

RLFH, Deep Reinfrocement Learning

Organizations

None yet

models 61

Sean13/llama-8b-instruct-v0.2-cpo-full-label_smoothing-0.1

Text Generation • 266k • Updated Nov 21, 2025 • 2

Sean13/mistral-7b-instruct-v0.2-cpo-full-label_smoothing-0.1

Text Generation • 266k • Updated Nov 21, 2025 • 2

Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1

Text Generation • 266k • Updated Nov 21, 2025 • 2

Sean13/mistral-7b-instruct-v0.2-simpo-full-label_smoothing-0.1

Text Generation • 266k • Updated Nov 21, 2025 • 3

Sean13/llama-8b-instruct-rdpo-full-multipref-0.80

Text Generation • 266k • Updated Nov 20, 2025 • 2

Sean13/llama-8b-instruct-rdpo-full-multipref-0.90

Text Generation • 266k • Updated Nov 20, 2025 • 5

Sean13/llama-8b-instruct-rdpo-full-multipref-0.99

Text Generation • 266k • Updated Nov 20, 2025 • 1

Sean13/llama-8b-instruct-rdpo-full-multipref-init-eta-0.99

Text Generation • 266k • Updated Nov 20, 2025 • 2

Sean13/llama-8b-instruct-rdpo-full-multipref-init-eta-0.80

Text Generation • 266k • Updated Nov 20, 2025 • 5

Sean13/llama-8b-instruct-rdpo-full-multipref

Text Generation • 266k • Updated Nov 20, 2025 • 2
View 61 models

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required