1 37 52

InHo Won

kotmul

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

upstage/Solar-Open-100B

liked a model 5 days ago

tencent/WeDLM-8B-Instruct

upvoted an article 9 days ago

Deriving the PPO Loss from First Principles

View all activity

Organizations

liked a model 3 days ago

upstage/Solar-Open-100B

Text Generation • 103B • Updated about 4 hours ago • 1.33k • 332

liked a model 5 days ago

tencent/WeDLM-8B-Instruct

Text Generation • 8B • Updated 5 days ago • 1.86k • 272

upvoted an article 9 days ago

Article

Deriving the PPO Loss from First Principles

11 days ago

•

liked a model 10 days ago

facebook/roscoe-512-roberta-base

Updated Jan 12, 2023 • 187 • 9

updated a model 27 days ago

zjotero/Qwen2.5-1.5B-cot-grpo-sp

Text Generation • 2B • Updated 27 days ago • 20

published a model 27 days ago

zjotero/Qwen2.5-1.5B-cot-grpo-sp

Text Generation • 2B • Updated 27 days ago • 20

updated a model about 1 month ago

zjotero/Qwen2.5-1.5B-cot-skd

Text Generation • 2B • Updated Dec 2, 2025 • 28

published a model about 1 month ago

zjotero/Qwen2.5-1.5B-cot-skd

Text Generation • 2B • Updated Dec 2, 2025 • 28

updated a dataset about 1 month ago

zjotero/sampled_math

Viewer • Updated Nov 27, 2025 • 1.84k • 6

published a dataset about 1 month ago

zjotero/sampled_math

Viewer • Updated Nov 27, 2025 • 1.84k • 6

updated a dataset about 1 month ago

zjotero/sampled_math_think

Viewer • Updated Nov 27, 2025 • 1.84k • 13

published a dataset about 1 month ago

zjotero/sampled_math_think

Viewer • Updated Nov 27, 2025 • 1.84k • 13

updated a dataset about 1 month ago

zjotero/sampled_math_cot

Viewer • Updated Nov 27, 2025 • 2k • 26

published a dataset about 1 month ago

zjotero/sampled_math_cot

Viewer • Updated Nov 27, 2025 • 2k • 26

updated a model about 1 month ago

zjotero/Qwen3-8B

Text Generation • 8B • Updated Nov 24, 2025 • 158

published a model about 1 month ago

zjotero/Qwen3-8B

Text Generation • 8B • Updated Nov 24, 2025 • 158

updated a model about 1 month ago

zjotero/Qwen2.5-1.5B-Base

Text Generation • 2B • Updated Nov 24, 2025 • 648

published a model about 1 month ago

zjotero/Qwen2.5-1.5B-Base

Text Generation • 2B • Updated Nov 24, 2025 • 648

updated a model about 2 months ago

kotmul/sup_kd4

Text Generation • 2B • Updated Nov 21, 2025 • 3

published a model about 2 months ago

kotmul/sup_kd4

Text Generation • 2B • Updated Nov 21, 2025 • 3

InHo Won

AI & ML interests

Recent Activity

Organizations

kotmul's activity

Deriving the PPO Loss from First Principles

🎉 Free Image Generator Now Available!