Ganqu Cui

ganqu

cgq15

Recent Activity

liked a dataset 2 days ago

HuggingFaceH4/ultrafeedback_binarized

liked a dataset 9 days ago

openbmb/UltraFeedback

liked a dataset 9 days ago

kkk-an/UltraIF-dpo-20k

ganqu's activity

liked a dataset 2 days ago

HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16, 2024 • 187k • 6.4k • 274

liked 3 datasets 9 days ago

upvoted a paper 10 days ago

LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Paper • 2502.07563 • Published 12 days ago • 23

authored a paper 16 days ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published 17 days ago • 21

upvoted a paper 17 days ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published 17 days ago • 21

authored a paper 19 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 20 days ago • 54

upvoted a paper 20 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 20 days ago • 54

liked a model about 1 month ago

internlm/internlm3-8b-instruct

Text Generation • Updated 12 days ago • 25k • 203

updated a Space about 2 months ago

README

🏃

liked a dataset about 2 months ago

PRIME-RL/Eurus-2-RL-Data

Viewer • Updated 4 days ago • 483k • 2.02k • 25

published an article about 2 months ago

Process Reinforcement through Implicit Rewards

Article • and 1 other • Jan 3 • 24

liked 2 models about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 4 days ago • 462 • 59

PRIME-RL/EurusPRM-Stage2

Updated 4 days ago • 5.05k • 6

updated a model about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 4 days ago • 462 • 59

authored 3 papers 3 months ago

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Paper • 2405.17220 • Published May 27, 2024 • 1

UltraMedical: Building Specialized Generalists in Biomedicine

Paper • 2406.03949 • Published Jun 6, 2024

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

upvoted a paper 3 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32