Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

authored a paper 3 days ago

Aligning Instruction Tuning with Pre-training

authored a paper 3 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

commented on a paper 3 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

chujiezheng's activity

commented on a paper 3 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 3 days ago • 88

commented on 2 papers about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91

commented on 2 papers 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 346

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

commented on 2 papers 3 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 27

New activity in chujiezheng/Mistral7B-PairRM-SPPO-ExPO 5 months ago

Adding Evaluation Results

#1 opened 5 months ago by leaderboard-pr-bot

New activity in internlm/internlm2_5-20b-chat 7 months ago

Update tokenizer_config.json

#2 opened 7 months ago by chujiezheng

New activity in mistralai/Mistral-7B-Instruct-v0.3 9 months ago

No system message?

#14 opened 9 months ago by mclassHF2023

New activity in princeton-nlp/Llama-3-Instruct-8B-SimPO 9 months ago

Add chat_template

#3 opened 9 months ago by chujiezheng

commented on a paper 9 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

New activity in chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO 9 months ago

Possibly wrong model

#1 opened 9 months ago by ByteBrew23

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 9 months ago

Update README.md

#3 opened 9 months ago by chujiezheng

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO 9 months ago

Update README.md

#2 opened 9 months ago by chujiezheng

New activity in chujiezheng/Llama3-70B-Chinese-Chat-ExPO 9 months ago

Create README.md

#1 opened 9 months ago by chujiezheng

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 9 months ago

Update README.md

#2 opened 9 months ago by chujiezheng

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO 9 months ago

Create README.md

#1 opened 9 months ago by chujiezheng

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 9 months ago

Create README.md

#1 opened 9 months ago by chujiezheng

New activity in chujiezheng/LLaMA3-iterative-DPO-final-ExPO 9 months ago

Create README.md

#1 opened 9 months ago by chujiezheng