Jingming Zhuo

JingmingZ

AI & ML interests

Large Language Models

Recent Activity

updated a dataset 8 days ago

rl-rag/verified_miro_trajectories

Viewer • Updated 8 days ago • 9.88k • 78

published a dataset 8 days ago

rl-rag/verified_miro_trajectories

Viewer • Updated 8 days ago • 9.88k • 78

upvoted a paper 17 days ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published 23 days ago • 67

upvoted a paper about 2 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 62

upvoted a paper 3 months ago

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Paper • 2506.05331 • Published Jun 5 • 13

liked a dataset 3 months ago

xy06/MINT-CoT-Dataset

Viewer • Updated Jun 10 • 100 • 73 • 7

liked a model 3 months ago

xy06/MINT-CoT-7B

8B • Updated Jun 4 • 26 • 6

upvoted a paper 5 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 69

upvoted a paper 7 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 203

liked a Space 9 months ago

Open LMM Reasoning Leaderboard

🥇

A Leaderboard that demonstrates LMM reasoning capabilities

upvoted 3 papers 11 months ago

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 61

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Paper • 2410.12405 • Published Oct 16, 2024 • 13

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Paper • 2410.07167 • Published Oct 9, 2024 • 40

liked a Space 12 months ago

121

Open VLM Video Leaderboard

🌎

VLMEvalKit Eval Results in video understanding benchmark

upvoted a paper over 1 year ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 35

authored a paper over 1 year ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 35

liked a Space over 1 year ago

876

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection
