shawnxzhu
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper • 1 day ago • Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
Upvoted a paper • 12 days ago • InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Upvoted a paper • 13 days ago • Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Organizations
None yet
shawnxzhu's datasets (10)
shawnxzhu/DSAA6000Q-Mistral-7B-Instruct-v0.2-lima-dpo • Viewer • Updated May 11 • 1.03k • 2
shawnxzhu/CHARM-preference20K • Viewer • Updated Apr 12 • 20k • 4
shawnxzhu/CHARM-preference20K-Qwen2.5-72B-Instruct • Viewer • Updated Apr 12 • 20k • 3
shawnxzhu/CHARM-preference20K-Llama-3.1-70B-Instruct • Viewer • Updated Apr 12 • 20k • 3
shawnxzhu/CHARM-preference20K-Llama-3.1-8B-Instruct • Viewer • Updated Apr 12 • 20k • 7
shawnxzhu/CHARM-preference20K-GPT-4o-mini-2024-07-18 • Viewer • Updated Apr 12 • 20k • 55
shawnxzhu/CHARM-preference20K-gemma-2-27b-it • Viewer • Updated Apr 12 • 20k • 4
shawnxzhu/CHARM-preference20K-gemma-2-9b-it • Viewer • Updated Apr 12 • 20k • 6
shawnxzhu/CHARM-preference20K-gemma-2-9b-it-SimPO • Viewer • Updated Apr 12 • 20k • 3
shawnxzhu/backward-curation • Preview • Updated Apr 8 • 1