Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
SoraWatermarkRemover
Log In
Sign Up
Chenyan Xiong Research Group at CMU
university
https://www.cs.cmu.edu/~cx/
Activity Feed
Follow
9
AI & ML interests
None defined yet.
Recent Activity
SingularityHJY
updated
a dataset
about 1 month ago
cx-cmu/ClueWeb-Reco
yuzc19
updated
a dataset
about 1 month ago
cx-cmu/repro-organic-data-72B
yuzc19
updated
a collection
about 1 month ago
RePro
View all activity
Papers
RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
View all Papers
Team members
7
cx-cmu
's datasets
5
Sort: Recently updated
cx-cmu/ClueWeb-Reco
Viewer
•
Updated
Oct 24
•
87.2M
•
75
•
1
cx-cmu/repro-organic-data-72B
Viewer
•
Updated
Oct 18
•
58.3M
•
156
cx-cmu/repro-rl-data
Viewer
•
Updated
Oct 18
•
41k
•
36
cx-cmu/repro-rephrased-data-72B
Viewer
•
Updated
Oct 18
•
39M
•
831
cx-cmu/CLUE-LLM
Viewer
•
Updated
Jun 11
•
1.21k
•
10