Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
Shen Yan
stiger1000
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
stiger1000/TC-MoE
upvoted
a
paper
4 months ago
Scaling Law for Quantization-Aware Training
upvoted
a
paper
4 months ago
Model Merging in Pre-training of Large Language Models
View all activity
Organizations
None yet
stiger1000
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
about 1 month ago
stiger1000/TC-MoE
Text Generation
•
2B
•
Updated
Jul 25
•
8
•
1
upvoted
2 papers
4 months ago
Scaling Law for Quantization-Aware Training
Paper
•
2505.14302
•
Published
May 20
•
76
Model Merging in Pre-training of Large Language Models
Paper
•
2505.12082
•
Published
May 17
•
39
upvoted
a
paper
5 months ago
Efficient Pretraining Length Scaling
Paper
•
2504.14992
•
Published
Apr 21
•
20
liked
a model
6 months ago
stiger1000/TC-MoE
Text Generation
•
2B
•
Updated
Jul 25
•
8
•
1
published
a model
6 months ago
stiger1000/TC-MoE
Text Generation
•
2B
•
Updated
Jul 25
•
8
•
1