Shen Yan
stiger1000
AI & ML interests
None yet
Recent Activity
Updated a model about 1 month ago: stiger1000/TC-MoE
Upvoted a paper 4 months ago: Scaling Law for Quantization-Aware Training
Upvoted a paper 4 months ago: Model Merging in Pre-training of Large Language Models
Organizations
None yet