Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Siyuan Wang's picture
5 26 121

Siyuan Wang

OldKingMeister
lgg's profile picture srihby's profile picture DrewJin0827's profile picture
·
  • Wangmerlyn

AI & ML interests

ML system

Recent Activity

upvoted a paper 2 days ago
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
liked a model 2 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
upvoted a paper 7 days ago
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing
View all activity

Organizations

Purdue University's profile picture fast.ai community's profile picture ONNX Community's profile picture

Collections 2

AI4S
  • Uni-SMART: Universal Science Multimodal Analysis and Research Transformer

    Paper • 2403.10301 • Published Mar 15, 2024 • 55
Long
  • LongRoPE2: Near-Lossless LLM Context Window Scaling

    Paper • 2502.20082 • Published Feb 27 • 39
AI4S
  • Uni-SMART: Universal Science Multimodal Analysis and Research Transformer

    Paper • 2403.10301 • Published Mar 15, 2024 • 55
Long
  • LongRoPE2: Near-Lossless LLM Context Window Scaling

    Paper • 2502.20082 • Published Feb 27 • 39

Papers 1

arxiv:2502.20082

models 2

OldKingMeister/SPMM-Pretrained

Updated May 16 • 1

OldKingMeister/Qwen2.5-1.5B-Instruct-YaRN

Text Generation • 2B • Updated Apr 28 • 4.2k • 1

datasets 2

OldKingMeister/gsm8k-256

Viewer • Updated Jul 12 • 512 • 20

OldKingMeister/gsm8k-16

Viewer • Updated Jul 12 • 32 • 23
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略