zaydzuhri's Collections

Token Order Prediction

updated 7 days ago

Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"

  • Predicting the Order of Upcoming Tokens Improves Language Modeling

    Paper • 2508.19228 • Published 13 days ago • 21

  • zaydzuhri/vanilla-340M-4096-model

    0.4B • Updated 7 days ago • 121

  • zaydzuhri/mtp-340M-4096-model

    0.4B • Updated 7 days ago • 109

  • zaydzuhri/top-340M-4096-model

    0.4B • Updated 7 days ago • 40 • 1

  • zaydzuhri/vanilla-1.8B-4096-model

    2B • Updated 7 days ago • 62

  • zaydzuhri/mtp-1.8B-4096-model

    2B • Updated 7 days ago • 78

  • zaydzuhri/top-1.8B-4096-model

    2B • Updated 7 days ago • 79

  • zaydzuhri/vanilla-7B-4096-model

    7B • Updated 7 days ago • 76

  • zaydzuhri/mtp-7B-4096-model

    7B • Updated 7 days ago • 82

  • zaydzuhri/top-7B-4096-model

    7B • Updated 7 days ago • 67