Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
ldwang 's Collections
MiscSpaces
MiscAgentic
MiscIndustry
MiscKernel
MiscR1
MiscModels
MiscDatasets
MiscTools

MiscSpaces

updated Nov 6
Upvote
1

  • Running
    587

    Scaling test-time compute

    📈
    587

    Implement test-time compute scaling for math problems


  • Running
    Featured
    1.23k

    FineWeb: decanting the web for the finest text data at scale

    🍷
    1.23k

    Generate high-quality text data for LLMs using FineWeb


  • Running
    3.6k

    The Ultra-Scale Playbook

    🌌
    3.6k

    The ultimate guide to training LLM on large GPU Clusters


  • Running
    212

    FineVision: Open Data is All You Need

    📝
    212

    A new open-source dataset for training VLMs


  • Running
    19

    Megatron Memory Estimator

    👁
    19

    Estimate GPU memory usage for Megatron models


  • Running on Zero
    19

    Smol2Operator Demo

    🐢
    19

    Smol2Operator Demo: GUI Agent Model


  • Running on CPU Upgrade
    Featured
    2.67k

    The Smol Training Playbook

    📚
    2.67k

    The secrets to building world-class LLMs


  • Running
    72

    Unlocking On-Policy Distillation for Any Model Family

    📝
    72

    Apply on-policy distillation to any model family

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required