2stacks
/

s1.1-0.5B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Model Summary

s1.1-0.5B is a sucessor of s1 with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.

Logs: https://wandb.ai/2stacks-sms/s1/runs/ishervdt?nw=nwuser2stacks
Repository: simplescaling/s1
Paper: https://arxiv.org/abs/2501.19393

Thanks to Ryan Marten for helping generate r1 traces for s1K.

Use

The model usage is documented here.

Downloads last month: 50

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for 2stacks/s1.1-0.5B

Base model

Qwen/Qwen2.5-0.5B

Finetuned

Qwen/Qwen2.5-0.5B-Instruct

Finetuned

(219)

this model

Quantizations

1 model

Dataset used to train 2stacks/s1.1-0.5B