Doge-tokenizer
Tokenizer for the training model on smollm-corpus, and support reasoning fine-tuning like R1. This tokenizer was trained on 2M samples from:
- FineWeb-Edu 70%
- Cosmopedia v2 20%
- Python-Edu 5%
- FineMath 5%
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no pipeline_tag.