📐 FineMath
FineMath datasets and ablation models
Viewer • Updated • 48.3M • 11.4k • 343
Note: FineMath datasets
HuggingFaceTB/FineMath-Llama-3B
3B • Updated • 89 • 18
Note: Llama 3B trained on a mix of FineMath and FineWeb-Edu. It is better at math than Llama and similar in reasoning, knowledge, and common sense.
HuggingFaceTB/finemath-classifier
Text Classification • 0.1B • Updated • 7.93k • 12
Note: FineMath text classifier that scores text for mathematical reasoning and educational content
HuggingFaceTB/finemath-ablation-finemath-4plus
3B • Updated • 78 • 1
HuggingFaceTB/finemath-ablation-finemath-3plus
3B • Updated • 49
HuggingFaceTB/finemath-ablation-infiwebmath-4plus
3B • Updated • 48 • 2
HuggingFaceTB/finemath-ablation-infiwebmath-3plus
3B • Updated • 52
Note: Ablations on FineMath subsets (continual pre-training of base Llama 3.2 3B on 60B tokens)
HuggingFaceTB/finemath-ablation-finemath-infimath-3plus
3B • Updated • 56
HuggingFaceTB/finemath-ablation-finemath-infimath-4plus
3B • Updated • 62 • 2
Note: Ablations on FineMath 3plus and 4plus (continual pre-training of base Llama 3.2 3B on 60B tokens)
HuggingFaceTB/finemath-ablation-fwedu
3B • Updated • 51
HuggingFaceTB/finemath-ablation-infiwebmath
3B • Updated • 54
HuggingFaceTB/finemath-ablation-owm
3B • Updated • 55
Note: Ablations on public math datasets, with FineWeb-Edu (FW-Edu) as a baseline (continual pre-training of base Llama 3.2 3B on 60B tokens)
HuggingFaceTB/finemath-ablation-3plus-160B
3B • Updated • 57
HuggingFaceTB/finemath-ablation-4plus-160B
3B • Updated • 124
Note: Longer ablations on 160B tokens, using a mix of 40% FineWeb-Edu and 60% FineMath and InfiWebMath (3plus / 4plus)
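
To make the 160B mix concrete, here is a minimal sketch of the arithmetic. It assumes the 40% / 60% split refers to shares of training tokens and that "160B" means 160 billion tokens; neither is stated explicitly in the note. The function name `split_token_budget` and the source labels are hypothetical, chosen only for illustration.

```python
def split_token_budget(total_tokens: int, percents: dict[str, int]) -> dict[str, int]:
    """Split a token budget across sources given integer percentage shares.

    Hypothetical helper: assumes the shares are percentages of tokens.
    """
    if sum(percents.values()) != 100:
        raise ValueError("percentage shares must sum to 100")
    # Integer arithmetic avoids floating-point rounding on large counts.
    return {name: total_tokens * pct // 100 for name, pct in percents.items()}


# Assumed reading of the note: 40% FineWeb-Edu, 60% math data
# (FineMath and InfiWebMath combined), out of 160 billion tokens.
budget = split_token_budget(
    160_000_000_000,
    {"fineweb-edu": 40, "finemath+infiwebmath": 60},
)

for source, tokens in budget.items():
    print(f"{source}: {tokens / 1e9:.0f}B tokens")
```

Under these assumptions the run would see roughly 64B tokens of FineWeb-Edu and 96B tokens of math data. The note does not say how the 60% is divided between FineMath and InfiWebMath, so the sketch leaves that share combined.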