📐 FineMath
FineMath datasets and ablation models
Viewer • Updated • 48.3M • 15k • 358Note FineMath datasets
HuggingFaceTB/FineMath-Llama-3B
3B • Updated • 95 • 22Note Llama 3B trained on a mix of FineMath and FineWeb-Edu: better at math and similar to Llama in reasoning, knowledge and common sense
HuggingFaceTB/finemath-classifier
Text Classification • 0.1B • Updated • 1.11k • 13Note FineMath text classifier to score the mathematical reasoning and educational content
-
HuggingFaceTB/finemath-ablation-finemath-4plus
3B • Updated • 26 • 1 -
HuggingFaceTB/finemath-ablation-finemath-3plus
3B • Updated • 5 -
HuggingFaceTB/finemath-ablation-infiwebmath-4plus
3B • Updated • 17 • 2
HuggingFaceTB/finemath-ablation-infiwebmath-3plus
3B • Updated • 5Note Ablations on FineMath subsets (continual pre-training of base Llama 3.2 3B on 60B tokens)
-
HuggingFaceTB/finemath-ablation-finemath-infimath-3plus
3B • Updated • 15
HuggingFaceTB/finemath-ablation-finemath-infimath-4plus
3B • Updated • 16 • 2Note Ablations on FineMath plus3 and plus4 (continual pre-training of base Llama 3.2 3B on 60B tokens)
-
HuggingFaceTB/finemath-ablation-fwedu
3B • Updated • 13 -
HuggingFaceTB/finemath-ablation-infiwebmath
3B • Updated • 7
HuggingFaceTB/finemath-ablation-owm
3B • Updated • 9Note Ablations on public math datasets and FW-Edu as a baseline (continual pre-training of base Llama 3.2 3B on 60B tokens)
-
HuggingFaceTB/finemath-ablation-3plus-160B
3B • Updated • 14
HuggingFaceTB/finemath-ablation-4plus-160B
3B • Updated • 17Note Longer ablation for 160B on a mix of 40% fineweb-edu 60% FineMath and Infiwebmath 3plus / 4plus