A Fishy Model
This model was trained on with SFT on the ChatML format with 8k context.
Uploaded model
- Developed by: TheTsar1209
- License: apache-2.0
- Finetuned from model : unsloth/Qwen2.5-14B-Instruct-bnb-4bit
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 32.59 |
IFEval (0-Shot) | 56.22 |
BBH (3-Shot) | 48.83 |
MATH Lvl 5 (4-Shot) | 21.15 |
GPQA (0-shot) | 12.53 |
MuSR (0-shot) | 10.15 |
MMLU-PRO (5-shot) | 46.67 |
- Downloads last month
- 12
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for TheTsar1209/qwen-carpmuscle-v0.1
Base model
Qwen/Qwen2.5-14B
Finetuned
Qwen/Qwen2.5-14B-Instruct
Quantized
unsloth/Qwen2.5-14B-Instruct-bnb-4bit
Evaluation results
- strict accuracy on IFEval (0-Shot)Open LLM Leaderboard56.220
- normalized accuracy on BBH (3-Shot)Open LLM Leaderboard48.830
- exact match on MATH Lvl 5 (4-Shot)Open LLM Leaderboard21.150
- acc_norm on GPQA (0-shot)Open LLM Leaderboard12.530
- acc_norm on MuSR (0-shot)Open LLM Leaderboard10.150
- accuracy on MMLU-PRO (5-shot)test set Open LLM Leaderboard46.670