# output_model

This is a merge of pre-trained language models created using [mergekit](https://github.com/arcee-ai/mergekit).
## Merge Details

### Merge Method

This model was merged using the DARE TIES merge method, with Qwen/Qwen3-32B as the base model.
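DARE TIES combines two ideas: DARE sparsifies each model's task vector (its parameter delta from the base) by randomly dropping entries and rescaling the survivors, and TIES resolves sign conflicts between models by majority vote before summing. Below is a minimal single-tensor sketch of that procedure; the function and variable names are illustrative only, and mergekit's actual implementation operates over whole checkpoints with additional options.

```python
import torch

def dare_ties_merge(base, tuned, densities, weights):
    """Merge one parameter tensor from several fine-tuned models into `base`.

    Toy illustration only: mergekit's real dare_ties runs over entire
    checkpoints and supports extra options (int8 masks, normalization, ...).
    """
    deltas = []
    for ft, density, weight in zip(tuned, densities, weights):
        delta = ft - base                               # task vector vs. the base weights
        keep = torch.rand_like(delta) < density         # DARE: drop entries with prob 1 - density
        deltas.append(weight * delta * keep / density)  # rescale survivors to keep the expectation
    stacked = torch.stack(deltas)
    majority = torch.sign(stacked.sum(dim=0))           # TIES: elect one sign per parameter
    agrees = torch.sign(stacked) == majority
    return base + (stacked * agrees).sum(dim=0)         # sum only sign-consistent contributions

# Example on a single random tensor, using this card's densities/weights:
base = torch.zeros(4096)
tuned = [base + 0.01 * torch.randn(4096) for _ in range(5)]
merged = dare_ties_merge(base, tuned,
                         densities=[0.53] * 5,
                         weights=[0.3, 0.3, 0.05, 0.05, 0.1])
```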
### Models Merged
The following models were included in the merge:
- LLMcompe-Team-Watanabe/Qwen3-32B-openmathreasoning-sft
- LLMcompe-Team-Watanabe/Qwen3-32B-textbookreasoning-UGPhysics-AoPsInstruct-sft
- LLMcompe-Team-Watanabe/others_second_stage
- LLMcompe-Team-Watanabe/Qwen3-32B-sft-deepscaler-openr1-havard-40k-1ep-lr5e6-8k
- LLMcompe-Team-Watanabe/Qwen3-32B-sft-HIS-Chem-Engineering-45k-1ep-lr5e6-4096
### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: Qwen/Qwen3-32B
  - model: LLMcompe-Team-Watanabe/Qwen3-32B-textbookreasoning-UGPhysics-AoPsInstruct-sft
    parameters:
      density: 0.53
      weight: 0.3
  - model: LLMcompe-Team-Watanabe/Qwen3-32B-openmathreasoning-sft
    parameters:
      density: 0.53
      weight: 0.3
  - model: LLMcompe-Team-Watanabe/Qwen3-32B-sft-deepscaler-openr1-havard-40k-1ep-lr5e6-8k
    parameters:
      density: 0.53
      weight: 0.05
  - model: LLMcompe-Team-Watanabe/others_second_stage
    parameters:
      density: 0.53
      weight: 0.05
  - model: LLMcompe-Team-Watanabe/Qwen3-32B-sft-HIS-Chem-Engineering-45k-1ep-lr5e6-4096
    parameters:
      density: 0.53
      weight: 0.1
  - model: Qwen/Qwen3-32B
    parameters:
      density: 0.53
      weight: 0.20
merge_method: dare_ties
base_model: Qwen/Qwen3-32B
parameters:
  int8_mask: true
  normalize: false
dtype: bfloat16
```
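In this config, `density: 0.53` keeps roughly half of each task vector under DARE, `weight` scales each entry's contribution (the six weights sum to 1.0), `normalize: false` leaves those weights unnormalized, and `int8_mask` stores intermediate merge masks as 8-bit integers to reduce memory use. The sketch below shows how such a config can be applied through mergekit's Python API; the `MergeConfiguration`/`run_merge` entry points and `MergeOptions` fields are taken from the mergekit repository's examples and may differ across versions.

```python
import yaml
import torch
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Parse the YAML above (saved locally as config.yaml) and rerun the merge.
with open("config.yaml", "r", encoding="utf-8") as f:
    config = MergeConfiguration.model_validate(yaml.safe_load(f))

run_merge(
    config,
    out_path="./output_model",
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # use a GPU if one is available
        copy_tokenizer=True,             # copy the base model's tokenizer into the output
    ),
)
```

The equivalent command-line invocation is `mergekit-yaml config.yaml ./output_model`.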