·
AI & ML interests
None yet
Organizations
None yet
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0516-v1
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v1
Text Generation
•
8B
•
Updated
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0514-v2
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0514-v1
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0513-v1
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2
Text Generation
•
8B
•
Updated
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1
Text Generation
•
8B
•
Updated
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0511-v3
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0511-v2
Text Generation
•
8B
•
Updated
•
4
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0510-v1
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0510-v1
Text Generation
•
8B
•
Updated
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0509
Text Generation
•
8B
•
Updated
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-0509
Text Generation
•
8B
•
Updated
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5
Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0507-wRv2
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0507-wR
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0507
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math3to5-cosine-0506
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0505
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0502
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0430
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0429
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0428-updatePW
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-0428
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0427-updatePW
Text Generation
•
3B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0427-updatePW
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0426
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0426-updatePW
Text Generation
•
8B
•
Updated
•
1