In a Training Loop 🔄
lewtun
·
AI & ML interests
LLMs, LLMs, LLMs
Organizations
lewtun/qwen2-7B-lr-3e-6-tok-1024
Updated
lewtun/qwen2-0.5B-lr-3e-6-tok-1024
Updated
lewtun/qwen2-1.5B-lr-3e-6-tok-1024
Updated
lewtun/qwen2-1.5B-lr-3e-6-tok-2048
Updated
lewtun/qwen2-0.5B-lr-3e-6-tok-2048
Updated
lewtun/qwen2-1.5B-lr-3e-6
2B
•
Updated
•
9
lewtun/qwen2-0.5B-lr-3e-6
0.5B
•
Updated
•
11
0.5B
•
Updated
•
6
2B
•
Updated
•
7
lewtun/EleutherAI_pythia-1b
1B
•
Updated
•
9
lewtun/kto-aligned-model-lora
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.05
Text Generation
•
9B
•
Updated
•
10
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.4
Text Generation
•
9B
•
Updated
•
10
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.01
Text Generation
•
9B
•
Updated
•
11
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.2
Text Generation
•
9B
•
Updated
•
8
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.1
Text Generation
•
9B
•
Updated
•
13
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-3
Text Generation
•
9B
•
Updated
•
15
•
1
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-2
Text Generation
•
9B
•
Updated
•
16
lewtun/gemma-7b-sft-full-openhermes-v0
Text Generation
•
9B
•
Updated
•
15
lewtun/gemma-7b-dpo-full-mix2-beta-0.1
Text Generation
•
9B
•
Updated
•
21
lewtun/gemma-7b-dpo-full-ultrafeedback-beta-0.01
Text Generation
•
9B
•
Updated
•
16
lewtun/gemma-7b-dpo-full-mix1-beta-0.4-epoch-3
Text Generation
•
9B
•
Updated
•
14
lewtun/gemma-7b-dpo-full-mix1-beta-0.01
Text Generation
•
9B
•
Updated
•
16
lewtun/gemma-7b-dpo-full-mix1-beta-0.05
Text Generation
•
9B
•
Updated
•
11