In a Training Loop 🔄
lewtun
AI & ML interests
LLMs, LLMs, LLMs
lewtun/gemma-7b-dpo-full-mix1-beta-0.1-epoch-3
Text Generation • 9B • Updated • 13
lewtun/gemma-7b-dpo-full-mix1-beta-0.6
Text Generation • 9B • Updated • 11
lewtun/gemma-7b-dpo-full-mix1-beta-0.4
Text Generation • 9B • Updated • 14
lewtun/gemma-7b-dpo-full-mix1-beta-0.2
Text Generation • 9B • Updated • 12
lewtun/gemma-7b-dpo-full-mix1-beta-0.1
Text Generation • 9B • Updated • 10
lewtun/gemma-7b-dpo-full-ultrafeedback-v0
Text Generation • Updated • 12
lewtun/gemma-7b-dpo-full-mix-beta-0.1
Updated
lewtun/gemma-7b-dpo-full-orca-v0
Text Generation • 9B • Updated • 18
lewtun/gemma-7b-sft-full-deita-10k-v0
Text Generation • 9B • Updated • 19
lewtun/gemma-7b-sft-full-ultrachat-v0
Text Generation • 9B • Updated • 14 • 1
lewtun/gemma-7b-sft-full-longest-1k-v1
Text Generation • 9B • Updated • 13
lewtun/gemma-7b-sft-full-longest-1k-v0
Text Generation • 9B • Updated • 15
lewtun/gemma-7b-sft-full-dolly-v3
Text Generation • 9B • Updated • 10
lewtun/gemma-7b-sft-full-dolly-v2
Text Generation • 9B • Updated • 13
lewtun/gemma-7b-sft-full-dolly-v1
Text Generation • 9B • Updated • 10
lewtun/gemma-7b-sft-full-dolly-v0
Text Generation • 9B • Updated • 11
Text Generation • 0.5B • Updated • 22
lewtun/zephyr-7b-dpo-qlora-fix
lewtun/zephyr-7b-dpo-qlora-8e0975a
lewtun/zephyr-7b-dpo-qlora
lewtun/handbook-sft-qlora-test
Text Generation • 7B • Updated • 13
lewtun/zephyr-7b-dpo-full
Text Generation • 7B • Updated • 17
lewtun/zephyr-7b-sft-qlora
Text Classification • 0.4B • Updated • 13
Text Generation • 7B • Updated • 9
Text Generation • 7B • Updated • 9
lewtun/mistral-7b-sft-ultrachat-arithmo-25
Text Generation • Updated • 12
lewtun/mistral-7b-sft-ultrachat-arithmo-50
Text Generation • Updated • 17 • 1