-
-
-
-
-
-
Inference Providers
Active filters:
dpo
CultriX/Lama-DPOlphin-8B
Text Generation
•
Updated
•
15
•
1
tsavage68/Na_L3_150steps_1e6rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_100steps_1e6rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
NicholasCorrado/zephyr-7b-uf-rlced-conifer-dpo-2e
Text Generation
•
Updated
•
10
tsavage68/Na_L3_1000steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
6
tsavage68/Na_L3_100steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
CultriX/Lama-DPOlphin-8B-Q3_K_S-GGUF
Text Generation
•
Updated
•
8
•
1
CultriX/Lama-DPOlphin-8B-Q3_K_M-GGUF
Text Generation
•
Updated
•
11
•
1
CultriX/Lama-DPOlphin-8B-Q4_K_S-GGUF
Text Generation
•
Updated
•
11
•
1
CultriX/Lama-DPOlphin-8B-Q5_K_S-GGUF
Text Generation
•
Updated
•
5
•
1
CultriX/Lama-DPOlphin-8B-Q5_K_M-GGUF
Text Generation
•
Updated
•
10
•
1
CultriX/Lama-DPOlphin-8B-Q6_K-GGUF
Text Generation
•
Updated
•
9
•
2
CultriX/Lama-DPOlphin-8B-Q8_0-GGUF
Text Generation
•
Updated
•
13
•
1
QuantFactory/Fireball-3.1-8B-ORPO-GGUF
Text Generation
•
Updated
•
12
•
2
CultriX/Lama-DPOlphin-8B-Q4_K_M-GGUF
Text Generation
•
Updated
•
6
•
1
mradermacher/Lama-DPOlphin-8B-GGUF
Updated
•
97
•
1
tsavage68/Na_L3_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_350steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_250steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_L3_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_L3_350steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
4
mradermacher/Lama-DPOlphin-8B-i1-GGUF
Updated
•
596
•
1
tsavage68/Na_M2_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
8
tsavage68/Na_M2_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_200steps_1e6rate_01beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_100steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
SongTonyLi/SFT_D1chosenThenDPO_D2a_Instruct_argilla_math_results
Text Generation
•
Updated
•
5
Jatin313/tiny-chatbot-dpo
Updated