-
-
-
-
-
-
Inference Providers
Active filters:
em-dpo
Sean13/mistral-7b-instruct-v0.2-rdpo-full-eta0.55
Text Generation
•
7B
•
Updated
•
2
Sean13/mistral-7b-instruct-v0.2-rdpo-full-eta0.75
Text Generation
•
7B
•
Updated
•
1
Sean13/mistral-7b-instruct-v0.2-rdpo-full-eta0.99
Text Generation
•
7B
•
Updated
•
1
Sean13/llama-8b-instruct-rdpo-full
Text Generation
•
8B
•
Updated
•
1
Sean13/llama-8b-instruct-rdpo-full-multipref
Text Generation
•
266k
•
Updated
•
2
Sean13/llama-8b-instruct-rdpo-full-multipref-init-eta-0.90
Text Generation
•
266k
•
Updated
•
2
Sean13/llama-8b-instruct-rdpo-full-multipref-init-eta-0.80
Text Generation
•
266k
•
Updated
•
5
Sean13/llama-8b-instruct-rdpo-full-multipref-0.90
Text Generation
•
266k
•
Updated
•
5
Sean13/llama-8b-instruct-rdpo-full-multipref-init-eta-0.99
Text Generation
•
266k
•
Updated
•
2
Sean13/llama-8b-instruct-rdpo-full-multipref-0.99
Text Generation
•
266k
•
Updated
•
1
Sean13/llama-8b-instruct-rdpo-full-multipref-0.80
Text Generation
•
266k
•
Updated
•
2