-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
MaziyarPanahi/firefunction-v2-GGUF
Text Generation
•
Updated
•
1.44M
•
16
lakkeo/stable-cypher-instruct-3b
Text Generation
•
Updated
•
902
•
25
neuralmagic/Meta-Llama-3-70B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
793
•
4
AI-MO/NuminaMath-7B-TIR-GPTQ
Text Generation
•
Updated
•
374
•
7
meta-llama/Llama-Guard-3-8B-INT8
Text Generation
•
Updated
•
4.07k
•
34
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation
•
Updated
•
1.45M
•
17
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF
Text Generation
•
Updated
•
1.45M
•
39
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
7.2k
•
14
MaziyarPanahi/gemma-2-2b-it-GGUF
Text Generation
•
Updated
•
1.5M
•
13
nm-testing/Mixtral-8x7B-Instruct-v0.1-W8A8-quantized
Updated
•
7
•
1
LoneStriker/Hermes-3-Llama-3.1-8B-8.0bpw-h8-exl2
Updated
•
5
•
1
neuralmagic/gemma-2-9b-it-quantized.w8a8
Text Generation
•
Updated
•
73
•
3
Statuo/NemoMix-Unleashed-EXL2-8bpw
Text Generation
•
Updated
•
169
•
4
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
5.1k
•
28
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
1.25k
•
13
watsonchua/hansard-gemma-2-9b-lora
Updated
•
7
•
1
MaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF
Text Generation
•
Updated
•
1.45M
•
6
MaziyarPanahi/Yi-Coder-9B-Chat-GGUF
Text Generation
•
Updated
•
1.18M
•
3
qeternity/Mistral-Large-Instruct-2407-w8a8
Updated
•
9
•
1
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
•
Updated
•
1.96k
•
170
MaziyarPanahi/reader-lm-0.5b-GGUF
Text Generation
•
Updated
•
218
•
3
MaziyarPanahi/solar-pro-preview-instruct-GGUF
Text Generation
•
Updated
•
1.44M
•
24
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
1.97k
•
10
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
9.26k
•
8
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
20.5k
•
12
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
28.1k
•
15
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
3.74k
•
19
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF
Text Generation
•
Updated
•
1.44M
•
4
MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF
Text Generation
•
Updated
•
1.47M
•
9
brunopio/Llama3-8B-1.58-100B-tokens-GGUF
Text Generation
•
Updated
•
1.68M
•
14