-
-
-
-
-
-
Inference Providers
Active filters:
int4
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
•
14B
•
Updated
•
18.4k
•
2
alecccdd/moondream3-preview-4bit
Image-Text-to-Text
•
Updated
•
416
•
7
tonera/Beyond_Reality_Zimage_v2_svdq
Text-to-Image
•
Updated
•
36
•
1
ussoewwin/HSWQ-Z-Image-fp8e4m3
Text-to-Image
•
Updated
•
1
Advantech-EIOT/intel_llama-2-chat-7b
Text Generation
•
Updated
RedHatAI/zephyr-7b-beta-marlin
Text Generation
•
1B
•
Updated
•
3
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
•
0.3B
•
Updated
•
70
•
2
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
•
1B
•
Updated
•
5
•
2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation
•
5B
•
Updated
•
8
•
5
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf
7B
•
Updated
•
7
•
2
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
•
10B
•
Updated
•
1
softmax/falcon-180B-chat-marlin
Text Generation
•
26B
•
Updated
•
1
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
4
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
•
71B
•
Updated
•
49
•
6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ
Text Generation
•
71B
•
Updated
•
49
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
111B
•
Updated
•
52
•
2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4
Text Generation
•
7B
•
Updated
•
47
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
•
111B
•
Updated
•
48
modelscope/Yi-1.5-34B-Chat-AWQ
Text Generation
•
34B
•
Updated
•
65
•
1
modelscope/Yi-1.5-6B-Chat-GPTQ
Text Generation
•
6B
•
Updated
•
53
modelscope/Yi-1.5-6B-Chat-AWQ
Text Generation
•
6B
•
Updated
•
58
modelscope/Yi-1.5-9B-Chat-GPTQ
Text Generation
•
9B
•
Updated
•
56
•
1
modelscope/Yi-1.5-9B-Chat-AWQ
Text Generation
•
9B
•
Updated
•
230
modelscope/Yi-1.5-34B-Chat-GPTQ
Text Generation
•
34B
•
Updated
•
61
•
1
jojo1899/Phi-3-mini-128k-instruct-ov-int4
Text Generation
•
Updated
•
2
jojo1899/Llama-2-13b-chat-hf-ov-int4
Text Generation
•
Updated
•
1
jojo1899/Mistral-7B-Instruct-v0.2-ov-int4
Text Generation
•
Updated
•
4
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
•
9B
•
Updated
•
63
•
6
ModelCloud/Mistral-Nemo-Instruct-2407-gptq-4bit
Text Generation
•
12B
•
Updated
•
153
•
5
ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit
Text Generation
•
8B
•
Updated
•
12
•
4