Active filters: 8-bit
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF • Text Generation • 870k • 10
MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF • Text Generation • 1.46M • 8
qeternity/Qwen2.5-72B-Instruct-W8A8 • 34 • 3
noneUsername/Mistral-Small-Instruct-2409-W8A8-Dynamic-Per-Token • 2 • 1
malenia1/ternary-weight-embedding • 632 • 7
MaziyarPanahi/Qwen2.5-7B-Instruct-abliterated-v2-GGUF • Text Generation • 137 • 3
MaziyarPanahi/llm_3_2_flux_prompt-GGUF • Text Generation • 434 • 1
akhmat-s/t5-large-quant-grammar-corrector • Text2Text Generation • 16.3k • 1
MaziyarPanahi/Llama-3.2-1B-GGUF • Text Generation • 473 • 1
MaziyarPanahi/SmolLM2-135M-Instruct-GGUF • Text Generation • 220 • 2
AIFunOver/stable-diffusion-3.5-large-turbo-openvino-8bit • Text-to-Image • 67 • 1
Qwen/Qwen2.5-Coder-14B-Instruct-GPTQ-Int8 • Text Generation • 1.3k • 3
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int8 • Text Generation • 14.8k • 16
mlx-community/Qwen2.5-Coder-32B-Instruct-8bit • Text Generation • 182 • 8
mlx-community/Qwen2.5-Coder-32B-8bit • Text Generation • 47 • 3
lmstudio-community/Qwen2.5-Coder-32B-Instruct-MLX-8bit • Text Generation • 248 • 3
PrunaAI/suriya7-conversational-gpt-1-bnb-8bit-smashed • 6 • 1
prithivMLmods/Marco-o1-GGUF • Text Generation • 683 • 12
HuggingFaceTB/SmolLM2-1.7B-Instruct-Q8-mlx • Text Generation • 96 • 2
DrNicefellow/Qwen-QwQ-32B-Preview-8.0bpw-h8-exl2 • 18 • 3
tiiuae/Falcon3-7B-Instruct-1.58bit • Text Generation • 3.18k • 13
mlx-community/Llama-3.3-70B-Instruct-8bit • Text Generation • 1.17k • 10
MaziyarPanahi/Llama-3.3-70B-Instruct-GGUF • Text Generation • 830k • 11
CalamitousFelicitousness/Llama-3.3-70B-Instruct-W8A8-INT8 • 102 • 4
MaziyarPanahi/Pleias-Nano-GGUF • Text Generation • 268 • 4
Dracones/L3.3-70B-Euryale-v2.3_exl2_8.0bpw • Text Generation • 17 • 1
prithivMLmods/QwQ-LCoT-3B-Instruct-GGUF • Text Generation • 212 • 12
IlyaGusev/sainemo_remix_12b_gptq_8bit • Text Generation • 336 • 4
MaziyarPanahi/patricide-12B-Unslop-Mell-v2-GGUF • Text Generation • 246 • 3
tiiuae/Falcon3-10B-Instruct-1.58bit • Text Generation • 110 • 13