-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation
•
32B
•
Updated
•
261k
•
520
unsloth/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
•
32B
•
Updated
•
88.3k
•
196
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
•
32B
•
Updated
•
529k
•
218
nvidia/Qwen2.5-CascadeRL-RM-72B
Text Generation
•
71B
•
Updated
•
16
•
6
Token Classification
•
Updated
•
3.05k
•
51
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16
Text Generation
•
32B
•
Updated
•
8.08k
•
84
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
216
•
4
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
21.4k
•
16
nvidia/Cosmos-Predict2.5-2B
Updated
•
42.4k
•
42
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation
•
8B
•
Updated
•
1.4k
•
29
Image-Text-to-Text
•
Updated
•
7.31k
•
10
Updated
•
18.5k
•
14
Image-Text-to-Text
•
8B
•
Updated
•
77.2k
•
224
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation
•
235B
•
Updated
•
251
•
16
huihui-ai/Huihui-NVIDIA-Nemotron-Nano-9B-v2-abliterated
Text Generation
•
9B
•
Updated
•
64
•
2
nvidia/Cosmos-1.0-Diffusion-7B-Text2World
Text-to-Video
•
Updated
•
2.99k
•
230
roleplaiapp/Llama-3.1-Nemotron-70B-Instruct-HF-Q4_K_M-GGUF
Text Generation
•
71B
•
Updated
•
225
•
2
nvidia/Cosmos-Transfer1-7B
Updated
•
1.31k
•
59
nvidia/Nemotron-H-8B-Base-8K
Text Generation
•
8B
•
Updated
•
11.6k
•
53
nvidia/OpenMath-Nemotron-14B
Text Generation
•
15B
•
Updated
•
183
•
15
nvidia/Cosmos-Predict2-14B-Video2World
Image-to-Video
•
Updated
•
118
•
28
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
Text Generation
•
5B
•
Updated
•
1.06k
•
111
unsloth/AceReason-Nemotron-14B-GGUF
Text Generation
•
15B
•
Updated
•
423
•
8
bartowski/nvidia_AceReason-Nemotron-14B-GGUF
Text Generation
•
15B
•
Updated
•
518
•
10
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
•
Updated
•
905k
•
170
nvidia/Cosmos-Predict2-14B-Sample-GR00T-Dreams-GR1
Updated
•
59
•
3
nvidia/Cosmos-Predict2-0.6B-Text2Image
Text-to-Image
•
Updated
•
61
•
7
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
•
133B
•
Updated
•
3.3k
•
9
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
•
16B
•
Updated
•
49k
•
18
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4
Text Generation
•
118B
•
Updated
•
459
•
3