nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated about 15 hours ago • 291k • 542
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 27 days ago • 37.6k • 9
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated 20 days ago • 6.59k • 1
Running 1 Quantization Formats And Cuda Compute Capability Support 🧠 1 Quantization Formats & CUDA Compute Capability Support