base_model: nvidia/OpenReasoning-Nemotron-14B
[EXL3](https://github.com/turboderp-org/exllamav3) quantization of [OpenReasoning-Nemotron-14B](https://huggingface.co/nvidia/OpenReasoning-Nemotron-14B), 4 bits per weight.
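
As a rough illustration of what "4 bits per weight" means for storage, the sketch below estimates weight size from a parameter count and a bit width. The 14e9 parameter count is an assumption taken from the "14B" in the model name, not a measured value, and the helper `estimated_weight_bytes` is hypothetical. A real quantized checkpoint may keep some tensors at other precisions, so treat the result as a ballpark figure only.

```python
def estimated_weight_bytes(num_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone: params * bits / 8 bytes."""
    return num_params * bits_per_weight / 8


# Assumption: nominal parameter count read off the "14B" in the model name.
NOMINAL_PARAMS = 14e9

size_4bpw = estimated_weight_bytes(NOMINAL_PARAMS, 4.0)   # this quantization
size_16bpw = estimated_weight_bytes(NOMINAL_PARAMS, 16.0)  # 16-bit baseline for comparison

print(f"4 bpw:  ~{size_4bpw / 1e9:.1f} GB")
print(f"16 bpw: ~{size_16bpw / 1e9:.1f} GB")
```

The estimate covers weights only; runtime memory also depends on context length and cache settings, which this sketch does not model.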