--- license: other license_name: nvidia-open-model-license license_link: >- https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/ base_model: nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 base_model_relation: quantized quantized_by: turboderp tags: - exl3 --- EXL3 quants of [Llama-3_1-Nemotron-Ultra-253B-v1](https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1) [2.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-Nemotron-Ultra-253B-v1-exl3/tree/2.0bpw) [5.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-Nemotron-Ultra-253B-v1-exl3/tree/5.0bpw) (more bitrates will follow)