---
license: other
license_name: nvidia-open-model-license
license_link: >-
  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
---
EXL3 quants of [Llama-3.3-Nemotron-Super-49B-v1](https://huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1/tree/main)

[1.80 bits per weight / H4](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/1.8bpw_H4)    
[2.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.0bpw)    
[2.50 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.5bpw)    
[3.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.0bpw)    
[3.50 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.5bpw)    
[4.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/4.0bpw)    
[5.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/5.0bpw)    
[6.00 bits per weight](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/6.0bpw)    
[8.00 bits per weight / H8](https://huggingface.co/turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/8.0bpw_H8)    
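
Each quant lives on its own branch of this repo, so fetching one means passing the branch name as the revision. The sketch below is one possible way to do that, assuming the third-party `huggingface_hub` package is installed. The `BRANCHES` table is copied from the links above; the helper names `branch_for` and `download_quant` are hypothetical and not part of any library.

```python
from typing import Optional

REPO_ID = "turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3"

# Branch names copied from the links in this README, keyed by bits per weight.
BRANCHES = {
    1.8: "1.8bpw_H4",
    2.0: "2.0bpw",
    2.5: "2.5bpw",
    3.0: "3.0bpw",
    3.5: "3.5bpw",
    4.0: "4.0bpw",
    5.0: "5.0bpw",
    6.0: "6.0bpw",
    8.0: "8.0bpw_H8",
}


def branch_for(bpw: float) -> str:
    """Return the branch name for a listed bits-per-weight value."""
    try:
        return BRANCHES[bpw]
    except KeyError:
        listed = ", ".join(str(k) for k in sorted(BRANCHES))
        raise ValueError(f"no {bpw} bpw quant listed; choose from: {listed}") from None


def download_quant(bpw: float, local_dir: Optional[str] = None) -> str:
    """Download one quant branch and return the local path to its files."""
    # Imported here so the lookup helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=branch_for(bpw),
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Downloads the 4.00 bpw branch into the default Hugging Face cache.
    print(download_quant(4.0))
```

Keeping the branch names in an explicit table, rather than formatting them from the number, avoids guessing the `_H4` and `_H8` suffixes that only the smallest and largest quants carry.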

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6383dc174c48969dcf1b4fce/PXwVukMFqjCcCuyaOg0YM.png)