turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3


turboderp committed on Apr 14

Commit 87c9c3a · verified · 1 parent: 32230da

Update README.md

Files changed (1)

README.md +17 -3


@@ -1,3 +1,17 @@
----
-license: apache-2.0
----

+---
+license: other
+license_name: nvidia-open-model-license
+license_link: >-
+  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
+---
+EXL3 quants of [Llama-3.3-Nemotron-Super-49B-v1](https://huggingface.co/nvidia/Llama-3_3-Nemotron-Super-49B-v1/tree/main)
+[1.80 bits per weight / H4](https://huggingface.co/turboderp/Llama-3.2-1B-Instruct-exl3/tree/1.8bpw_H4)
+[2.00 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.0bpw)
+[2.50 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/2.5bpw)
+[3.00 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.0bpw)
+[3.50 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/3.5bpw)
+[4.00 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/4.0bpw)
+[5.00 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/5.0bpw)
+[6.00 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/6.0bpw)
+[8.00 bits per weight](https://huggingface.co/Llama-3.3-Nemotron-Super-49B-v1-exl3/tree/8.0bpw)
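
The link list added above follows a regular pattern: each quantization level appears to live on its own branch named after its bits-per-weight value (`2.0bpw`, `3.5bpw`, and so on, with `1.8bpw_H4` carrying an extra suffix). A minimal sketch of that naming scheme in Python follows. The helper names `branch_for` and `tree_url` are hypothetical, and `REPO_ID` is an assumption taken from the repository name in the page header, not from the link targets in the diff (which omit the `turboderp/` namespace).

```python
from typing import Optional

# Assumption: the quant branches live in the repository named in the page header.
REPO_ID = "turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3"


def branch_for(bpw: float, head_bits: Optional[int] = None) -> str:
    """Build a branch name such as '2.0bpw' or '1.8bpw_H4' (hypothetical helper)."""
    name = f"{bpw:.1f}bpw"
    if head_bits is not None:
        # Suffix as seen in the README's '1.8bpw_H4' entry.
        name += f"_H{head_bits}"
    return name


def tree_url(repo_id: str, branch: str) -> str:
    """Build the Hugging Face 'tree' URL for one branch of a repository."""
    return f"https://huggingface.co/{repo_id}/tree/{branch}"


if __name__ == "__main__":
    for bpw in (2.0, 2.5, 3.0, 3.5, 4.0, 5.0, 6.0, 8.0):
        print(tree_url(REPO_ID, branch_for(bpw)))
    print(tree_url(REPO_ID, branch_for(1.8, head_bits=4)))
```

If these branch names are correct, the same string could be passed as the `revision` argument of `huggingface_hub.snapshot_download` to fetch a single quantization level rather than the whole repository.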