gghfez commited on
Commit
66dabd2
1 Parent(s): 9ee67b8

Update README.md

Browse files

Added VRAM requirement

Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -1,3 +1,7 @@
 
 
 
 
1
  ---
2
  base_model:
3
  - openbmb/Eurux-8x22b-nca
 
1
+ 3.75bpw EXL2 quant of https://huggingface.co/gghfez/WizardLM-2-8x22B-Beige
2
+
3
+ This is the biggest quant we can fit into 72GB of VRAM (eg. 3x3090 cards) with a Q4 cache
4
+
5
  ---
6
  base_model:
7
  - openbmb/Eurux-8x22b-nca