Commit 28fa349 (1 parent: 7b942e4), committed by KnutJaegersberg
Update README.md

README.md CHANGED
@@ -214,7 +214,7 @@ inference: false
 This works by upgrading bitsandbytes to the most recent version and installing this pull request of hf transformers:
 https://github.com/huggingface/transformers/pull/26037
 
-It uses 37 GB VRAM and
+It uses 37 GB VRAM and loads in like 20 seconds instead of 15 min, but inference is very slow. It is properly the sota open access model in machine translation.
 
 # NLLB-MoE
 
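
The install step described in the README text above (upgrade bitsandbytes, then install transformers from the linked pull request) could be sketched roughly as follows. This is a minimal sketch under assumptions, not the author's exact procedure: it assumes pip is available for the current interpreter and that the pull request can be installed through pip's `git+https://...@refs/pull/<number>/head` form. The helper names `build_install_commands` and `run_install` are hypothetical and exist only for this illustration.

```python
import subprocess
import sys
from typing import List

# PR number taken from the URL quoted in the README text.
TRANSFORMERS_PR = 26037


def build_install_commands(pr_number: int = TRANSFORMERS_PR) -> List[List[str]]:
    """Return the two pip invocations as argument lists (hypothetical helper)."""
    pip_install = [sys.executable, "-m", "pip", "install"]
    # Assumption: the PR head is reachable via GitHub's refs/pull/<n>/head ref.
    pr_url = (
        "git+https://github.com/huggingface/transformers.git"
        f"@refs/pull/{pr_number}/head"
    )
    return [
        pip_install + ["--upgrade", "bitsandbytes"],  # most recent bitsandbytes
        pip_install + [pr_url],                       # transformers from the PR
    ]


def run_install(commands: List[List[str]]) -> None:
    """Run each command in order and stop on the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)
```

Calling `run_install(build_install_commands())` would perform both installs in the active environment. Building the commands separately from running them makes it possible to print and review them first.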