KnutJaegersberg
/

nllb-moe-54b-4bit

feature-extraction

4-bit precision

Model card Files Files and versions Community

KnutJaegersberg commited on Dec 16, 2023

Commit

28fa349

•

1 Parent(s): 7b942e4

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -214,7 +214,7 @@ inference: false
 This works by upgrading bitsandbytes to the most recent version and installing this pull request of hf transformers:
 https://github.com/huggingface/transformers/pull/26037
-It uses 37 GB VRAM and is very slow, but it is properly the open access sota model in machine translation.
 # NLLB-MoE

 This works by upgrading bitsandbytes to the most recent version and installing this pull request of hf transformers:
 https://github.com/huggingface/transformers/pull/26037
+It uses 37 GB VRAM and loads in like 20 seconds instead of 15 min, but inference is very slow. It is properly the sota open access model in machine translation.
 # NLLB-MoE