quarterturn committed c36fe28 (parent: be71aca): update README.md

README.md CHANGED
@@ -17,12 +17,12 @@ For max precision clone Molmo-7B-D-0924:
 ```
 You'll need a 24GB GPU since the model loads at bf16.
 
-For
+For less precision, but much lower memory needed, clone molmo-7B-D-bnb-4bit:
 ```
 git lfs install
 git clone https://huggingface.co/cyan2k/molmo-7B-D-bnb-4bit
 ```
-A 12GB GPU should be fine.
+A 12GB GPU should be fine. Note that the 4-bit quant is not just less accurate; its descriptions can also be quite different. YMMV.
 
 1. create a python3 venv or use conda to create an environment, eg:
 ``` conda create -n caption python=3.11 ```
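The 24GB (bf16) vs 12GB (4-bit) GPU recommendations in the diff above line up with a rough weights-only estimate. A minimal sketch of that arithmetic, assuming ~7B parameters and ignoring activations, KV cache, and CUDA overhead:

```python
# Back-of-envelope VRAM estimate for a 7B-parameter model at two precisions.
# Weights only -- real usage is higher once activations and KV cache are added,
# which is why the README recommends 24 GB for bf16 rather than ~13 GB.

def weight_vram_gb(n_params: float, bytes_per_param: float) -> float:
    """Approximate gigabytes needed just to hold the model weights."""
    return n_params * bytes_per_param / 1024**3

N = 7e9  # approximate Molmo-7B parameter count (assumption)

bf16_gb = weight_vram_gb(N, 2.0)   # bf16: 2 bytes per parameter
int4_gb = weight_vram_gb(N, 0.5)   # 4-bit quant: 0.5 bytes per parameter

print(f"bf16 weights:  ~{bf16_gb:.1f} GB -> needs a 24 GB GPU with headroom")
print(f"4-bit weights: ~{int4_gb:.1f} GB -> fits comfortably on a 12 GB GPU")
```

This is only a sanity check, not a measurement; actual memory use depends on sequence length, batch size, and the inference framework.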