Cedille
/

de-anna

@@ -19,13 +19,20 @@ Anna was trained on German text with a similar methodology to [Boris](https://hu
 # How to run
 ## Loading the model
 ```
 from transformers import AutoTokenizer, AutoModelForCausalLM
 tokenizer = AutoTokenizer.from_pretrained("Cedille/de-anna")
 model = AutoModelForCausalLM.from_pretrained("Cedille/de-anna")
 ```
 ## Contact us
 For any custom development please contact us at hello@cedille.ai.

 # How to run
 ## Loading the model
+### Base (requires 48+ GB of RAM)
 ```
 from transformers import AutoTokenizer, AutoModelForCausalLM
 tokenizer = AutoTokenizer.from_pretrained("Cedille/de-anna")
 model = AutoModelForCausalLM.from_pretrained("Cedille/de-anna")
 ```
+### Lower memory usage (loads on 16GB of RAM)
+GPT_J models (link) have a parameter to only be loaded once ...
+Combine that with half precision (fp16) ...
+TO DO
+## Generation
+TO DO
 ## Contact us
 For any custom development please contact us at hello@cedille.ai.