AmelieSchreiber commited on
Commit
bbe0e74
1 Parent(s): 76ae9f4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -16,7 +16,15 @@ These are the checkpoints for the first ever QLoRA for ESM-2! They haven't been
16
  You can load and use them similarly to the LoRA models. This is the smallest `esm2_t6_8M_UR50D` model, so the metrics aren't great.
17
  Scaling to larger models for better metrics is in progress. These checkpoints were trained using
18
  [the 600K dataset](https://huggingface.co/datasets/AmelieSchreiber/600K_data). To replicate the training of QLoRA for ESM-2 models,
19
- you can use the `conda-environment.yml` file.
 
 
 
 
 
 
 
 
20
 
21
  ## QLoRA Info
22
 
 
16
  You can load and use them similarly to the LoRA models. This is the smallest `esm2_t6_8M_UR50D` model, so the metrics aren't great.
17
  Scaling to larger models for better metrics is in progress. These checkpoints were trained using
18
  [the 600K dataset](https://huggingface.co/datasets/AmelieSchreiber/600K_data). To replicate the training of QLoRA for ESM-2 models,
19
+ you can use the `conda-environment.yml` file. However, for the next week or two (28/09/2023) you will need to uninstall transformers
20
+ and use this instead:
21
+
22
+ ```
23
+ pip install --upgrade git+https://github.com/huggingface/transformers.git
24
+ ```
25
+
26
+ Once the transformers library is updated, you should be able to simply use the latest version of transformers and gradient checkpointing
27
+ will be fully enabled, and QLoRA compatibility should be fully integrated into ESM-2 models.
28
 
29
  ## QLoRA Info
30