AmelieSchreiber committed
Commit bbe0e74 · 1 Parent(s): 76ae9f4
Update README.md

README.md CHANGED
@@ -16,7 +16,15 @@ These are the checkpoints for the first ever QLoRA for ESM-2! They haven't been
 You can load and use them similarly to the LoRA models. This is the smallest `esm2_t6_8M_UR50D` model, so the metrics aren't great.
 Scaling to larger models for better metrics is in progress. These checkpoints were trained using
 [the 600K dataset](https://huggingface.co/datasets/AmelieSchreiber/600K_data). To replicate the training of QLoRA for ESM-2 models,
-you can use the `conda-environment.yml` file.
+you can use the `conda-environment.yml` file. However, for the next week or two (28/09/2023) you will need to uninstall transformers
+and use this instead:
+
+```
+pip install --upgrade git+https://github.com/huggingface/transformers.git
+```
+
+Once the transformers library is updated, you should be able to simply use the latest version of transformers and gradient checkpointing
+will be fully enabled, and QLoRA compatibility should be fully integrated into ESM-2 models.
 
 ## QLoRA Info
 
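
Note on the change above: the new README text asks readers to replace the released transformers package with a build from the GitHub repository until a release catches up. A minimal sketch for checking which kind of build is present in an environment is given below. It assumes that builds installed from the transformers main branch report a version with a `.dev` segment (for example `4.34.0.dev0`); that convention is an assumption here, not something stated in the README. The helper names `is_dev_build` and `installed_transformers_version` are hypothetical and exist only for this illustration.

```python
from importlib.metadata import PackageNotFoundError, version


def is_dev_build(version_string: str) -> bool:
    """Return True if the version string contains a '.dev' segment.

    Assumption: source installs from the main branch carry such a
    segment (e.g. '4.34.0.dev0'), while tagged releases do not.
    """
    return ".dev" in version_string


def installed_transformers_version():
    """Return the installed transformers version string, or None if absent."""
    try:
        return version("transformers")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_transformers_version()
    if found is None:
        print("transformers is not installed")
    elif is_dev_build(found):
        print(f"transformers {found}: looks like a development build from source")
    else:
        print(f"transformers {found}: looks like a tagged release")
```

If the script reports a tagged release while the workaround is still needed, the `pip install --upgrade git+https://github.com/huggingface/transformers.git` command from the README is the step to run.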