Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,7 @@ metrics:
|
|
14 |
# CELL-E 2
|
15 |
|
16 |
## Model description
|
17 |
-
[![CELL-E_2](images/architecture.png)](https://github.
|
18 |
|
19 |
CELL-E 2 is the second iteration of the original [CELL-E](https://www.biorxiv.org/content/10.1101/2022.05.27.493774v1) model which utilizes an amino acid sequence and nucleus image to make predictions of subcellular protein localization with respect to the nucleus.
|
20 |
|
@@ -31,7 +31,7 @@ We have two spaces available where you can run predictions on your own data!
|
|
31 |
|
32 |
- [Image Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Image_Prediction)
|
33 |
- [Sequence Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Sequence_Prediction)
|
34 |
-
|
35 |
## Model variations
|
36 |
|
37 |
We have made several versions of CELL-E 2 available. The naming scheme follows the structure ```training set_hidden size``` where the hidden size is set to the embedding dimension of the pretrained ESM-2 model.
|
@@ -68,6 +68,7 @@ These models were used the HPA models as checkpoints, but then were finetuned on
|
|
68 |
| [`HPA_1280`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_1280) | 10.8 GB | |
|
69 |
| [`HPA_2560`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_2560) | 17.5 GB | |
|
70 |
|
|
|
71 |
|
72 |
### How to use
|
73 |
|
@@ -85,12 +86,13 @@ model.sample(text=sequence, condition=nucleus)
|
|
85 |
### BibTeX entry and citation info
|
86 |
|
87 |
```bibtex
|
88 |
-
@
|
89 |
-
|
90 |
-
|
91 |
-
|
92 |
-
|
93 |
-
|
|
|
94 |
}
|
95 |
```
|
96 |
|
|
|
14 |
# CELL-E 2
|
15 |
|
16 |
## Model description
|
17 |
+
[![CELL-E_2](images/architecture.png)](https://bohuanglab.github.io/CELL-E_2/)
|
18 |
|
19 |
CELL-E 2 is the second iteration of the original [CELL-E](https://www.biorxiv.org/content/10.1101/2022.05.27.493774v1) model which utilizes an amino acid sequence and nucleus image to make predictions of subcellular protein localization with respect to the nucleus.
|
20 |
|
|
|
31 |
|
32 |
- [Image Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Image_Prediction)
|
33 |
- [Sequence Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Sequence_Prediction)
|
34 |
+
|
35 |
## Model variations
|
36 |
|
37 |
We have made several versions of CELL-E 2 available. The naming scheme follows the structure ```training set_hidden size``` where the hidden size is set to the embedding dimension of the pretrained ESM-2 model.
|
|
|
68 |
| [`HPA_1280`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_1280) | 10.8 GB | |
|
69 |
| [`HPA_2560`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_2560) | 17.5 GB | |
|
70 |
|
71 |
+
To reduce download size, we removed the ESM-2 model from the checkpoint. This should be downloaded the first time the code is run, but is otherwise something to be aware of if loading into other projects.
|
72 |
|
73 |
### How to use
|
74 |
|
|
|
86 |
### BibTeX entry and citation info
|
87 |
|
88 |
```bibtex
|
89 |
+
@inproceedings{
|
90 |
+
anonymous2023translating,
|
91 |
+
title={CELL-E 2: Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transformer},
|
92 |
+
author={Emaad Khwaja, Yun S. Song, Aaron Agarunov, and Bo Huang},
|
93 |
+
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
|
94 |
+
year={2023},
|
95 |
+
url={https://openreview.net/forum?id=YSMLVffl5u}
|
96 |
}
|
97 |
```
|
98 |
|