Emaad commited on
Commit
9a729ca
1 Parent(s): 6337872

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -8
README.md CHANGED
@@ -14,7 +14,7 @@ metrics:
14
  # CELL-E 2
15
 
16
  ## Model description
17
- [![CELL-E_2](images/architecture.png)](https://github.com/BoHuangLab/CELL-E_2)
18
 
19
  CELL-E 2 is the second iteration of the original [CELL-E](https://www.biorxiv.org/content/10.1101/2022.05.27.493774v1) model which utilizes an amino acid sequence and nucleus image to make predictions of subcellular protein localization with respect to the nucleus.
20
 
@@ -31,7 +31,7 @@ We have two spaces available where you can run predictions on your own data!
31
 
32
  - [Image Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Image_Prediction)
33
  - [Sequence Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Sequence_Prediction)
34
-
35
  ## Model variations
36
 
37
  We have made several versions of CELL-E 2 available. The naming scheme follows the structure ```training set_hidden size``` where the hidden size is set to the embedding dimension of the pretrained ESM-2 model.
@@ -68,6 +68,7 @@ These models were used the HPA models as checkpoints, but then were finetuned on
68
  | [`HPA_1280`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_1280) | 10.8 GB | |
69
  | [`HPA_2560`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_2560) | 17.5 GB | |
70
 
 
71
 
72
  ### How to use
73
 
@@ -85,12 +86,13 @@ model.sample(text=sequence, condition=nucleus)
85
  ### BibTeX entry and citation info
86
 
87
  ```bibtex
88
- @article{,
89
- author = {Emaad Khwaja and
90
- Yun S Song and
91
- Aaron Agarunov and
92
- Bo Huang},
93
- title = {{CELL-E 2:} Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transforme},
 
94
  }
95
  ```
96
 
 
14
  # CELL-E 2
15
 
16
  ## Model description
17
+ [![CELL-E_2](images/architecture.png)](https://bohuanglab.github.io/CELL-E_2/)
18
 
19
  CELL-E 2 is the second iteration of the original [CELL-E](https://www.biorxiv.org/content/10.1101/2022.05.27.493774v1) model which utilizes an amino acid sequence and nucleus image to make predictions of subcellular protein localization with respect to the nucleus.
20
 
 
31
 
32
  - [Image Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Image_Prediction)
33
  - [Sequence Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Sequence_Prediction)
34
+
35
  ## Model variations
36
 
37
  We have made several versions of CELL-E 2 available. The naming scheme follows the structure ```training set_hidden size``` where the hidden size is set to the embedding dimension of the pretrained ESM-2 model.
 
68
  | [`HPA_1280`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_1280) | 10.8 GB | |
69
  | [`HPA_2560`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_2560) | 17.5 GB | |
70
 
71
+ To reduce download size, we removed the ESM-2 model from the checkpoint. This should be downloaded the first time the code is run, but is otherwise something to be aware of if loading into other projects.
72
 
73
  ### How to use
74
 
 
86
  ### BibTeX entry and citation info
87
 
88
  ```bibtex
89
+ @inproceedings{
90
+ anonymous2023translating,
91
+ title={CELL-E 2: Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transformer},
92
+ author={Emaad Khwaja, Yun S. Song, Aaron Agarunov, and Bo Huang},
93
+ booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
94
+ year={2023},
95
+ url={https://openreview.net/forum?id=YSMLVffl5u}
96
  }
97
  ```
98