Update README.md
README.md
CHANGED
@@ -7,8 +7,9 @@ tags:
 - chemistry
 - biology
 ---
-Chemlactica-125m is a continually pretrained galactica-125m model for organic molecules.
-110M+ molecules from PubChem as well as their chemical properties
+Chemlactica-125m is a continually pretrained [galactica-125m](https://huggingface.co/facebook/galactica-125m) model for organic molecules.
+It is pretrained on (soon-to-be-released) 40B tokens covering 110M+ molecules from PubChem as well as their chemical properties
+(molecular weight, synthetic accessibility score, drug-likeness etc.)
 and similarities (Tanimoto distance between ECFP fingerprints).
 
 Example prompts:
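For readers unfamiliar with the similarity measure named in the README text, here is a minimal sketch of Tanimoto similarity and distance between two fingerprints, each represented as a set of on-bit indices. The function names and the example bit sets are hypothetical and are not taken from the Chemlactica code. Real ECFP fingerprints would come from a cheminformatics toolkit, and the README does not say how the authors computed theirs.

```python
def tanimoto_similarity(bits_a, bits_b):
    """Tanimoto (Jaccard) similarity between two sets of on-bit indices."""
    a, b = set(bits_a), set(bits_b)
    union = a | b
    if not union:
        # Assumed convention: two empty fingerprints count as identical.
        return 1.0
    return len(a & b) / len(union)


def tanimoto_distance(bits_a, bits_b):
    """Tanimoto distance, defined here as 1 - similarity."""
    return 1.0 - tanimoto_similarity(bits_a, bits_b)


# Hypothetical on-bit indices for illustration only (not real ECFP output).
fp_a = {1, 5, 9, 42}
fp_b = {1, 5, 7, 42, 100}

# 3 shared bits out of 6 distinct bits overall.
print(tanimoto_similarity(fp_a, fp_b))  # 0.5
print(tanimoto_distance(fp_a, fp_b))    # 0.5
```

The README says "Tanimoto distance"; whether the training data stores the distance or the similarity (1 minus the distance) is not stated in this diff.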