update readme

Browse files

Files changed (8) hide show

README.md +9 -12
assets/images/architecture.png +0 -0
assets/images/homology_detection.png +0 -0
assets/images/reconstruction.png +0 -0
assets/images/structure_prediction.png +0 -0
assets/images/structure_tokenizer/launch_protein_viewer.png +0 -0
assets/images/structure_tokenizer/select_files.png +0 -0
assets/images/structure_tokenizer/visualize_reconstruction.png +0 -0

README.md CHANGED Viewed

@@ -14,12 +14,12 @@ AIDO.StructureTokenizer is a VQ-VAE-based tokenizer designed for protein structu
 ## Model Description
-TODO model figure
 **AIDO.StructureTokenizer** is built on a Vector Quantized Variational Autoencoder (VQ-VAE) architecture with the following components:
-- Equivariant Encoder: Encodes backbone structures into a latent space that maintains rotational and translational symmetries using the Equiformer architecture.
 - Discrete Codebook: Maps continuous latent vectors into 512 discrete structural tokens.
-- Invariant Decoder: Reconstructs full 3D structures, including side chains, from the structural tokens using an architecture adapted from ESMFold.
 This model strikes a balance between reconstruction fidelity and structural locality, optimizing its suitability for downstream tasks such as structure prediction, homology detection, and multimodal protein modeling.
@@ -30,17 +30,14 @@ This model strikes a balance between reconstruction fidelity and structural loca
 - Reconstructing Structures (See [below](#reconstructing-structures))
 - Structure Prediction (See [this section](https://huggingface.co/genbio-ai/AIDO.Protein2StructureToken-16B/blob/main/README.md#structure-prediction) in genbio-ai/AIDO.Protein2StructureToken-16B)
-### Hyperparameters
-TODO
-### Training details
-TODO
 ## Results
-TODO
 ## How to Use
 Please see `experiments/AIDO.StructureTokenizer` in [Model Generator](https://github.com/genbio-ai/modelgenerator) for more details.

 ## Model Description
+![Model Architecture](./assets/images/architecture.png)
 **AIDO.StructureTokenizer** is built on a Vector Quantized Variational Autoencoder (VQ-VAE) architecture with the following components:
+- Equivariant Encoder (6M): Encodes backbone structures into a latent space that maintains rotational and translational symmetries using the Equiformer architecture.
 - Discrete Codebook: Maps continuous latent vectors into 512 discrete structural tokens.
+- Invariant Decoder (300M): Reconstructs full 3D structures, including side chains, from the structural tokens using an architecture adapted from ESMFold.
 This model strikes a balance between reconstruction fidelity and structural locality, optimizing its suitability for downstream tasks such as structure prediction, homology detection, and multimodal protein modeling.
 - Reconstructing Structures (See [below](#reconstructing-structures))
 - Structure Prediction (See [this section](https://huggingface.co/genbio-ai/AIDO.Protein2StructureToken-16B/blob/main/README.md#structure-prediction) in genbio-ai/AIDO.Protein2StructureToken-16B)
 ## Results
+### Reconstructing Structures
+![Reconstruction Results](./assets/images/reconstruction.png)
+### Homology Detection
+![Homology Detection Results](./assets/images/homology_detection.png)
+### Structure Prediction
+![Structure Prediction Results](./assets/images/structure_prediction.png)
 ## How to Use
 Please see `experiments/AIDO.StructureTokenizer` in [Model Generator](https://github.com/genbio-ai/modelgenerator) for more details.

assets/images/architecture.png ADDED Viewed

assets/images/homology_detection.png ADDED Viewed

assets/images/reconstruction.png ADDED Viewed

assets/images/structure_prediction.png ADDED Viewed

assets/images/structure_tokenizer/launch_protein_viewer.png ADDED Viewed

assets/images/structure_tokenizer/select_files.png ADDED Viewed

assets/images/structure_tokenizer/visualize_reconstruction.png ADDED Viewed